ABSTRACT
In
recent times, the rate of growth in information available on the internet has
resulted in large amounts of data and an increase in online users. The Recommendation
System has been employed to empower users to make informed and accurate
decisions from the vast abundance of information. In this Research, we propose
a hybrid recommender engine which combines Content-Based and Collaborative
filtering recommendations. This seeks to explore how prediction accuracy can be
enhanced in existing collaborative filtering frameworks. We investigate to see
if a Recommendation System combining Content-based and Collaborative filtering,
using a Mahout Framework and built on Hadoop will improve recommendation
accuracy and also alleviate scalability issues currently experienced in
processing large volumes of data for recommending items to users. We employed
the Feature augmentation hybrid technique where the output from the
Content-based recommendation is used as an input to Collaborative filtering.
The well-known MovieLens data was matched with the Internet Movie Database
(IMDB) in order to extract user and item content features. The input files
generated from the integration of both databases was converted to text files
which serve as an input into the Collaborative filtering framework in Mahout.
By means of various experiments, the best parameter optimization for Mahout
Components was determined for our model. We further examined these models by
comparing the Root Mean Square Error of our model against the state of art
model. The proposed model showed significant improvement when compared with the
pure collaborative model. It was demonstrated from our analysis that the extracted
user and items content features can, in some cases, lead to a better prediction
accuracy. To be more precise, it was discovered that the user feature, gender,
has no marginal impact on our underlying model while an item feature like
Country is more beneficial than genre, contrary to findings in some other
research work.
CHAPTER
ONE
INTRODUCTION
1.1
BACKGROUND OF THE STUDY
The
rate at which information is growing on the internet has resulted in large
amounts of data and an increase in online users. This huge explosion of data
has flooded users with large volumes of information and hence poses a great
challenge in terms of information overload. Resultantly, this has made it very
difficult for human beings to process such information manually and quite
difficult for them to find the right information. The ability to make informed
and accurate decisions from the sheer abundance of information by users often
creates immense confusion. . Large internet companies like Amazon, Google, and
Facebook have been faced with a difficulty in managing this explosion of
information. Recommendation systems have been employed in order to transform
this problem in a smart way. Figure 1.1 shows how recommender engines have
stepped in this regard to rescue users from such confusion. The
vast increase in online data and users led to the rise of big data. The Big
Data world has paid the most attention to the Recommendation System. Big Data
has improved the capacity to do recommendations on a large scale. It has made
the Recommendation System more important for the users as it predicts right
piece of information out of vast amounts of information. The system is a
particular form of information filtering that exploits users past behaviors or
by the behavior of similar users to generate a list of information items that
is personally tailored to an end user's preferences. At present, in E-commerce,
Recommendation Systems (RSs) are broadly used for information filtering
processes to deliver personalized information by predicting user’s preferences
to particular items [1]. RSs attempt to suggest items (Movies, music, books,
news, web pages, etc.) that are most likely to interest the users. Amazon,
Netflix and other such portals use RSs extensively for suggesting content to
their users. RSs aim to alleviate information overload problems by presenting
the most attractive and relevant content. RSs have become a basic need of every
e-commerce portal.
Department: Computer Science (M.Sc Thesis)
Format: MS Word
Chapters: 1 - 5, Preliminary Pages, Abstract, References, Appendix.
No. of Pages: 68
NB: The Complete Thesis is well written and ready to use.
Price: 20,000 NGN
In Stock
Our Customers are Happy!!!
No comments:
Post a Comment
Add Comment