Friday, November 2, 2012

Matching Recommendation Technologies and Domains - Part 2

 
 
http://danceswithfat.files.wordpress.com/2011/05/choices-sign.jpg
 
 
Matching Domain Characteristics with Knowledge Sources
 
The choice of domain and the characteristics of the application place certain constraints on the kinds of knowledge sources that a recommender system may deploy. In turn, the availability and quality of knowledge sources influences what recommendation technologies a recommender can profitably use.

 
 
Effect @Social Knowledge
 
In heterogenous domains, social knowledge should be considered as a knowledge source since it is gathered by user’s input and does not need extensive knowledge engineering.
 
However, social knowledge is not sufficiently accurate and reliable for high risk domains or for domains which need explanation.
 
Social knowledge will tend to be sparse for high churn domains.
 
Using social knowledge is appropriate for domains with implicit type of interaction since it is possible to mine the users’ behavior using machine learning and statistical techniques which are the typical algorithms in collaborative filtering.
 
In domains with unstable user preference, the social knowledge can be misleading since the historical data is unreliable.
 
Effect @Individual Knowledge

In heterogenous domains, it might be difficult to transfer user’s input on certain items for recommending other items. For example, it is not certain that two users who have similar taste about movies, would also like similar music.
 
* A domain that requires knowledge of the user’s short-term requirements is most likely suited to some kind of knowledge-based recommendation.Constraints and preferences allow the user to limit and to rank options. For example, a dog owner might have a strict constraint that any apartment he rents accept his pet. A parent with young children might have a preference to be close to parks and playgrounds.
 
In high risk domains and domains which need explanation, it is usually necessary to have explicit requirements and constraints from the user. Similarly, user requirements are more likely to be needed in domains with unstable preferences since the historical data are unreliable.
 
Effect @Content Knowledge
 
The most basic kind of content knowledge is item features. These features can typically be used as is in a recommender system, although implementers will often want to restrict the feature space. If items are represented by unstructured documents such as news stories, the implementer will need to draw from information extraction (IE) techniques to extract and select features for use in recommendation. Features can be reduced further by applying more sophisticated feature selection techniques.Content knowledge in multimedia format presents an additional challenge.
 
The quality of recommendations produced by a content-based or knowledgebased recommender will be entirely dependent on the quality of the content data on which its decisions are based. Indeed, the lack of reliable item features is often cited as a motivating factor for avoiding content-based recommendation. The cost involved in creating and maintaining a database of useful item features should not be underestimated, particularly for heterogeneous domains.
 
New technical innovations arrive regularly, requiring that the schema and the individual entries for each item be updated. If there are a large number of not-entirely-independent features extracted in a variety of ways, the system may be tolerant of noisy feature data. On the other hand, applications with high risk will need to pay special attention to having clean item features.
 
Effect @Domain Knowledge (sub-content)
 
A knowledge-based recommender will typically need to know more than just what features are associated with what items. The most basic form of domain knowledge that a recommender can employ is an ontology over the item features. Such an ontology allows the system to reason about the relationship between features at a level deeper than just raw equality or difference.

Many high risk choices have constraints imposed by the domain that a recommender needs to obey.

The recommendation problem can be in some cases formulated entirely as constraint satisfaction with constraints being contributed both by the user and by the system.

A final category of domain knowledge is means-ends knowledge, which is the knowledge that enables a system to map between the user’s goals (ends) and the products that might satisfy them (means).

Part of the reason that users benefit from recommender systems is that they can make good choices without necessarily being conversant with all of the complexities of the product space.

Mapping Domains to Technologies



@Collaborative

some domain types for which social knowledge seems not very useful, in particular, high risk domains and ones with high churn.

In high churn domains, there may not be enough time for an item to build up a reputation among a large number of peer users before it is replaced with other items.

When there is large risk associated with a domain, most users are going to need a more convincing explanation of the appropriateness of a recommendation beyond simply that others liked it. This is particularly important if we consider the problem of robustness in collaborative systems.

@Knowledge-based

Similarly, if we look at the interaction, we can see that it is not always possible to gather every kind of knowledge type from every type of interaction.In systems with implicit inputs, we do not gather any kind of direct requirements from the user.

Preference instability favors knowledge-based techniques. Learning over a user’s prior interactions may turn out to be a hinderance rather than a help. However, in certain cases, such as web personalization, users may provide enough implicit data in a single session to form a useful profile that can be compared to others.
 @General Tips

 In cases where the criteria do not help to reach a definitive conclusion, it is worth noting that the different technologies do have different implementation and maintenance costs.

Collaborative recommendation is likely to be the least expensive to implement. It requires a database of user ratings, but it does not require clean, wellengineered item features, which is the minimum requirement for the other recommendation technologies.

Knowledge-based technologies are going to be the most expensive approach requiring knowledge engineering and continuing maintenance. So, a developer might wish to start by implementing the least expensive solution compatible with the domain.

Another factor to consider is that with hybrid recommendation it is possible to combine techniques. For example, to deal with a heterogeneous environment with unstable preferences, a hybrid between content-based and collaborative recommendation may be desirable.

 Sample Recommendation Domains

Table 3 illustrates the application of these criteria in 10 different domains where recommendation applications exist. Not all combinations of the six criteria are represented, but we can see that the considerations given above are fairly predictive.



High-risk domains generally lead to knowledge-based recommendation; scrutability is also a good predictor of this. Heterogeneous domains are handled largely with collaborative recommendation.

Web page recommendation looks a bit contradictory when we consider high churn and preference instability, which would seem to militate against collaborative methods. However, as discussed above, database size can compensate for preference instability and these recommenders do collect large amounts of implicit preference data in each session. Also, heterogeneity is high, which argues in favor of using social knowledge.

Conclusion

This chapter considers recommender systems as intelligent systems, and as such, dependent on knowledge. The differences between recommendation approaches can be best understood through reference to the different knowledge sources that they employ. By considering how domain characteristics impact the availability and quality of knowledge sources, we can connect recommendation technologies and domain characteristics.

We have examined 6 different factors: heterogeneity, risk, churn, preference stability, interaction style, and scrutability, and considered their impact on the knowledge sources available for recommendation. From this analysis, we derive constraints on what recommendation technologies will be most appropriate for domains according to their characteristics. Application of these criteria to some existing systems shows that they do a reasonably good job of predicting what technologies have been successfully employed both in research and applications.
 

Thursday, October 25, 2012

Matching Recommendation Technologies and Domains - Part 1

 
http://news-accounts.com/wp-content/uploads/2011/10/matching-puzzle-pieces-sky.jpg
 
This paper is written by Burke and Ramezani, they introduce a way of mapping between domain characteristics and recommendation technologies, which is very interesting point to talk about.

Paper Info.:
Burke, R. & Ramezani, M. Matching Recommendation Technologies and Domains. In Kantor, P., Ricci, F., Rokach, L. & Shapira, B. eds. Handbook of Recommender Systems. Springer, 2010. (PDF
 
Paper Abstract:
Recommender systems form an extremely diverse body of technologies and approaches. The chapter aims to assist researchers and developers identify the recommendation technology that are most likely to be applicable to different domains of recommendation. Unlike other taxonomies of recommender systems, our approach is centered on the question of knowledge: what knowledge does a recommender system need in order to function, and where does that knowledge come from? Different recommendation domains (books vs condominiums, for example) provide different opportunities for the gathering and application of knowledge. These considerations give rise to a mapping between domain characteristics and recommendation technologies.
 
What is a Recommender system?
 
A recommender system is defined by a particular kind of semantics of interaction with the user: “any system that produces individualized recommendations as output or has the effect of guiding the user in a personalized way to interesting or useful objects in a large space of possible options”. This expansive definition makes the scope of recommender systems research quite broad, but it fails to give much guidance to the implementer.
 
Related Work (Recommender Systems Taxonomies)
 
The Author introduced 5 papers, which offered different taxonomies.
  1. R. Burke, “Hybrid recommender systems: Survey and experiments,” User Modeling and User- Adapted Interaction, vol. 12, no. 4, pp. 331–370, 2002.
  2. Contribution: Five recommendation techniques, Advantages and Disadvantages per each, and hybrid methods.
     
  3. P. Resnick and H. R. Varian, “Recommender systems,” Commun. ACM, vol. 40, no. 3, pp. 56– 58, 1997. 
  4. Contribution: Identifies 5 dimensions, which characterize properties of users' interactions with the recommender and the aggregation methods of users' evaluations (ratings).
     
  5. J. B. Schafer, J. A. Konstan, and J. Riedl, “E-commerce recommendation applications,” Data Mining and Knowledge Discovery, vol. 5, no. 1-2, pp. 115–153, 2001.
  6. Contribution: Taxonomy of collaboratice e-commerce recommender applications into 3 categories: functional I/O, the recommendation method, and other design issues.
     
  7. M. Montaner, B. L´opez, and J. L. D. L. Rosa, “A taxonomy of recommender agents on the Internet,” Artif. Intell. Rev., vol. 19, no. 4, pp. 285–330, 2003.
  8. Contribution: using two main criteria: user profile generation and maintenance, and user profile exploitation (Common patterns are extracted by an analysis of systems in the same domain).
     
  9. G. Adomavicius and A. Tuzhilin, “Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions,” IEEE Transactions on Knowledge and Data Engineering, vol. 17, no. 6, pp. 734–749, 2005.
  10. Contribution: Comparing between 3 main categories (content, collaborative, and hybrid.. omitting knowledge-based and classifies recommenders in each category into either heuristics-based or model-based).

Paper Contribution
AI-centric approach, focused on the knowledge sources required for recommendation and the constraints related to them. The chapter discusses the applicability of different recommendation techniques to different types of problems and aims to guide decision making in choosing among these techniques. As such, it might be considered to serve as a sort of recommender for recommender system implementers.

Knowledge Sources

 
  • Social: Knowledge about the larger community of users other than the target user.
  • Individual: Knowledge about the target user.
  • Content: Knowledge about the items being recommended and, more generally, about their uses.

The Recommender System use the knowledge sources based on the type of problem it aimed to solve.

Recommendation types vs Knowledge Sources

 
  • Collaborative recommendation matches an individual knowledge source with a social knowledge source of the same type and extrapolates the target user’s preferences from his or her peers. Usually, in collaborative recommendation, individual requirements are not used, or applied very simply as filters.

  • Content-based recommendation on the other hand is individually-focused, using item features and user opinions to learn a classifier that can predict user preferences on new items.

  • Knowledge-based recommendation is more of a catch-all category in which the recommender applies any kind of domain knowledge more substantive than item features.

Domain Characteristics

A domain of recommendation is the set of items that the recommender will operate over, but may also include the set of aims or purposes that the recommender is intended to support. These characteristics of the domain affect the availability and utility of different knowledge sources.

Six important characteristics of the domain that an implementer should consider:

1. Heterogeneity: A heterogeneous item space encompasses many items with different characteristics and most importantly, different goals they can satisfy.

2. Risk: Recommendation domains can be distinguished by the degree of risk that a user incurs in accepting a recommendation.

3. Churn: Recommender systems are used in domains with long-lived items like books, but they are also used in domains where the value or relevance of an item has a very short time span, such as news stories. A high churn domain is one in which items come and go rapidly.

4. Interaction Style: In systems in which the user makes no special effort to interact with the recommender system, the system extracts the implicit expressed preferences from user behaviour.

5. Preference Stability: User preferences can also have varying degrees of duration.

6. Scrutability: Certain applications (for example, high-risk ones) may require that the system be able to explain its recommendations, to answer questions like “why was this item Matching Recommendation Technologies and Domains recommended?” Such explanations enhance user confidence that a recommendation is appropriate and increase the liklihood of recommendations being accepted.
 

Wednesday, October 24, 2012

Hybrid Recommender Systems

 
http://ars.els-cdn.com/content/image/1-s2.0-S016412120800099X-gr1.jpg
 
 
 
This is a Review for a paper written by Robin Burke, I'll try to summarize it briefly and mention the most important points on this paper.
 
Paper Info.:
Burke, R. Hybrid Recommender Systems: Survey and Experiments. User Modeling and User-Adapted Interaction. 12(4), pages 331-370. [PDF]
 
Paper Abstract:
Recommender systems represent user preferences for the purpose of suggesting items to purchase or examine. They have become fundamental applications in electronic commerce and information access, providing suggestions that effectively prune large information spaces so that users are directed toward those items that best meet their needs and preferences. A variety of techniques have been proposed for performing recommendation, including content-based, collaborative, knowledge-based and other techniques. To improve performance, these methods have sometimes been combined in hybrid recommenders. This paper surveys the landscape of actual and possible hybrid recommenders, and introduces a novel hybrid, EntreeC, a system that combines knowledge-based recommendation and collaborative filtering to recommend restaurants. Further, we show that semantic ratings obtained from the knowledge-based part of the system enhance the effectiveness of collaborative filtering.
 
Recommender System Definition Past vs. Present
 
Recommender systems were originally defined as “people provide recommendations as inputs, which the system then aggregates and directs to appropriate recipients”, like using experts knowledge as input for the system to enrich its ability to recommend to people according to the given knowledge.
The term now has a broader connotation, describing any system that produces individualized recommendations as output or has the effect of guiding the user in a personalized way to interesting or useful objects in a large space of possible options.
 
Recommender System vs. Search Engines (Information Retrieval)
 
It is the criteria of “individualized” and “interesting and useful” that separate the recommender system from information retrieval systems or search engines. The semantics of a search engine are “matching”: the system is supposed to return all those items that match the query ranked by degree of match. Unlike recommender system it aims for personalizing the content based on assumptions (Recommender system approaches) which find something similar to user's taste and desire.
 
Recommender System Hybridization
 
All of the known recommendation techniques have strengths and weaknesses, and many researchers have chosen to combine techniques in different ways.
 
Recommendation Techniques
 
 
Comparing Recommendation Techniques 
 
 
Hybrid Recommender Systems Methods
 
 
Possible Recommendation Hybrids
 
The author explains his "EntreeC" experiment by combining Knowledge-based and Collaborative Filtering Approach using Cascade hybrid method and also explained how did he convert his semantic ratings into numeric.
 
The author presented an important information about the Order of Implicit Ratings strength (from Nichols paper,1997).
 
 
 
Hybrid or No Hybrid Choice
 
While the space remains to be fully explored, research has provided some insight into the question of which hybrid to employ in particular situations. The hybridization strategy must be a function of the characteristics of the recommenders being combined.
 
With demographic, content and collaborative recommenders, this is largely a function of the quality and quantity of data available for learning. With knowledge-based recommenders, it is a function of the available knowledge base.
 
We can distinguish two cases: the uniform case, in which one recommender has better accuracy than another over the whole space of recommendation, and the non-uniform case, in which the two recommenders have different strengths in different parts of the space.
 
If the recommenders are uniformly unequal, it may make sense to employ a hybrid in which the inaccuracies of the weaker recommender can be contained: for example, a cascade scheme with the stronger recommender given higher priority, an augmentation hybrid in which the weaker recommender acts as a “bot” contributing a small amount of information, or a meta-level combination in which the stronger technique produces a dense representation that strengthens the performance of the weaker one.
 
In the non-uniform case, the system will need to be able to employ both recommenders at different times. A switching hybrid is a natural choice here, but it requires that the system be able to detect when one recommender should be preferred. Feature combination and mixed hybrids can be used to allow output from both recommenders without having to implement a switching criterion. More research is needed to establish the tradeoffs between these hybridization options.