Naturally, one would say that users B and D have similar opinions, as do users C and D. But what about B and C? One might be tempted to say that the opinions of users A and B exclude each other, but this can't be known for sure, because there might be some game with different phases, one with all players playing simultaneously and one with a changing player order. Stating that the opinions of A and B are equal because both state that there is some playing order would also be wrong, because here the feature Player_order is merely for grouping purposes and has no meaning of its own.

== Conclusion ==

I looked into the !SkipTrax and the Ludopinions ontology and only found two cases:

* a feature with sub-features serves grouping purposes only, and it would have no meaning to state something about this feature itself
* the sub-features of a feature are specialized cases of the super-feature, so the super-feature should have at least the maximum applicability value of all of its sub-features

== Suggestion ==

=== Comparing features ===

Define a similarity metric that compares two features x and y of the same (direct) type. Until now, the value of a feature is only its applicability value.
{{{
sim(x,y) = 1 - dist(x,y)/2
}}}
The distance between two features x and y is defined as follows:
{{{
dist(x,y) = |x-y|
}}}
This means that two features with the same applicability have distance 0 and thus similarity 1 - 0/2 = 1. Two features with applicability -1 and 1 would have distance 2 and thus similarity 1 - 2/2 = 0. (TODO: prove the properties of a metric)
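The two definitions above can be sketched in Python (a hypothetical helper, assuming applicability values lie in [-1;+1]):

```python
# Sketch of the feature similarity metric defined above,
# assuming applicability values lie in [-1;+1].

def dist(x, y):
    # Distance between two applicability values.
    return abs(x - y)

def sim(x, y):
    # Similarity in [0;1] derived from the distance.
    return 1 - dist(x, y) / 2

print(sim(1, 1))   # identical applicability -> 1.0
print(sim(-1, 1))  # opposite applicability -> 0.0
```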

=== Comparing items ===

The similarity of two items is defined through the similarity of their features. The outline of a potential algorithm looks like this:

* The similarity of two items is the arithmetic mean of the similarities of all features
* Features that are annotated in neither item are ignored
* If a feature type is annotated in only one item, feature values need to be inferred until they can be compared
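The outline above could be sketched as follows. The item representation (a dict from feature name to applicability value) and the function names are assumptions, and the inference step for one-sidedly annotated features is left out:

```python
# Sketch of the item comparison outline; items are assumed to be
# dicts mapping feature names to applicability values, with
# unannotated features simply absent.

def sim(x, y):
    return 1 - abs(x - y) / 2

def item_similarity(a, b):
    # Arithmetic mean of feature similarities over features
    # annotated in both items. Inferring values for features
    # annotated in only one item is not handled here.
    shared = set(a) & set(b)
    if not shared:
        return None
    return sum(sim(a[f], b[f]) for f in shared) / len(shared)

# First example below: only A is annotated, with opposite values.
print(item_similarity({"A": 1}, {"A": -1}))                  # -> 0.0
# Second example: A agrees, B disagrees, C is ignored.
print(item_similarity({"A": 1, "B": -1}, {"A": 1, "B": 1}))  # -> 0.5
```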

Example:

Let there be a simple feature hierarchy as follows:
{{{
  A
 / \
B   C
}}}

Example similarities would be ("-" means: not annotated):
{{{
   1           -1
  / \    ;     / \    => similarity = 0
 -   -        -   -
}}}
{{{
   1            1
  / \    ;     / \    => similarity = (1 + 0) / 2 = 0.5
-1   -        1   -
}}}

Some non-trivial cases:
{{{
   -          -1                            -            -
  / \    ;    / \    => similarity = ?     / \    ;     / \    => similarity = ?
-1   -       -   -                       +1   -        -  -1
}}}

Suggestion: propagate possible values as intervals up or down the hierarchy.

We extend the distance metric to intervals, where x1 and x2 denote the interval bounds of x = [x1;x2] (if x1 = x2, we just write [x1], which is equal to the value x1):
{{{
dist(x,y) = (|x1-y1| + |x2-y2|) / 2
}}}
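A sketch of the extended metric, representing an interval as a (lo, hi) pair, with a plain value v written as the degenerate interval (v, v):

```python
# Sketch of the interval distance defined above; a plain value v
# corresponds to the degenerate interval (v, v).

def dist(x, y):
    (x1, x2), (y1, y2) = x, y
    return (abs(x1 - y1) + abs(x2 - y2)) / 2

def sim(x, y):
    return 1 - dist(x, y) / 2

# Degenerate intervals reduce to the plain metric:
print(sim((1, 1), (-1, -1)))   # -> 0.0
# The interval [-1;+1] against the value -1:
print(sim((-1, 1), (-1, -1)))  # -> 0.5
```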

The above example can then be compared:
{{{
   -          -1            [-1;+1]        -1
  / \    ;    / \    =>       / \     ;    / \    => similarity = (sim(-1,[-1]) + sim([-1;+1],-1)) / 2 = (1 + 0.5) / 2 = 0.75
-1   -       -   -         -1     -     [-1]  -
}}}
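The whole comparison could be sketched as follows. The propagation rules are assumptions derived from the conclusion above: an unannotated super-feature gets the interval [max of its sub-features; +1], and an unannotated sub-feature gets [-1; super-feature value]:

```python
# Sketch of the worked example above, with hypothetical interval
# propagation rules (assumptions, not a fixed design).

def dist(x, y):
    (x1, x2), (y1, y2) = x, y
    return (abs(x1 - y1) + abs(x2 - y2)) / 2

def sim(x, y):
    return 1 - dist(x, y) / 2

def propagate_up(sub_values):
    # Unannotated super-feature: at least the max of its sub-features.
    return (max(sub_values), 1)

def propagate_down(super_value):
    # Unannotated sub-feature: at most the super-feature's value.
    return (-1, super_value)

# Left item: B = -1, A unannotated -> A = [-1;+1]
left_A = propagate_up([-1])   # (-1, 1)
left_B = (-1, -1)
# Right item: A = -1, B unannotated -> B = [-1] (i.e. (-1, -1))
right_A = (-1, -1)
right_B = propagate_down(-1)  # (-1, -1)

similarity = (sim(left_B, right_B) + sim(left_A, right_A)) / 2
print(similarity)  # -> 0.75
```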