Standardize predict interface using SAR standard by miguelgfierro · Pull Request #1039 · recommenders-team/recommenders

miguelgfierro · 2020-01-22T10:58:57Z

Description

for rating: predict
for ranking: recommend_top_k
also applied black

Related Issues

Checklist:

I have followed the contribution guidelines and code style for this project.
I have added tests covering my contributions.
I have updated the documentation accordingly.
This PR is being made to staging and not master.

review-notebook-app · 2020-01-22T10:59:03Z

Check out this pull request on

You'll be able to see Jupyter notebook diff and discuss changes. Powered by ReviewNB.

anargyri · 2020-01-22T11:12:43Z

reco_utils/recommender/surprise/surprise_utils.py



-def compute_ranking_predictions(
+def recommend_k_items(


This function does not recommend k items. It computes the predictions for all users and items.

ohh good catch, the cut of k is done in fact when the metric is computed https://github.com/microsoft/recommenders/blob/master/notebooks/02_model/surprise_svd_deep_dive.ipynb, the output is a massive matrix instead of n_users x k

after the meeting, we need to check whether the compute_ranking_pred method is used in another function that needs the full matrix, if not, we can optimize it and

for user in data[usercol].unique(): for item in data[itemcol].unique(): preds_lst.append(algo.predict(user, item).est) preds = sort(preds_lst) preds = preds[:k] preds_lst.append(preds)

also move this up:

if remove_seen: tempdf = pd.concat( [ data[[usercol, itemcol]], pd.DataFrame( data=np.ones(data.shape[0]), columns=["dummycol"], index=data.index ), ], axis=1, ) merged = pd.merge(tempdf, all_predictions, on=[usercol, itemcol], how="outer") return merged[merged["dummycol"].isnull()].drop("dummycol", axis=1) else: return all_predictions

so we remove the seen items before sorting

also check with @yueguoguo whether or not we are using the threshold cut #1041

if we don't want to work with the scores as a matrix like sar does we could use a heap

from heapq import heappush, heappushpop users = data[usercol].unique() items = data[itemcol].unique() preds_lst = np.zeros([len(users), k]) for user_idx, user in enumerate(users): heap = [] for item in items: score = algo.predict(user, item).est if len(heap) < k: heappush(heap, score) elif score > heap[k - 1]: heappushpop(heap, score); preds_lst[user_idx] = heap[::-1]

heapq also has an nlargest() method. Not sure which way would be faster (depends on which algorithm it uses).

true, but it wasn't clear to me if that method tries to sort the items (which would be unnecessary) or leverages the heap?

https://github.com/python/cpython/blob/61b3484cdf27ceca1c1069a351487d2db4b2b48c/Lib/heapq.py#L395
It looks similar to your for loop.

@anargyri

…FYI @anargyri

miguelgfierro · 2020-01-24T14:38:30Z

hey guys @anargyri @gramhagen, I reverted the interface of ranking in surprise to do the work on a different PR. The related issue is here: #1042.

Please let me know if you see anything else on this PR

miguelgfierro added 4 commits January 22, 2020 10:50

surprise recommend_k_items

a77d62f

surprise predict

ed5860a

cornac predict

877d001

cornac recommend_k_items

5d4d73e

miguelgfierro requested a review from anargyri January 22, 2020 10:58

miguelgfierro requested review from gramhagen and yueguoguo as code owners January 22, 2020 10:58

miguelgfierro mentioned this pull request Jan 22, 2020

RippleNet using Wikidata and Movielens #1014

Open

3 tasks

anargyri reviewed Jan 22, 2020

View reviewed changes

revert recommend_top_k in surprise to work with it on a different PR …

dc5fb03

…FYI @anargyri

miguelgfierro mentioned this pull request Jan 23, 2020

[FEATURE] Review surprise recommend items method #1042

Open

miguelgfierro self-assigned this Jan 24, 2020

miguelgfierro added 2 commits January 24, 2020 14:17

🐛

e339471

use recommender timer with context manager

181b8dc

bpr

ce22577

anargyri approved these changes Jan 24, 2020

View reviewed changes

miguelgfierro added 2 commits January 24, 2020 18:14

🐛

26d323e

🐛

3b1de1b

miguelgfierro merged commit 9f3dfdd into staging Jan 24, 2020

miguelgfierro deleted the miguel/predict_interface branch January 24, 2020 23:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Standardize predict interface using SAR standard#1039

Standardize predict interface using SAR standard#1039
miguelgfierro merged 10 commits intostagingfrom
miguel/predict_interface

miguelgfierro commented Jan 22, 2020

Uh oh!

review-notebook-app bot commented Jan 22, 2020

Uh oh!

anargyri Jan 22, 2020

Uh oh!

miguelgfierro Jan 22, 2020 •

edited

Loading

Uh oh!

miguelgfierro Jan 23, 2020

Uh oh!

miguelgfierro Jan 23, 2020

Uh oh!

miguelgfierro Jan 23, 2020

Uh oh!

gramhagen Jan 23, 2020

Uh oh!

anargyri Jan 23, 2020

Uh oh!

gramhagen Jan 23, 2020

Uh oh!

anargyri Jan 24, 2020

Uh oh!

miguelgfierro commented Jan 24, 2020 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants



		def compute_ranking_predictions(
		def recommend_k_items(

Conversation

miguelgfierro commented Jan 22, 2020

Description

Related Issues

Checklist:

Uh oh!

review-notebook-app bot commented Jan 22, 2020

Uh oh!

Choose a reason for hiding this comment

Uh oh!

miguelgfierro Jan 22, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

miguelgfierro commented Jan 24, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

miguelgfierro Jan 22, 2020 •

edited

Loading

miguelgfierro commented Jan 24, 2020 •

edited

Loading