feat: add support for creating a Matrix Factorization model #1330

rey-esp · 2025-01-28T14:47:14Z

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
Ensure the tests and linter pass
Code coverage does not decrease (if any source code was changed)
Appropriate docs were updated (if necessary)

Fixes #<issue_number_goes_here> 🦕

bigframes/ml/decomposition.py

Co-authored-by: Tim Sweña (Swast) <swast@google.com>

third_party/bigframes_vendored/sklearn/decomposition/_mf.py

sycai

Does it make sense to add some system/unit tests for this change?

…is/python-bigquery-dataframes into b338873783-matrix-factorization

tswast · 2025-02-06T20:07:33Z

bigframes/ml/decomposition.py

+        user_col: str,
+        item_col: str,
+        rating_col: str = "rating",


@GarrettWu @shuoweil I see in #1282 you ended up passing in "id_col" as a separate argument to fit() instead of the class constructor. Is this a pattern you would recommend here?

Note: MatrixFactorization differs somewhat from that application in that normally in scikit-learn one would have a "sparse matrix" data type (e.g. https://docs.scipy.org/doc/scipy/reference/sparse.html) where rows/cols/values would all be bundled up in one object, similar to how we are using the bigframes DataFrame for this purpose.

rey-esp added 3 commits January 27, 2025 20:16

feat: add support for creating a Matrix Factorization model

1d39560

feat: add support for creating a Matrix Factorization model

e19c262

feat: add support for creating a Matrix Factorization model

1bef4a2

product-auto-label bot added size: l Pull request size is large. api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. labels Jan 28, 2025

Merge branch 'main' into b338873783-matrix-factorization

d157cd7

tswast reviewed Jan 28, 2025

View reviewed changes

rey-esp and others added 7 commits January 28, 2025 11:10

Update bigframes/ml/decomposition.py

e336bde

Co-authored-by: Tim Sweña (Swast) <swast@google.com>

Update bigframes/ml/decomposition.py

d5f713a

Co-authored-by: Tim Sweña (Swast) <swast@google.com>

Update bigframes/ml/decomposition.py

5e3e443

Co-authored-by: Tim Sweña (Swast) <swast@google.com>

Merge branch 'main' into b338873783-matrix-factorization

34a60bc

rating_col

c116e8a

(nearly) complete class

dedef39

Merge branch 'main' into b338873783-matrix-factorization

e5165a9

rey-esp force-pushed the b338873783-matrix-factorization branch from 5f4f9d3 to e5165a9 Compare January 28, 2025 21:18

product-auto-label bot added size: m Pull request size is medium. and removed size: l Pull request size is large. labels Jan 28, 2025

rey-esp added 5 commits January 28, 2025 15:31

Merge branch 'main' into b338873783-matrix-factorization

05eb854

removem print()

2787178

removem print()

8c66e07

adding recommend

086b4dd

Merge branch 'main' into b338873783-matrix-factorization

8ed3ccd

tswast reviewed Jan 29, 2025

View reviewed changes

third_party/bigframes_vendored/sklearn/decomposition/_mf.py Outdated Show resolved Hide resolved

third_party/bigframes_vendored/sklearn/decomposition/_mf.py Outdated Show resolved Hide resolved

third_party/bigframes_vendored/sklearn/decomposition/_mf.py Outdated Show resolved Hide resolved

tswast marked this pull request as ready for review January 29, 2025 17:25

tswast requested review from a team as code owners January 29, 2025 17:25

tswast requested a review from sycai January 29, 2025 17:25

blunderbuss-gcf bot assigned jiaxunwu Jan 29, 2025

rey-esp added 2 commits January 29, 2025 17:23

Merge branch 'main' into b338873783-matrix-factorization

1b4eef9

remove hyper parameter runing references

7c371ac

rey-esp added 2 commits January 30, 2025 08:35

Merge branch 'main' into b338873783-matrix-factorization

7498c8c

Merge branch 'main' into b338873783-matrix-factorization

55ef06a

sycai requested changes Jan 30, 2025

View reviewed changes

rey-esp added 9 commits February 4, 2025 15:13

Merge branch 'main' into b338873783-matrix-factorization

29805b5

swap predict in _mf for recommend

8de384a

recommend -> predict

647532b

update predict doc string

b340c4f

Merge branch 'main' into b338873783-matrix-factorization

580de41

Merge branch 'main' into b338873783-matrix-factorization

29ee357

Merge branch 'main' into b338873783-matrix-factorization

bac2ece

Merge branch 'b338873783-matrix-factorization' of github.com:googleap…

3f22c23

…is/python-bigquery-dataframes into b338873783-matrix-factorization

Merge branch 'main' into b338873783-matrix-factorization

213f11d

tswast reviewed Feb 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add support for creating a Matrix Factorization model #1330

feat: add support for creating a Matrix Factorization model #1330

rey-esp commented Jan 28, 2025

sycai left a comment

tswast Feb 6, 2025

feat: add support for creating a Matrix Factorization model #1330

Are you sure you want to change the base?

feat: add support for creating a Matrix Factorization model #1330

Conversation

rey-esp commented Jan 28, 2025

sycai left a comment

Choose a reason for hiding this comment

tswast Feb 6, 2025

Choose a reason for hiding this comment