bug corrections #4
base: main
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -33,7 +33,7 @@ | |
| "interactions_dataset_path = '../data/Beauty/Beauty_5.json'\n", | ||
| "metadata_path = '../data/Beauty/metadata.json'\n", | ||
| "\n", | ||
| - "interactions_output_path = '../data/Beauty/inter_new.json'\n", | ||
| + "interactions_output_path = '../data/Beauty/inter.json'\n", | ||
| "embeddings_output_path = '../data/Beauty/content_embeddings.pkl'" | ||
| ] | ||
| }, | ||
|
|
@@ -287,7 +287,7 @@ | |
| "metadata": {}, | ||
| "outputs": [], | ||
| "source": [ | ||
| - "data = get_data(pl.from_pandas(df), item_ids_mapping)" | ||
| + "data = get_data(pl.from_pandas(df), item_ids_mapping_df)" | ||
|
Author
I ran it, and it complained that you were trying to work with a dict instead of a df.
||
| ] | ||
|
Author
A second interesting point: I could not build the embeddings with this tokenizer at all, I hit an error. Maybe that is because I ran it from DataSphere and a VPN is needed here (the files do seem to be there, they are just very heavy).
||
| }, | ||
| { | ||
|
|
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -79,6 +79,10 @@ def train(self): | |
| LOGGER.debug('Start training...') | ||
|
|
||
| while (step_num < 200_000): | ||
| + if self._epoch_cnt is not None and epoch_num >= self._epoch_cnt: | ||
| + LOGGER.debug( | ||
| + 'Reached the maximum number of epochs ({}). Finish training'.format(self._epoch_cnt)) | ||
| + break | ||
|
Author
Without this, passing max_epoch_cnt through the config did not work.
||
| if best_epoch + self._epochs_threshold < epoch_num: | ||
| LOGGER.debug( | ||
| 'There is no progress during {} epochs. Finish training'.format(self._epochs_threshold)) | ||
|
|
||
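The two stopping conditions in the `train` hunk above can be read in isolation with a small simulation. This is only a sketch under stated assumptions, not the project's `train` method: `run_training`, `epoch_scores`, `steps_per_epoch`, and the returned reason strings are hypothetical names introduced for illustration. Only the epoch-cap check, the no-progress check, and the `200_000` step limit mirror the diff.

```python
import logging

LOGGER = logging.getLogger(__name__)


def run_training(epoch_scores, epoch_cnt=None, epochs_threshold=3,
                 steps_per_epoch=100, max_steps=200_000):
    """Simulate the loop and return (epochs_run, stop_reason).

    epoch_scores is a hypothetical list of validation scores, one per epoch;
    the last score is reused if the loop runs past the end of the list.
    """
    step_num = 0
    epoch_num = 0
    best_epoch = 0
    best_score = float("-inf")
    reason = "max_steps"

    while step_num < max_steps:
        # Epoch cap: only active when a limit is configured.
        if epoch_cnt is not None and epoch_num >= epoch_cnt:
            LOGGER.debug("Reached the maximum number of epochs (%s).", epoch_cnt)
            reason = "max_epochs"
            break

        # Early stopping: no improvement for more than epochs_threshold epochs.
        if best_epoch + epochs_threshold < epoch_num:
            LOGGER.debug("There is no progress during %s epochs.", epochs_threshold)
            reason = "no_progress"
            break

        # Stand-in for one real training epoch plus validation.
        score = epoch_scores[min(epoch_num, len(epoch_scores) - 1)]
        if score > best_score:
            best_score = score
            best_epoch = epoch_num

        step_num += steps_per_epoch
        epoch_num += 1

    return epoch_num, reason
```

With `epoch_cnt=None` the first check is skipped entirely, so the loop is bounded only by the no-progress rule and the step limit. That matches the comment above: the configured value has an effect only once the check exists.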
This is a strange one.
First, even at https://zenodo.org/records/17351848 there are both inter and inter_new.
Second, they differ (I looked at the diff), so there may have been some reason for it.
minor: it was impossible to download anything from https://zenodo.org/records/17351848 without a VPN (it was very slow).
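One way to quantify how the two interaction files differ before deciding which one the notebook should write is a line-level set comparison. The sketch below assumes both files are line-oriented text (for example one record per line), which the thread does not confirm. `read_lines` and `compare_line_sets` are hypothetical helper names, and the paths in the usage block are taken from the notebook diff.

```python
from pathlib import Path


def read_lines(path):
    """Return the set of non-empty, stripped lines in a text file."""
    with Path(path).open(encoding="utf-8") as handle:
        return {line.strip() for line in handle if line.strip()}


def compare_line_sets(path_a, path_b):
    """Count lines unique to each file and lines present in both."""
    lines_a = read_lines(path_a)
    lines_b = read_lines(path_b)
    return {
        "only_in_a": len(lines_a - lines_b),
        "only_in_b": len(lines_b - lines_a),
        "in_both": len(lines_a & lines_b),
    }


if __name__ == "__main__":
    # Paths as they appear in the notebook diff; adjust to the local layout.
    summary = compare_line_sets("../data/Beauty/inter.json",
                                "../data/Beauty/inter_new.json")
    print(summary)
```

Because it compares sets, this ignores line order and duplicate lines. If the files differ only in ordering it reports no unique lines, which is itself useful to know.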