Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(#137): mcw step #140

Merged
merged 9 commits into from
Oct 15, 2024
Merged

feat(#137): mcw step #140

merged 9 commits into from
Oct 15, 2024

Conversation

h1alexbel
Copy link
Owner

@h1alexbel h1alexbel commented Oct 15, 2024

In this pull I've implemented mcw step for collecting most common words in each README.

closes #137
History:


PR-Codex overview

This PR focuses on enhancing the functionality for collecting and processing the most common words from README files. It introduces new tasks, modifies workflows, and adds tests to ensure the correctness of the new features.

Detailed summary

  • Updated extend-exclude in _typos.toml to include Markdown files.
  • Modified .github/workflows/markdown-lint.yml to ignore specific directories.
  • Added a new task mcw in justfile for collecting most common words.
  • Updated pyproject.toml to include markdown-it-py dependency.
  • Introduced new mcw task in the Poetry configuration.
  • Changed workflow in .github/workflows/collect.yml to process most common words.
  • Updated logging in sr-data/src/sr_data/steps/extract.py for clarity.
  • Created tests in sr-data/src/tests/test_mcw.py for mcw functionality.
  • Added example README content in sr-data/src/tests/to-words.md and sr-data/src/tests/to-mcw.csv.
  • Implemented the to_words, remove_stop_words, lemmatize, and most_common functions in sr-data/src/sr_data/steps/mcw.py.

✨ Ask PR-Codex anything about this PR by commenting with /codex {your question}

@h1alexbel
Copy link
Owner Author

@rultor merge

@rultor
Copy link
Collaborator

rultor commented Oct 15, 2024

@rultor merge

@h1alexbel OK, I'll try to merge now. You can check the progress of the merge here.

@rultor rultor merged commit 19e0c17 into master Oct 15, 2024
17 checks passed
@h1alexbel h1alexbel deleted the 137 branch October 15, 2024 10:18
@rultor
Copy link
Collaborator

rultor commented Oct 15, 2024

@rultor merge

@h1alexbel Done! FYI, the full log is here (took me 9min).

@h1alexbel
Copy link
Owner Author

@rultor release, tag is 0.0.3

@rultor
Copy link
Collaborator

rultor commented Oct 15, 2024

@rultor release, tag is 0.0.3

@h1alexbel OK, I will release it now. Please check the progress here.

@rultor
Copy link
Collaborator

rultor commented Oct 15, 2024

@rultor release, tag is 0.0.3

@h1alexbel Done! FYI, the full log is here (took me 15min).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

count most common words in the entire README
3 participants