Skip to content

Return sentences that contain keywords#90

Open
katkamrachana wants to merge 32 commits intovi3k6i5:masterfrom
katkamrachana:master
Open

Return sentences that contain keywords#90
katkamrachana wants to merge 32 commits intovi3k6i5:masterfrom
katkamrachana:master

Conversation

@katkamrachana
Copy link
Copy Markdown

@katkamrachana katkamrachana commented Jul 26, 2019

While searching for a search utility, I found FlashText .
However, instead of returning only the keyword matches and the index span info, this PR focuses on returning the sentences which contain the keyword.
The approach followed is to sentence tokenize the corpus using NLTK's sent_tokenize.
This is configurable with fetch_sent flag in extract_keywords(<corpus>, <span_info_flag>, <fetch_sent>) which is False by default.
Also, keyword.py is made entirely PEP8 compliant.

P.S: Please run pip install nltk

vi3k6i5 and others added 30 commits November 10, 2017 20:47
added reference to flashtext paper
  `charactes` | `characters`
  `explaination` | `explanation`
  `matche` | `match`
Fix issue with incomplete keyword at the end of the sentence
Performances improvement for strings manipulations
@vi3k6i5
Copy link
Copy Markdown
Owner

vi3k6i5 commented May 3, 2020

Can you please resolve the conflict.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants