Change the repository type filter
All
Repositories list
6 repositories
- An Information Retrieval Library mainly for learning the concepts of IR and developing tools in this domain for specific purposes.
- A rudimentary Implementation of a simple neural ranking model. Based on word embeddings (Glove, Fasttext). The pre-trained model's inherent vector cosine similarity is the main metric for consideration.
VSM-TFIDF
PublicA simple implementation of a vector space model where documents (text) is converted into vectors. Term frequency-inverse document frequency [TF-IDF] is used for the vectorization.Practical-NLP
PublicNLP, Text Mining and Machine Learning starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, keyword extraction with TFIDF, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.Awesome-IR
Public