What is Semantic Similarity?

Semantic similarity is an application of a technology called text embedding. One of the most useful recent technologies in natural language processing, text embedding transforms words into numerical representations (vectors) whose distances approximate the conceptual distance between word meanings.

Many NLP applications need to compute the similarity in meaning between two short texts. Search engines, for example, need to model the relevance of a document to a query beyond the overlap in words between the two. Similarly, question-and-answer sites such as Quora need to determine whether a question has already been asked. This type of text similarity is often computed by first embedding the two short texts and then calculating the cosine similarity between the resulting vectors.
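As a minimal sketch of that last step, the snippet below computes cosine similarity between two vectors using only the standard library. The function name `cosine_similarity` and the example vectors are made up for illustration and are not taken from this repository's scripts; real embeddings would come from one of the models listed further down.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # Convention chosen for this sketch: an all-zero vector is
        # treated as having no similarity to anything.
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical 3-dimensional embeddings of two short texts.
query_vec = [0.2, 0.7, 0.1]
doc_vec = [0.25, 0.6, 0.05]
print(cosine_similarity(query_vec, doc_vec))
```

Values close to 1 mean the two vectors point in nearly the same direction, values near 0 mean they are unrelated, and negative values mean they point in opposing directions.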

Which embeddings are we using?

We're using the following embeddings:

BERT
ELMo
spaCy
Word2Vec (W2V)
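Static word-vector models such as Word2Vec assign one vector per word, so a short text needs to be reduced to a single vector before two texts can be compared. One common approach is to average the vectors of the words in the text. The sketch below illustrates that idea with a tiny hand-made vocabulary; `TOY_VECTORS` and `embed_text` are hypothetical names, the numbers are invented, and whether this repository's scripts pool vectors this way is an assumption.

```python
# Hypothetical toy word vectors; a real model would supply
# vectors with hundreds of dimensions.
TOY_VECTORS = {
    "how": [0.1, 0.3, 0.0],
    "to": [0.0, 0.1, 0.1],
    "learn": [0.7, 0.2, 0.4],
    "python": [0.9, 0.1, 0.8],
}


def embed_text(text, vectors):
    """Average the vectors of the known words in `text`.

    Returns None when no word in the text has a vector.
    """
    tokens = text.lower().split()
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


print(embed_text("How to learn Python", TOY_VECTORS))
```

Contextual models such as BERT and ELMo work differently: they produce a vector for each token that depends on the surrounding words, so pooling is applied to those contextual vectors instead.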

Requirements

Python (3.0 and above)
Flask
TensorFlow
BERT pre-trained model (download from https://github.com/google-research/bert#pre-trained-models)
AllenNLP
spaCy

Steps

pip install -r requirements.py
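After installing, it can help to confirm that the packages listed under Requirements are importable. The following is a small sketch using only the standard library; the module names in `REQUIRED_MODULES` are the usual import names for those packages and are an assumption here, and `missing_modules` is a hypothetical helper, not part of this repository.

```python
import importlib.util

# Assumed import names for the packages listed under Requirements.
REQUIRED_MODULES = ["flask", "tensorflow", "allennlp", "spacy"]


def missing_modules(names):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_modules(REQUIRED_MODULES)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All required modules were found.")
```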



JBAhire/semantic-similarity
