Skip to content

This repository focuses on recognizing entities in news articles and mapping them to predefined entity names with corresponding IDs.

Notifications You must be signed in to change notification settings

haozhuang0000/NER_News

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NER_News

This repository focuses on recognizing entities in news articles and mapping them to predefined entity names with corresponding IDs.

Install environment

Code running instruction

Prerequiest for generate_mapping_company.py:

  • Please create Milvus vector datebase use this code Create_VDB.py under Scripts.VDB_Similiarity_Search
  • Setting for Embedding model, either Server or Local

Option 1:

  • run code step by step
  1. run mongodb.py to insert raw news data
  2. run ner.py to extract companies for each sentence
  3. run ner_output_processor.py to further process from ner
  4. run generate_mapping_company.py --embedding_method Local to do similarity calculation

Option 2:

run code in one shot:

  • run main.py --embedding_method Local for all functionality

About

This repository focuses on recognizing entities in news articles and mapping them to predefined entity names with corresponding IDs.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages