Skip to content

VecShare Stable Release

Latest
Compare
Choose a tag to compare
@JaredFern JaredFern released this 15 Aug 17:40
· 10 commits to master since this release

Stable Release including:

VecShare: Library for Sharing Word Vectors

Release v1.2.2

This library provides functionality for rapidly sharing and retrieving word embeddings over the internet. Additional information on the VecShare framework can be found at: https://bit.ly/VecShare (Accepted at EMNLP 2017).

Download at pip install vecshare

Supported Functions

The VecShare Python library currently supports:

  • check: See available embeddings
  • format: Autoformat a header to upload an embedding to the data store or Compress an embedding
  • update: Update an existing embedding or its metadata
  • query: Look up word vectors from a specific embedding
  • extract: Download word vectors for only the vocabulary of a specific corpus
  • download: Download an entire shared embedding

Supported Selection Methods

  • maxtkn: Select embedding trained on most tokens
  • simscore: Select embedding scoring highest on 9 set similarity task
  • avgrank: Select embedding with highest avg rank signature score (See https://bit.ly/VecShare)