dmafanasyev/rulexicon

rulexicon

Project Status: Active - The project has reached a stable, usable state and is being actively developed. Lifecycle: stable

An R data package containing a collection of Russian lexicons, polarity tables, stop word lists, lemmatization tables, and other resources for text analysis.

Naming convention

The naming convention follows that of the lexicon package and helps to categorize the data types.

| Prefix | Description |
|---|---|
| `key_` | A data.frame with a lookup and return value |
| `hash_` | A keyed data.table hash table |
| `hash_lemmas_` | A lemmatization hash table |
| `hash_sentiment_` | A sentiment polarity hash table |
| `profanity_` | A profane words vector |
| `sw_` | A stopword vector |
| `freq_` | A data.table of terms with frequencies |
| `_en` (suffix) | An English version of the dictionary |
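
As a minimal sketch of how the prefixes listed above can be used to categorize data sets by name, the following base R snippet maps a name to its category. The `prefixes` vector, the `classify_dataset()` helper, and the example names are all hypothetical and are not part of the package.

```r
# Map each naming prefix to a short label.
# Longer prefixes come first so "hash_lemmas_" is matched before "hash_".
prefixes <- c(
  hash_lemmas_    = "lemmatization hash table",
  hash_sentiment_ = "sentiment polarity hash table",
  hash_           = "keyed data.table hash table",
  key_            = "lookup data.frame",
  profanity_      = "profane words vector",
  sw_             = "stopword vector",
  freq_           = "term frequency data.table"
)

# Return the label of the first matching prefix, or NA if none matches.
classify_dataset <- function(name) {
  for (p in names(prefixes)) {
    if (startsWith(name, p)) return(unname(prefixes[p]))
  }
  NA_character_
}

# Hypothetical data set names, used only to illustrate the convention.
example_names <- c("sw_example", "hash_sentiment_example", "hash_lemmas_example_en")
vapply(example_names, classify_dataset, character(1))
```

The order of `prefixes` matters: because `hash_lemmas_` and `hash_sentiment_` both start with `hash_`, the more specific prefixes have to be tested first.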

Installation

You can install this package from GitHub with:

# install.packages("devtools")
devtools::install_github("dmafanasyev/rulexicon")
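
Once the package is installed, one way to see which data sets it provides is base R's `data()` function. The sketch below assumes nothing about specific data set names. The `list_rulexicon_data()` helper is hypothetical and returns an empty vector when the package is not available.

```r
# List the names of the data sets bundled with a package.
# Returns character(0) if the package is not installed.
list_rulexicon_data <- function(pkg = "rulexicon") {
  if (!requireNamespace(pkg, quietly = TRUE)) return(character(0))
  unname(data(package = pkg)$results[, "Item"])
}

datasets <- list_rulexicon_data()

# Keep only the stopword vectors, following the sw_ prefix convention.
datasets[startsWith(datasets, "sw_")]
```

An individual data set can then be loaded by name with `data()` or accessed directly after `library(rulexicon)`, depending on how the package exports its data.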