Found here
"Word clouds (also known as text clouds or tag clouds) work in a simple way: the more a specific word appears in a source of textual data (such as a speech, blog post, or database), the bigger and bolder it appears in the word cloud.
A word cloud is a collection, or cluster, of words depicted in different sizes. The bigger and bolder the word appears, the more often it’s mentioned within a given text and the more important it is." as mentioned by Bootlabs.
This project is an open-source word cloud generator that uses the d3 library and the works of Jason Davies found here.
This project is a simple word cloud generator so it is easier to generate word clouds based on certain requirements.
More interestingly most of the word cloud generators out there are too tedious to use with their configuration as well as limited options in terms of data cleaning and transformation. This project aims to be
- easy to use for a person with no programming knowledge
- providing simple data transformation and cleaning based on known ways
For more information, please read the Wikipedia entry here
Plans are in place to implement, 3 published stop words lists.
- The Snowball list is found here
- The Terrier list is found here
- The Minimal list is found here
- Custom list from the user
Depending on interest, this list will be expanded over time.
Plans are in place to implement 2 stemming methodologies
Lemmatization will also be implemented based on the WordNet found here
Feel free to open an issue for discussion or a contribution to this repository. It is hosted via gh-pages
Suggestions for the project are also welcomed but I aim to keep the project as minimalistic as possible without the need for too many external dependencies and libraries.
Remember to use a feature
branch for all PRs
- Fork this project. Then
clone
in your local system - Then create your own branch
git checkout -b feature
- Remember to install npm dependencies via
npm install
- To run the project just do
npm run build
If you see an error during the npm install, please delete the LICENSE and README files under the node-lemmatizer under the node_modules directory