Kashmiri Text Craft performs basic preprocessing of Kashmiri text, standardizing it so that it is easier to work with in applications such as natural language processing (NLP), machine learning, and text analysis.
Feature | Description |
---|---|
Normalize | Removes irrelevant characters |
Normalize | Handles variants |
Normalize | Normalizes diacritics |
Normalize | Handles punctuation |
Preprocess | Adjusts digit spacing |
Preprocess | Handles spacing around English characters |
Preprocess | Handles spacing around punctuation |
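
The two features in the table can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names `normalize` and `preprocess`, the character sets, and the variant and punctuation mappings are all assumptions chosen for the example, and it uses the standard-library `re` and `unicodedata` modules rather than the `regex` library so that it runs on its own.

```python
import re
import unicodedata

# Assumed for illustration: characters treated as "irrelevant"
# (tatweel, zero-width space, LRM/RLM marks, byte-order mark).
IRRELEVANT = re.compile("[\u0640\u200B\u200E\u200F\uFEFF]")

# Assumed variant folding: Arabic kaf/yeh to the keheh/Farsi yeh forms.
VARIANTS = str.maketrans({"\u0643": "\u06A9", "\u064A": "\u06CC"})

# Assumed punctuation mapping: Latin marks to Arabic-script counterparts.
PUNCTUATION = str.maketrans({",": "\u060C", ";": "\u061B", "?": "\u061F"})

# Rough character classes for Arabic-script letters and punctuation.
ARABIC_LETTER = "[\u0620-\u064A\u066E-\u06D3]"
ARABIC_PUNCT = "[\u060C\u061B\u061F\u06D4]"

# Zero-width positions between a letter and a digit (either order).
DIGIT_BOUNDARY = re.compile(r"(?<=[^\W\d_])(?=\d)|(?<=\d)(?=[^\W\d_])")
# Zero-width positions between a Latin letter and an Arabic-script letter.
SCRIPT_BOUNDARY = re.compile(
    f"(?<=[A-Za-z])(?={ARABIC_LETTER})|(?<={ARABIC_LETTER})(?=[A-Za-z])"
)
SPACE_BEFORE_PUNCT = re.compile(rf"\s+({ARABIC_PUNCT})")
NO_SPACE_AFTER_PUNCT = re.compile(rf"({ARABIC_PUNCT})(?=\S)")


def normalize(text: str) -> str:
    """Hypothetical normalizer covering the four 'Normalize' rows."""
    text = unicodedata.normalize("NFC", text)  # compose base letter + diacritic
    text = IRRELEVANT.sub("", text)            # drop irrelevant characters
    text = text.translate(VARIANTS)            # fold letter variants
    return text.translate(PUNCTUATION)         # unify punctuation marks


def preprocess(text: str) -> str:
    """Hypothetical preprocessor covering the three 'Preprocess' rows."""
    text = DIGIT_BOUNDARY.sub(" ", text)           # space between letters and digits
    text = SCRIPT_BOUNDARY.sub(" ", text)          # space between English and Kashmiri
    text = SPACE_BEFORE_PUNCT.sub(r"\1", text)     # no space before punctuation
    text = NO_SPACE_AFTER_PUNCT.sub(r"\1 ", text)  # one space after punctuation
    return re.sub(r"\s+", " ", text).strip()       # collapse leftover whitespace


if __name__ == "__main__":
    print(preprocess(normalize("\u0643\u0640\u064A,abc5")))
```

Running normalization before preprocessing means the spacing rules only have to deal with one form of each letter and punctuation mark.
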
- Python 3.7+
- regex library
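
A quick way to confirm these requirements before running the app is sketched below. It assumes that "regex library" refers to the third-party `regex` package; the helper name `check_requirements` is hypothetical and not part of the project.

```python
import importlib.util
import sys


def check_requirements(min_version=(3, 7), module="regex"):
    """Return (python_ok, module_available) for the current interpreter."""
    python_ok = sys.version_info >= min_version
    # find_spec returns None when a top-level module is not installed.
    module_available = importlib.util.find_spec(module) is not None
    return python_ok, module_available


if __name__ == "__main__":
    python_ok, module_available = check_requirements()
    print("Python 3.7+:", python_ok)
    print("regex installed:", module_available)
```
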
Clone this repo and run the app:
git clone https://github.com/injilashah/Kashmiri_normalizer.git
cd C:\Users\...\...\Kashmiri_normalizer\kashmiri
python app.py
Enter the Kashmiri text you want to normalize or preprocess, then click "Normalize and Preprocess". The processed text is displayed in the output text area.