Corpus Builder

Corpus Builder is a web application that converts unclean text into clean, segmented text. It joins the input into a single line, splits it into segments at each number, and then removes the numbers. This makes it useful for preparing text data for analysis or presentation.

Features

  • Single Line Conversion: Removes line breaks to convert text into a single line.
  • Automatic Segmentation: Adds line breaks after each occurrence of a number.
  • Number Removal: Strips numbers from the text, leaving behind clean, segmented content.
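The three features above can be read as one small pipeline. The following is a minimal sketch of that reading, not the application's actual source: the function name `cleanText` and the regular expressions are assumptions chosen for illustration.

```javascript
// Hypothetical helper illustrating the three features; the name and the
// regular expressions are assumptions, not taken from the repository.
function cleanText(raw) {
  // 1. Single line conversion: replace line breaks (and the whitespace
  //    around them) with a single space.
  const singleLine = raw.replace(/\s*[\r\n]+\s*/g, " ");

  // 2. Automatic segmentation: insert a line break after each run of digits.
  const segmented = singleLine.replace(/(\d+)\s*/g, "$1\n");

  // 3. Number removal: drop the digits, trim each segment, and discard
  //    segments that end up empty.
  return segmented
    .split("\n")
    .map((line) => line.replace(/\d+/g, "").trim())
    .filter((line) => line.length > 0)
    .join("\n");
}

const sample = "First verse 1 second\nverse 2 third verse 3";
console.log(cleanText(sample));
// First verse
// second verse
// third verse
```

In this sketch every run of digits is treated as a segment boundary, so numbers that belong to the content itself (dates, quantities) would also be removed.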

Demo

You can try the live version of the application here.

Technologies Used

  • React: JavaScript library for building user interfaces.
  • Tailwind CSS: Utility-first CSS framework for styling.
  • Netlify: Platform for deploying web applications.

Installation

To run this project locally, follow these steps:

  1. Clone the repository:

    git clone https://github.com/ankitklakra/corpus-builder.git

  2. Navigate to the project directory:

    cd corpus-builder

  3. Install the dependencies:

    npm install

  4. Start the development server:

    npm start

  5. Open your browser and go to http://localhost:3000 to view the application.

Usage

  1. Enter the unclean text into the provided text area.
  2. Click the "Submit" button to process the text.
  3. The cleaned and segmented text will appear in the output text area.

Contributing

To contribute to this project, fork the repository and submit a pull request. All contributions are welcome!

License

This project is licensed under the MIT License.
