Nepali Speech-to-Text Converter

Nepali Speech-to-Text Converter is a simple web application that allows users to convert their Nepali speech into text. The app records your voice and provides the transcription in real time.

Features

Speech Recording: Record Nepali speech with a single click.
Real-Time Transcription: Get the transcription of your speech immediately after recording.
Language Support: Currently supports Nepali (नेपाली).
User-Friendly Interface: Clean and intuitive design.

Prerequisites

Node.js: Ensure you have Node.js installed on your system.
Python Backend: This app requires a Python backend server to handle recording and transcription.

Installation

Clone the Repository:

git clone https://github.com/yogendrabaskota/NepaliASR.git
cd speech

Install Dependencies:
```
npm install
```
Setup Tailwind CSS: Tailwind CSS is already configured in the project. If needed, refer to the Tailwind CSS documentation for additional setup information.
Start the Development Server:
```
npm run dev
```
Run the Python Backend: Ensure the Python backend server is running. The backend is responsible for handling recording and transcription. Use the following commands:
```
cd backend
python main.py
```

Usage

Open the app in your browser:
```
http://localhost:5173
```
Click on the "Start Recording" button.
Speak in Nepali language for the required time and click on 'stop' button when you finished speaking.
View the transcription in the "Live Transcription" section. You can also copy the text by clicking 'copy' button available there.

File Structure

project-root/
├── public/
├── src/
│   ├── components/
│   │   ├── Navbar.jsx
│   │   ├── Recorder.jsx
│   ├── App.jsx
│   ├── index.css
│   ├── main.jsx
├── backend/
|   ├──models/
|   ├──recordings/
|   ├──utils/
│   ├── main.py
├── tailwind.config.js
├── postcss.config.js
├── package.json
└── README.md

Technologies Used

Frontend:
- React
- Vite
- Tailwind CSS
Backend:
- Python (Flask or FastAPI for handling API requests)
ML
- Recurrent Neural Network (RNN) for processing speech to text

Output

Contributing

Contributions are welcome! Feel free to open an issue or submit a pull request.

Feedback

If you have any feedback, please reach out to yogendrabaskota18@gmail.com , awasthintr07@gmail.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nepali Speech-to-Text Converter

Features

Prerequisites

Installation

Usage

File Structure

Technologies Used

Output

Contributing

Feedback

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
backend		backend
speech		speech
README.md		README.md

yogendrabaskota/NepaliASR

Folders and files

Latest commit

History

Repository files navigation

Nepali Speech-to-Text Converter

Features

Prerequisites

Installation

Usage

File Structure

Technologies Used

Output

Contributing

Feedback

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages