SpeechGrade 🎤💬 Educational Platform for Speech Assessment and Question Generation

SpeechGrade is an innovative educational platform designed to streamline speech assessment and enhance teaching methodologies through cutting-edge technologies. By integrating the MERN stack, machine learning models, large language models (LLMs), and Google's Generative AI, SpeechGrade SpeechGrade empowers educators and students with tools for automated transcription, similarity analysis, and personalized learning.

This platform enhances teaching methodologies by streamlining speech evaluations and offering dynamic test creation capabilities, making it an indispensable tool for educational institutions and training programs.

Features

1. Speech Assessment

Audio Transcription

Users can upload audio files through a React-based client application.
The Flask server leverages OpenAI’s Whisper machine learning model for high-accuracy speech-to-text transcription.

Similarity Analysis

Transcribed text is compared to a predefined target text (e.g., speeches or learning material) using Google's Generative AI.
The platform calculates a similarity score and provides detailed insights into alignment with the target text.

Results and Feedback

Performance analysis highlights areas of strength and improvement.
Results are stored for further review, enabling continuous learning and progress tracking.

2. Question Generation and Metadata Management

Metadata Management

A Node.js server efficiently handles metadata related to teachers, students, and classes.
Enables organized storage and retrieval of educational resources.

LLM-Powered Question Generation

Teachers can generate test questions tailored to specific topics and objectives, including:
- Multiple-choice questions
- Short-answer questions
- Essay prompts
Questions are designed using advanced large language models, ensuring alignment with curriculum goals.

3. Personalized Learning

SpeechGrade incorporates data-driven insights to offer personalized feedback, enabling students to focus on specific areas for improvement. This personalization enhances the learning experience and boosts student outcomes.

Technologies Used

Frontend

React.js: Interactive and user-friendly interface for uploading audio and viewing results.
Material-UI: Modern and responsive component library for a seamless user experience.

Backend

Flask: Server-side processing for speech transcription and similarity analysis.
Node.js: Server for metadata management and integration with LLMs.
Express.js: Framework for efficient API management.

Database

MongoDB: NoSQL database for storing user profiles, performance logs, and question sets.

Machine Learning

Whisper Model: OpenAI’s state-of-the-art speech-to-text model for accurate transcription.
Google's Generative AI: Advanced model for text similarity assessment and educational content analysis.

Installation

Clone the Repository

git clone https://github.com/themihirmathur/SpeechGrade.git
cd SpeechGrade

Install Dependencies
Install required packages for both the Flask and Node.js servers.

# Flask server
cd server-flask
pip install -r requirements.txt

# Node.js server
cd ../server-node
npm install

Run the Servers
Start both servers.

# Flask server
python app.py

# Node.js server
npm start

Run the Client Application
Navigate to the client folder and start the React application.
```
cd ../client
npm install
npm start
```
Access the Application
Open your browser and navigate to http://localhost:3000.

Usage

1. Speech Assessment

Navigate to the speech assessment page on the platform.
Upload an audio file of your speech or presentation.
View the transcription and similarity score along with detailed feedback.

2. Question Generation

Teachers can log in to access the question generation tool.
Input a topic or learning material, and the platform will generate custom test questions.
Save the questions for use in assessments or classroom activities.

Evaluation Metrics

The platform employs the following metrics to assess speech similarity and system performance:

Word Error Rate (WER): Measures transcription accuracy.
Semantic Similarity Score: Quantifies alignment between the transcribed and target text.
Performance Analytics: Tracks user progress over time, providing actionable insights for improvement.

Benefits

Educator Efficiency: Automates transcription, assessment, and test creation.
Improved Learning Outcomes: Personalized feedback fosters targeted skill development.
Scalable and Flexible: Supports diverse use cases, from language learning to public speaking training.

Future Work

Real-Time Feedback: Implementing real-time transcription and scoring during live presentations.
Multi-language Support: Expanding the platform to support multiple languages for global accessibility.
Time-Series Analysis: Incorporating temporal patterns in speech to provide deeper insights into fluency and pacing.
Enhanced Analytics: Advanced dashboards for educators to analyze class-wide performance trends.

Contribution

Contributions to SpeechGrade are welcome. Please follow these steps:

Fork the repository.
Create a new branch for your feature or bug fix.
Commit your changes and open a pull request.

For major changes, open an issue to discuss the proposed modifications beforehand.

Contact

If you have any questions or feedback, feel free to reach out:
Name: Mihir Mathur
Email: themihirmathur@gmail.com
Phone: +91-7525099328

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Backend		Backend
Frontend		Frontend
LLM		LLM
ML		ML
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SpeechGrade 🎤💬 Educational Platform for Speech Assessment and Question Generation

Features

1. Speech Assessment

Audio Transcription

Similarity Analysis

Results and Feedback

2. Question Generation and Metadata Management

Metadata Management

LLM-Powered Question Generation

3. Personalized Learning

Technologies Used

Frontend

Backend

Database

Machine Learning

Installation

Usage

1. Speech Assessment

2. Question Generation

Evaluation Metrics

Benefits

Future Work

Contribution

Contact

About

Releases

Packages

Languages

themihirmathur/SpeechGrade

Folders and files

Latest commit

History

Repository files navigation

SpeechGrade 🎤💬 Educational Platform for Speech Assessment and Question Generation

Features

1. Speech Assessment

Audio Transcription

Similarity Analysis

Results and Feedback

2. Question Generation and Metadata Management

Metadata Management

LLM-Powered Question Generation

3. Personalized Learning

Technologies Used

Frontend

Backend

Database

Machine Learning

Installation

Usage

1. Speech Assessment

2. Question Generation

Evaluation Metrics

Benefits

Future Work

Contribution

Contact

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages