Skip to content

A sophisticated NLP-driven system that generates interview questions and answers tailored to specific topics. It boasts speech synthesis and grammatical error detection functionalities, enhancing communication quality. This tool promises to streamline interview preparation and foster more effective communication skills.

License

Notifications You must be signed in to change notification settings

archie-a18/Tech-Enhanced-AI-Interview-Learning-Platform

 
 

Repository files navigation

Tech-Enhanced-AI-Interview-Learning-Platform

This project introduces an advanced machine learning model designed to generate a wide range of interview questions tailored to specific topics based on an individual's resume, all while maintaining a deep level of conversation. Furthermore, we incorporate cutting-edge natural language processing (NLP) algorithms to analyse spoken responses, pinpointing grammatical errors, and providing precise corrections post-interview. Additionally, we employ state-of-the-art speech processing techniques, including Automatic Speech Recognition (ASR), to assess learners' speaking pace, detect variations, and provide timely feedback for improvement. The goal of our model is to boost its adaptability and effectiveness across diverse topics and communication styles.

Project Pipeline

FLOWCHART

DATASET PREPARATION

We combined various datasets containing Questions and Answers for various Job profiles such as Software, Business, Language and AI-ML Roles. We added context to them; which was well required by a T5 Transformer to learn and generate Questions.

OVERVIEW

-RESUME SCANNING and CLASSIFICATION
-QUESTION GENERATION
-ASR and SPEECH ANALYSES
-GRAMMAR CORRECTION(using API)
-EVALUATION(BLEU SCORES)

Resume Scanning and Classification: User will input a pdf of his/her resume. NER is used to identify keywords based on the skills and projects mentioned in resume. Then using used Multinomial Naive Bayes, Nearest K-Neighbour Classification to classify the resumes into various job profiles.

WhatsApp Image 2024-04-13 at 00 47 16_94d37725

Question Generation: Based on the job Profile, it will generate questions.For Question Generation, we have fine tuned a T5 Transformer for text-to-text generation. The Dataset consists of 4 columns: Questions, Answers, Job Profiles and Context.

T5 Transformer (Text-To-Text Transfer Transformer) We fine tuned a T5 Transformer , which is an encoder-decoder model pre-trained on Squad Dataset. It works well on a variety of tasks that include text-to-text generation. It takes input in form of text and generates output in the form of text.

image

ASR and SPEECH ANALYSES: Used pyAudio to convert audio into text to generate Follow-Up Questions and then it uses the candidate speech to generate follow up questions using the same Question Generation Model. We also use this audio to identify grammatical errors using a python library:language_tool_python.Automatic Speech Recognition (ASR) is a complex process that converts spoken language into written text. It encompasses several stages including audio pre-processing, feature extraction, phoneme and word mapping, and statistical modeling to determine the most probable sequence of words. The outcome of ASR is the transcribed text, offering a valuable tool for accurately capturing and interpreting spoken information in various applications.

EVALUATION:BLEU Scores are used to evaluate the model for generated text. BLEU Scores:It is a number between zero and one that measures the similarity of the machine-generated text with human generated text. A score of 0.6 to 0.7 is considered the standard.

About

A sophisticated NLP-driven system that generates interview questions and answers tailored to specific topics. It boasts speech synthesis and grammatical error detection functionalities, enhancing communication quality. This tool promises to streamline interview preparation and foster more effective communication skills.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.5%
  • Python 0.5%