The "Audio Filtering System" project addresses the challenge of moderating and filtering offensive or unwanted words within audio streams using advanced speech recognition and audio processing techniques. It aims to create a positive audio environment across various communication platforms by identifying and removing undesired content in real-time.
The system processes audio input, converts speech to text, and identifies offensive words from a predefined blacklist. These words are then replaced with beep sounds, providing uninterrupted, moderated audio content.
The project is motivated by the increasing demand for respectful and safe communication platforms. It leverages advanced technology to foster positive and inclusive interactions, promoting constructive dialogue across diverse platforms.
- Accessibility: Designed to be user-friendly and accessible to individuals of all technical backgrounds.
- Timesaving: Automates the content moderation process, saving users from manual intervention.
- Customization: Allows users to define their own blacklist of unwanted words, tailoring the system to their specific needs.
- Develop a Classification Model: Create a machine learning model to identify offensive words.
- Word Replacement or Censoring: Replace offensive words with beeps or softer alternatives.
- Audio File Generation: Modify audio files to reflect the censored content.
- Predictive Censoring: Estimate the probability of offensive words before they are spoken for proactive moderation.
- Media and Entertainment: Censor offensive language in live and recorded content.
- Customer Service and Call Centers: Maintain professional communication standards.
- Educational Platforms: Ensure safe and respectful learning environments.
- Social Media: Efficiently moderate live and recorded content.
- Public Safety: Monitor audio for aggressive or threatening speech.
Technology Stack:
- Python
- SpeechRecognition Library
- Pydub
- PyCharm
- TensorFlow
- PyTorch
- Flask
Hardware Requirements:
- Microphone
- Speakers or Headphones
- Training the Model: Use BERT for offensive word detection in Marathi text, fine-tuning it for accuracy.
- Word Replacement: Replace detected offensive words in audio with beeps or alternatives.
- Audio Modification: Adjust the original audio to reflect the censored content.
- Predictive Analysis: Analyze prefixes in transcribed audio data to understand patterns leading to offensive language.
The Audio Filtering System project provides an innovative and comprehensive solution for moderating offensive content in audio streams, promoting a safer and more respectful communication environment.
Table 1. Sample data
Fig. 1. Accuracy of model.
Screenshort while running system locally.
table 2. Less offensive word substitution.
Fig. 2. Distribution of Prefix words followed by Offensive words



