This project implements a proof-of-concept pipeline for detecting cognitive decline markers in speech samples. It uses both acoustic and linguistic features to analyze speech patterns and identify potential indicators of cognitive impairment.
- Acoustic feature extraction (MFCC, pitch, spectral features)
- Linguistic feature extraction (word count, sentence structure, etc.)
- Unsupervised learning for pattern detection
- Risk score calculation
- Visualization of results
- Comprehensive reporting
- Clone the repository:
git clone <repository-url>
cd memotag_assignment- Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate- Install dependencies:
pip install -r requirements.txt-
Place audio samples in the
audio_samples/originaldirectory (supported formats: .wav, .mp3, .flac) -
Run the pipeline:
python run_pipeline.py- Check the results:
- Risk assessment report will be displayed in the console
- Visualizations will be saved as PNG files
cognitive_detection.py: Main implementation of the cognitive decline detection pipelinerun_pipeline.py: Script to run the complete analysis pipelinerequirements.txt: Python dependenciesaudio_samples/: Directory for audio filescache/: Directory for cached features and intermediate results
- Python 3.8+
- Required Python packages are listed in requirements.txt
- Audio files in .wav, .mp3, or .flac format
The pipeline generates:
- Feature importance analysis
- Risk scores for each audio sample
- Visualizations of patterns and clusters
- Detailed analysis report
For clinical deployment:
- Expand dataset with validated clinical samples
- Incorporate supervised learning with confirmed diagnoses
- Add contextual task performance features
- Develop longitudinal tracking
- Validate against established assessment tools