Skip to content

"Hey Ditto" activation model using CNN-LSTM neural network.

License

Notifications You must be signed in to change notification settings

omarzanji/ditto_activation

Repository files navigation

HeyDittoNet

Spoken "Hey Ditto" activation using CNN-LSTM model. Model trained on both synthetic and real human voices along with samples of background noise from various scenes around the world.

Getting Started

  1. Install required packages: pip install -r requirements.txt
  2. Run: python main.py to test activation on your default mic.

Model Architecture

  1. CNN-LSTM model architecture below with 99% testing accuracy on roughly 30,000 audio samples:

HeyDittoNet

Training Metrics

  1. CNN-LSTM Training Loss:

HeyDittoNet Training Loss