This project is for Audio Classification using Transformer Architecture. The following are the expriments that are conducted -
- Audio classification using Mel Spectograms(mels).
- Application of ViT to the mels.
- Application of Masked Auto Encoders.
- Applicaiton of Swin Transformers.
Follow all the steps carefully
This project requires python 3.6 or greater and also use virutalenv
- Installing virtualenv
sudo pip3 install virtualenv
- Clone the repo
git clone https://github.com/Saivivek-Peddi/audio_transformer.git
- CD into the project directory
cd audio_transformer
- Create a virtualenv and start it
virtualenv venv && source venv/bin/activate
- Install all the requirements
pip install -r requirements.txt
Distributed under the MIT License. See LICENSE
for more information.
Sai Vivek Peddi - svpeddi@ucdavis.edu