Skip to content

Saivivek-Peddi/audio_transformer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MIT License LinkedIn


Logo

Audio Classification

Introduction

This project is for Audio Classification using Transformer Architecture. The following are the expriments that are conducted -

  1. Audio classification using Mel Spectograms(mels).
  2. Application of ViT to the mels.
  3. Application of Masked Auto Encoders.
  4. Applicaiton of Swin Transformers.

Getting Started

Follow all the steps carefully

Prerequisites

This project requires python 3.6 or greater and also use virutalenv

  • Installing virtualenv
    sudo pip3 install virtualenv

Installation

  1. Clone the repo
    git clone https://github.com/Saivivek-Peddi/audio_transformer.git
  2. CD into the project directory
    cd audio_transformer
  3. Create a virtualenv and start it
    virtualenv venv && source venv/bin/activate
  4. Install all the requirements
    pip install -r requirements.txt

Usage

License

Distributed under the MIT License. See LICENSE for more information.

Contact

Sai Vivek Peddi - svpeddi@ucdavis.edu

About

Audio Classification using Trasnformer Architecture

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages