[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
Updated Jul 11, 2023 - Python
This repository contains an implementation of environmental sound classification on the ESC-50 dataset using ACDNet.
This project focuses on the ESC50 Challenge. The ESC-50 dataset is a labeled collection of 2,000 environmental audio recordings suitable for benchmarking environmental sound classification methods.
Model Deployment for HEAR4U Bangkit Capstone Project
Simplified PyTorch implementation of audio classification that supports multi-GPU training and validation, automatic mixed precision training, knowledge distillation, etc.