Skip to content
@VIPL-Audio-Visual-Speech-Understanding

VIPL AVSU

Audio-Visual Speech Understanding Research Group at Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences

Pinned Loading

  1. Lipreading-DenseNet3D Lipreading-DenseNet3D Public

    DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990

    Python 117 21

  2. LipNet-PyTorch LipNet-PyTorch Public

    The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)

    Python 212 52

  3. learn-an-effective-lip-reading-model-without-pains learn-an-effective-lip-reading-model-without-pains Public

    The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the state-of-art performance in LRW-1000 dataset.

    Python 153 37

  4. AVSU-VIPL AVSU-VIPL Public

    Collection of works from VIPL-AVSU

    41 5

  5. MAVSR2025-Track2 MAVSR2025-Track2 Public

    Python 1

  6. MAVSR2025-Track1 MAVSR2025-Track1 Public

    Visual Speech Recognition baseline code for MAVSR2025 Track1

    Python 1

Repositories

Showing 10 of 10 repositories
  • VIPL-Audio-Visual-Speech-Understanding/MAVSR2025-Track2’s past year of commit activity
    Python 1 0 0 0 Updated Dec 29, 2024
  • MOV20 Public

    MOV20: A challenging dataset for Chinese visual speech recognition, consisting of video clips from 20 movies.

    VIPL-Audio-Visual-Speech-Understanding/MOV20’s past year of commit activity
    0 0 0 0 Updated Dec 10, 2024
  • MAVSR2025-Track1 Public

    Visual Speech Recognition baseline code for MAVSR2025 Track1

    VIPL-Audio-Visual-Speech-Understanding/MAVSR2025-Track1’s past year of commit activity
    Python 1 0 0 0 Updated Dec 10, 2024
  • AVSU-VIPL Public

    Collection of works from VIPL-AVSU

    VIPL-Audio-Visual-Speech-Understanding/AVSU-VIPL’s past year of commit activity
    41 5 1 (1 issue needs help) 0 Updated Aug 15, 2024
  • CAS-VSR-S68 Public

    CAS-VSR-S68: A dataset for lip reading with unseen speakers, spanning 68 hours of news broadcasts.

    VIPL-Audio-Visual-Speech-Understanding/CAS-VSR-S68’s past year of commit activity
    7 0 0 0 Updated Jul 5, 2024
  • learn-an-effective-lip-reading-model-without-pains Public

    The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the state-of-art performance in LRW-1000 dataset.

    VIPL-Audio-Visual-Speech-Understanding/learn-an-effective-lip-reading-model-without-pains’s past year of commit activity
    Python 153 37 2 1 Updated May 15, 2023
  • LipNet-PyTorch Public

    The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)

    VIPL-Audio-Visual-Speech-Understanding/LipNet-PyTorch’s past year of commit activity
    Python 212 52 3 0 Updated Sep 21, 2022
  • VIPL-Audio-Visual-Speech-Understanding/SBL_For_Multilingual_Lip_Reading’s past year of commit activity
    Python 2 1 0 0 Updated Apr 1, 2022
  • deep-face-speechreading Public

    Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition"

    VIPL-Audio-Visual-Speech-Understanding/deep-face-speechreading’s past year of commit activity
    Python 17 5 1 0 Updated Apr 12, 2021
  • Lipreading-DenseNet3D Public

    DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990

    VIPL-Audio-Visual-Speech-Understanding/Lipreading-DenseNet3D’s past year of commit activity
    Python 117 21 1 0 Updated Dec 10, 2020

Top languages

Loading…

Most used topics

Loading…