Skip to content

[ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech

License

Notifications You must be signed in to change notification settings

Alibaba-NLP/AISHELL-NER

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

AISHELL-NER

[ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech

PDF

Run Entity-Aware ASR with ESPnet

We conduct the end-to-end entity-aware ASR experiments with ESPnet. First, install ESPnet following the instruction. Then move to the aishell directory under the egs2 directory.

$ cd egs2/aishell/asr1

You could download AISHELL-1 through:

$ ./run.sh --stage 1 --stop-stage 1

Otherwise you could mannuly download the dataset at https://www.openslr.org/33/.

Once AISHELL-1 is downloaded and extracted, replace the origin transcription in downloads/data_aishell/transcript/aishell_transcript_v0.8.txt by our aishell_ner_transcript.txt. Please note the file name should be the same as aishell_transcript_v0.8.txt. Another way is to change aishell_text=path/to/aishell_ner_transcript.txt in local/data.sh.

Then you could run the Conformer entity-aware ASR experiments through:

$ ./run.sh

Nested Version

You could build a nested version referring to CNERTA, please note that some annotations may differ.

Citation

If you find the dataset useful, please consider citing our paper:

@inproceedings{chen2022aishell,
title={AISHELL-NER: Named Entity Recognition from Chinese Speech},
author={Chen, Boli and Xu, Guangwei and Wang, Xiaobin and Xie, Pengjun and Zhang, Meishan and Huang, Fei},
booktitle={2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
year={2022}
}

About

[ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published