KENKU

This repository is the core implementation of the black-box audio adversarial attack framework proposed by our paper (KENKU: Towards Efficient and Stealthy Black-box Adversarial Attacks against ASR Systems).

Installation

Clone this repo.

git clone https://github.com/Xinghui-Wu/KENKU.git
cd KENKU

Create a virtual environment running Python 3.7 interpreter of later.
Install the dependencies.

pip install -r requirements.txt

Notice that we used a GPU server equipped with a NVIDIA GeForce RTX 3090 card and leveraged the PyTorch framework on the CUDA 11.0 platform to solve the optimization problems.

Usage

Register the ASR cloud services provided by the target manufacturers and fill in the relevant information in the account.json file.
Create two folders, namely songs/ and commands/, under the root directory of the KENKU project.
Prepare a few song clips in the WAV format and place them in the songs/ folder.
Specify the desired target command texts in commands/commands.txt (one line one sentence) and use text_to_speech.py to synthesize audio files for those target commands.

python text_to_speech.py

Launch the hidden voice command attack or the integrated command attack supported by KENKU.

python hidden_voice_command_attacks.py
python integrated_command_attack.py

Test the generated audio adversarial examples on the black-box commercial ASR platforms. Provide the generated csv file in step 5 to the black_box_asr.py script as the input.csv file. The output.csv file will contain the transcribed results.

python black_box_asr.py -i input.csv -o output.csv

Notice that we used the two folder names, songs/ and commands/, to configure the default command line arguments for all the Python scripts in this project. If you would like to rename the corresponding folders or files, you must provide the correct values for those related command line arguments.

In addition, the two attack scripts involve many configurable hyperparameters that can be fine-tuned to improve the attack performance. We do not guarantee the default settings are the best and we encourage the project users to test more cases.

Please use the following commands for more help.

python text_to_speech.py -h
python hidden_voice_command_attacks.py -h
python integrated_command_attack.py -h

Name		Name	Last commit message	Last commit date
Latest commit History 112 Commits
asr		asr
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
account.json		account.json
black_box_asr.py		black_box_asr.py
hidden_voice_command_attack.py		hidden_voice_command_attack.py
integrated_command_attack.py		integrated_command_attack.py
requirements.txt		requirements.txt
text_to_speech.py		text_to_speech.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KENKU

Installation

Usage

Citation

About

Releases 1

Packages

Languages

License

Xinghui-Wu/KENKU

Folders and files

Latest commit

History

Repository files navigation

KENKU

Installation

Usage

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages