DialTest

With the tremendous advancement of recurrent neural network (RNN), dialogue systems have achieved significant development. Many RNN-driven dialogue systems, such as Siri, Google Home, and Alexa, have been deployed to assist various tasks. However, accompanying this outstanding performance, RNN-driven dialogue systems, which are essentially a kind of software, could also produce erroneous behaviors and result in massive losses. Meanwhile, the complexity and intractability of RNN models that power the dialogue systems make their testing challenging.

We design and implement DialTest, the first RNNdriven dialogue system testing tool. DialTest employs a series of transformation operators to make realistic changes on seed data while preserving their oracle information properly. To improve the efficiency of detecting faults, DialTest further adopts Gini impurity to guide the test generation process. To validate DialTest, we conduct extensive experiments. We first experiment it on two fundamental tasks, i.e., intent detection and slot filling, of natural language understanding. The experiment results show that DialTest can effectively detect hundreds of erroneous behaviors for different RNN-driven natural language understanding (NLU) module of dialogue systems and improve their accuracy via retraining with the generated data. Further, we conduct a case study on an industrial dialogue system to investigate the performance of DialTest under the real usage scenario. The study shows DialTest can detect errors and improve the robustness of RNN-driven dialogue systems effectively.

Transformations

Synonym Replacement (SR): The operators of this family transform the sentence by replacing an individual word with its synonyms, which keeps the meaning of this sentence not changed.
Back Translation (BT): The operators of this family translate the target sentence into an intermediate language and then translates it back to the original language.
Word Insertion (WI): The operators of this family transformthe sentence by inserting words with the pre-trained language model.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
README.assets		README.assets
bert		bert
cov		cov
data		data
eval		eval
fig		fig
save_profile		save_profile
transformation		transformation
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
bert_aug.py		bert_aug.py
main.py		main.py
wordnet_aug.py		wordnet_aug.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DialTest

Transformations

About

Releases

Packages

Languages

License

cicilzx/DialTest

Folders and files

Latest commit

History

Repository files navigation

DialTest

Transformations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages