Skip to content

Commit

Permalink
FSTALIGN-52: Remove some synonyms (#30)
Browse files Browse the repository at this point in the history
* Remove broken synonyms

* maintain unk synonym
  • Loading branch information
nishchalb authored Apr 27, 2022
1 parent d490d01 commit 2456389
Showing 1 changed file with 24 additions and 39 deletions.
63 changes: 24 additions & 39 deletions sample_data/synonyms.rules.txt
Original file line number Diff line number Diff line change
Expand Up @@ -101,7 +101,6 @@ they have | they've
you've | you have
you have | you've

# TODO: which other contractions?

'cause | cause ; because
because | 'cause ; cause
Expand All @@ -125,41 +124,27 @@ outta | out of
what are you | whatcha
whatcha | what are you

# handling comments/annotations in the nlp files
# note: anything after a : is ignored for now
# how to mimig deletions... ε or <del>?
# both options cause problems right now, so we'll avoid that for now
<barking> | <del> ; <unk>
<cheer> | <del> ; <unk>
<clap> | <del> ; <unk>
<clear_throat> | <del> ; <unk>
<cough> | <del> ; <unk>
<crosstalk> | <del> ; <unk>
<cry> | <del> ; <unk>
<finger_snap> | <del> ; <unk>
<foreign> | <del> ; <unk>
# <foreign:{note}> | <del> ; <unk>
<groan> | <del> ; <unk>
<grunt> | <del> ; <unk>
<inaudible> | <del> ; <unk>
<laugh> | <del> ; <unk>
# <laugh:chuckle> | <del> ; <unk>
# <laugh:{note}> | <del> ; <unk>
<mumble> | <del> ; <unk>
<murmur> | <del> ; <unk>
<phonetic> | <del> ; <unk>
<scream> | <del> ; <unk>
<sigh> | <del> ; <unk>
<silence> | <del> ; <unk>
# <silence:blank_audio> | <del> ; <unk>
# <silence:pause> | <del> ; <unk>
# <silence:pause> | <del> ; <unk>
<sing> | <del> ; <unk>
<siren> | <del> ; <unk>
<unk> | <del> ; <unk>
<visual> | <del> ; <unk>
# <visual:{note}> | <del> ; <unk>
<yawn> | <del> ; <unk>

# anything not explicitly handled will be ignored from the graph
#<{note}> | <del>
<barking> | <unk>
<cheer> | <unk>
<clap> | <unk>
<clear_throat> | <unk>
<cough> | <unk>
<crosstalk> | <unk>
<cry> | <unk>
<finger_snap> | <unk>
<foreign> | <unk>
<groan> | <unk>
<grunt> | <unk>
<inaudible> | <unk>
<laugh> | <unk>
<mumble> | <unk>
<murmur> | <unk>
<phonetic> | <unk>
<scream> | <unk>
<sigh> | <unk>
<silence> | <unk>
<sing> | <unk>
<siren> | <unk>
<unk> | <unk>
<visual> | <unk>
<yawn> | <unk>

0 comments on commit 2456389

Please sign in to comment.