Commit 84f2693

0.7.27 - improved wordlists.
1 parent 66fcbee commit 84f2693

7 files changed, +119 -15 lines changed

Cargo.toml

+1 -1
@@ -1,7 +1,7 @@
 [package]
 name = "rustrict"
 authors = ["Finn Bear"]
-version = "0.7.26"
+version = "0.7.27"
 edition = "2021"
 license = "MIT OR Apache-2.0"
 repository = "https://github.com/finnbear/rustrict/"

README.md

+1 -1
@@ -177,7 +177,7 @@ is used as a dataset. Positive accuracy is the percentage of profanity detected
 
 | Crate | Accuracy | Positive Accuracy | Negative Accuracy | Time |
 |-------|----------|-------------------|-------------------|------|
-| [rustrict](https://crates.io/crates/rustrict) | 79.78% | 94.00% | 76.23% | 9s |
+| [rustrict](https://crates.io/crates/rustrict) | 79.83% | 94.00% | 76.30% | 9s |
 | [censor](https://crates.io/crates/censor) | 76.16% | 72.76% | 77.01% | 23s |
 
 ## Development
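
The README hunk above changes the overall and negative accuracy figures while positive accuracy stays at 94.00%. As a rough illustration of how three such figures relate, here is a minimal sketch. It assumes the usual confusion-matrix definitions (positive accuracy = share of profane samples flagged, negative accuracy = share of clean samples not flagged); only the positive-accuracy definition is stated in the README context line, and the `Counts` type and the sample counts are hypothetical, not taken from the crate or its benchmark.

```rust
/// Hypothetical confusion counts for a binary profanity classifier.
struct Counts {
    true_positives: u32,  // profane samples that were flagged
    false_negatives: u32, // profane samples that were missed
    true_negatives: u32,  // clean samples that were not flagged
    false_positives: u32, // clean samples that were flagged
}

/// Divides two counts, returning 0.0 for an empty denominator.
fn ratio(numerator: u32, denominator: u32) -> f64 {
    if denominator == 0 {
        0.0
    } else {
        f64::from(numerator) / f64::from(denominator)
    }
}

impl Counts {
    /// Share of all samples classified correctly.
    fn accuracy(&self) -> f64 {
        let correct = self.true_positives + self.true_negatives;
        let total = correct + self.false_negatives + self.false_positives;
        ratio(correct, total)
    }

    /// Share of profane samples that were detected.
    fn positive_accuracy(&self) -> f64 {
        ratio(self.true_positives, self.true_positives + self.false_negatives)
    }

    /// Share of clean samples that were left alone (assumed definition).
    fn negative_accuracy(&self) -> f64 {
        ratio(self.true_negatives, self.true_negatives + self.false_positives)
    }
}

fn main() {
    // Made-up counts, chosen only to make the arithmetic easy to follow.
    let c = Counts {
        true_positives: 94,
        false_negatives: 6,
        true_negatives: 76,
        false_positives: 24,
    };
    println!("accuracy:          {:.2}%", c.accuracy() * 100.0);
    println!("positive accuracy: {:.2}%", c.positive_accuracy() * 100.0);
    println!("negative accuracy: {:.2}%", c.negative_accuracy() * 100.0);
}
```

Under these definitions, adding false-positive entries can raise negative accuracy (and with it overall accuracy) without touching positive accuracy, which is consistent with the direction of the change in the table row.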

src/dictionary_blacklist.txt

+18
@@ -93,6 +93,13 @@ blue waffle
 bn
 bohunks
 bollocks
+bomb china
+bomb india
+bomb iran
+bomb israel
+bomb palestine
+bomb russia
+bomb ukraine
 bon er
 boners
 bonnering
@@ -133,6 +140,13 @@ bullet vibe
 bullshit(.*)
 bums
 bungholes
+burn china
+burn gaza
+burn israel
+burn jew
+burn jews
+burn palestine
+burn yourself
 but holes
 buttocks
 butts
@@ -215,6 +229,7 @@ dirty pillows
 dirty sanchez
 dongs
 donkey punch
+dog headed
 dog style
 douche(.*)
 drag queen
@@ -403,10 +418,12 @@ kafirs
 kikes
 kill china
 kill chinese
+kill myself
 kill people
 kill russia
 kill russian
 kill russians
+kill self
 kill students
 kill ukraine
 kill ukrainian
@@ -451,6 +468,7 @@ m
 male squirting
 masochists
 massive wood
+master race
 masturbate(.*)
 maya sol
 meat beating

src/dictionary_extra.txt

+3
@@ -85,6 +85,7 @@ few secs
 ffa game
 fire cracker
 fire crackers
+for a pea
 forgot it's
 francoitalian
 franco italian
@@ -172,11 +173,13 @@ mini game
 n't eat
 negativly
 ngad
+ngay bay
 nigth
 of agitation
 omg
 opps
 outgaminged
+pc master race
 pegging the
 plss
 plsss

src/false_positives.txt

+36 -10
@@ -1741,6 +1741,10 @@ attorneys hit
 attorneys lut
 attorneys perm
 attorneys seeks
+auburn china
+auburn israel
+auburn palestine
+auburn yourself
 aught its
 aught texts
 aught thick
@@ -2328,6 +2332,7 @@ bend overhead
 bend overnigh
 bend overs
 bend overview
+bend yourself
 benedick
 benedicks
 benkulen
@@ -2755,13 +2760,9 @@ bol lock
 bol locks
 bol look
 bol looks
-bomb china
-bomb india
-bomb iran
-bomb israel
-bomb palestine
-bomb russia
-bomb ukraine
+bomb indian
+bomb israeli
+bomb russian
 bomb usage
 bon ed
 bon eric
@@ -3360,9 +3361,8 @@ bundles
 bunga
 burgh little
 burgundies
-burn china
-burn israel
-burn palestine
+burn israeli
+burn jewel
 burst fu
 burst its
 burst texts
@@ -3631,6 +3631,7 @@ cases hit
 cases lut
 cases perm
 cases seeks
+cash apps
 casklike
 cast rate
 cast ration
@@ -5282,6 +5283,7 @@ directions hit
 directions lut
 directions perm
 directions seeks
+dirty juan
 disco jones
 disco om
 disco on
@@ -5314,6 +5316,7 @@ dives hit
 dives lut
 dives perm
 dives seeks
+dividend yourself
 divx cocktail
 divx commission
 divx cook
@@ -6582,6 +6585,7 @@ felt chuck
 felt church
 felt xhtml
 females squirting
+fend yourself
 fennig
 fers cumulative
 fers ext
@@ -6966,6 +6970,7 @@ frequencies seeks
 fribblish
 fricassees
 frickle
+friend yourself
 frigage
 frigate
 frigatoon
@@ -8342,6 +8347,7 @@ hoot caring
 hoot carl
 hoot carri
 hoot chick
+hoot girl
 hoot karl
 hop
 hope do
@@ -8611,6 +8617,7 @@ hosts perm
 hosts seeks
 hot
 hot carri
+hot girl
 hot its
 hot karl
 hot texts
@@ -9773,6 +9780,7 @@ kelkoo ny
 kelkoo om
 kelkoo on
 kelkoo ward
+kend yourself
 kenipsim
 kennedy kee
 kennedy keith
@@ -9834,6 +9842,7 @@ kijiji key
 kijiji like
 kijiji link
 kijiji lit
+kijiji myself
 kijiji slim
 kijiji ta
 kijiji tea
@@ -10024,6 +10033,7 @@ killian
 killing jewel
 killing palestinian
 killing peoples
+kills self
 kilt
 kinds cumulative
 kinds ext
@@ -10320,6 +10330,7 @@ leep peru
 leep public
 leep puzzles
 leep rick
+legend yourself
 legendic
 leges cumulative
 leges ext
@@ -10342,6 +10353,7 @@ len illinois
 lena holes
 lena zimb
 lena zinc
+lend yourself
 length little
 leningrad
 leninism
@@ -11012,6 +11024,7 @@ marsh liter
 marsh little
 marshite
 mas hole
+mas terrace
 masklike
 masochistic
 mass cocktail
@@ -11038,6 +11051,7 @@ mass pirate
 mass seeks
 mass sees
 mass sess
+mass terrace
 massachusetts cumulative
 massachusetts ext
 massachusetts hilt
@@ -11051,6 +11065,7 @@ massive woods
 master balt
 master bat
 master batter
+master races
 mastful
 masturbational
 mates cumulative
@@ -11107,6 +11122,7 @@ membered skins
 memo ron
 men sees
 menadic
+mend yourself
 menisperm
 mens cumulative
 mens esc
@@ -11917,6 +11933,7 @@ ng rope
 ngad
 ngai
 ngapi
+ngay bay
 nibbana
 nick a
 nick advertisement
@@ -12461,6 +12478,7 @@ opponents lut
 opponents perm
 opponents seeks
 opps
+or a pe
 or appeal
 or appear
 or append
@@ -12978,6 +12996,7 @@ pays hit
 pays lut
 pays perm
 pays seeks
+pc master race
 pe do
 pe nissan
 peaceful licking
@@ -13081,6 +13100,7 @@ pen nis
 pen us
 pen uzbek
 pen vs
+pend yourself
 peneseismic
 penest
 penistone
@@ -15080,6 +15100,7 @@ remedy kep
 remedy ker
 remedy kevin
 remedy key
+rend yourself
 rendered skins
 reneger
 renga
@@ -16078,6 +16099,7 @@ seminudity
 semisextile
 semislave
 send feet
+send yourself
 senior
 senior appeal
 senior appear
@@ -16572,8 +16594,10 @@ skiddycock
 skiepper
 skill china
 skill chinese
+skill myself
 skill people
 skill russia
+skill self
 skill student
 skill ukraine
 skill yourself
@@ -17725,6 +17749,7 @@ teiglech
 temple assured
 temple peer
 ten secs
+tend yourself
 tendrillar
 tenebra
 tenggerese
@@ -19475,6 +19500,7 @@ we xnxx
 wealth little
 weathercock
 weatherstrippers
+webmaster race
 week chi
 week cocktail
 week commission
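
Several additions in this commit come in pairs: `master race` enters `dictionary_blacklist.txt` while `master races`, `webmaster race`, and `pc master race` enter `false_positives.txt`. The sketch below illustrates one plausible reading of that pairing, namely that a blacklist hit is ignored when a longer false-positive phrase covers it. This is an assumption made for illustration, not a description of how rustrict actually matches text; the function `is_flagged` and its plain substring matching are hypothetical.

```rust
/// Returns true if `text` contains a blacklisted phrase that is not fully
/// covered by an occurrence of some false-positive phrase.
/// Hypothetical helper: naive substring search, not the crate's algorithm.
fn is_flagged(text: &str, blacklist: &[&str], false_positives: &[&str]) -> bool {
    let text = text.to_lowercase();
    for bad in blacklist {
        for (start, matched) in text.match_indices(*bad) {
            let end = start + matched.len();
            // A hit is excused when a false-positive phrase spans it entirely.
            let excused = false_positives.iter().any(|fp| {
                text.match_indices(*fp)
                    .any(|(fp_start, fp_match)| {
                        fp_start <= start && fp_start + fp_match.len() >= end
                    })
            });
            if !excused {
                return true;
            }
        }
    }
    false
}

fn main() {
    // Phrases taken from the hunks above.
    let blacklist = ["master race"];
    let false_positives = ["master races", "webmaster race", "pc master race"];

    for sample in ["the master race", "pc master race", "a webmaster race", "hello"] {
        println!(
            "{:?} flagged: {}",
            sample,
            is_flagged(sample, &blacklist, &false_positives)
        );
    }
}
```

Read this way, each new blacklist phrase needs matching false-positive entries for the innocent phrases that contain it, which would explain why the false-positive list grows by more lines than the blacklist in this commit.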
