Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Huge amount of scam links going undetected by Automod #481

Open
sahilasopa opened this issue Feb 18, 2024 · 7 comments
Open

Huge amount of scam links going undetected by Automod #481

sahilasopa opened this issue Feb 18, 2024 · 7 comments

Comments

@sahilasopa
Copy link
Contributor

Currently, Javabot relies on the file src/main/resources/spamLinks.txt for detecting scam links, but this file was updated 3 years ago, it should be considered to update the file to contain the newest URLs

@sahilasopa sahilasopa changed the title Huge amount of scam links going detected by Automod Huge amount of scam links going undetected by Automod Feb 18, 2024
@danthe1st
Copy link
Member

I am unsure about the extent this file is actually used since the bot seems to load the new version of the spam URLs on startup.

@sahilasopa
Copy link
Contributor Author

I am unsure about the extent this file is actually used since the bot seems to load the new version of the spam URLs on startup.

it does load it in memory but it should not be tooooo big of a problem unless the bot is hosted on a toaster, and I skimmed through the last few scam URLs sent on the server, the updated file in #482 seems to contain 9/10 of those URLs

@sahilasopa
Copy link
Contributor Author

After doing a quick calculation, the file loads 195.73 KB of data in memory, which seems pretty reasonable.

@danthe1st
Copy link
Member

it does load it in memory but it should not be tooooo big of a problem unless the bot is hosted on a toaster, and I skimmed through the last few scam URLs sent on the server, the updated file in #482 seems to contain 9/10 of those URLs

This is not about the amount of URLs. I was saying that I think that the bot actually doesn't use the file specified in the resources directory but automatically downloads the newest version and uses that.

@sahilasopa
Copy link
Contributor Author

it does load it in memory but it should not be tooooo big of a problem unless the bot is hosted on a toaster, and I skimmed through the last few scam URLs sent on the server, the updated file in #482 seems to contain 9/10 of those URLs

This is not about the amount of URLs. I was saying that I think that the bot actually doesn't use the file specified in the resources directory but automatically downloads the newest version and uses that.

I missed that, well in that case the file should be removed from resources directory

@danthe1st
Copy link
Member

danthe1st commented Feb 20, 2024

If that file is actually unused, it might be good to completely delete it but it would be necessary to verify this.

@danthe1st
Copy link
Member

danthe1st commented Feb 20, 2024

Originally, it was loaded from resources but this was changed to automatically fetch it and the file might be completely obsolete since then (unless it would be in use somewhere else and I didn't find anything with a full-text search on spamLinks.txt so we can probably delete the file altogether.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants