Using RD-Filters on MoleculeNet datasets #23

Open
rbharath opened this issue Jan 29, 2021 · 0 comments

A number of the MoleculeNet datasets contain PAINS (pan-assay interference) compounds and other problematic compounds that detract from their usefulness as benchmarking datasets. Let's use this issue to brainstorm which datasets we want to filter. I think we can use @PatWalters's rd_filters library (https://github.com/PatWalters/rd_filters) to help us filter them.

Off the top of my head, I think we can start by applying the PAINS filters to the

  1. ChEMBL
  2. ChEMBL25
  3. HIV
  4. PCBA

datasets. We should discuss here whether this makes sense, though.
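
To make the proposal concrete, here is a minimal sketch of what PAINS filtering of a SMILES list could look like. It does not use rd_filters itself; as an assumption for illustration, it swaps in the PAINS catalog that ships with RDKit (`FilterCatalog`). The helper names `make_pains_checker` and `split_by_flag` are hypothetical, and treating unparseable SMILES as flagged is a choice made here, not something decided in this issue.

```python
from typing import Callable, Iterable, List, Tuple


def make_pains_checker() -> Callable[[str], bool]:
    """Build a predicate that is True when a SMILES string matches a PAINS pattern.

    Assumption: uses RDKit's built-in PAINS catalog rather than rd_filters.
    RDKit is imported lazily so split_by_flag can be used without it.
    """
    from rdkit import Chem
    from rdkit.Chem.FilterCatalog import FilterCatalog, FilterCatalogParams

    params = FilterCatalogParams()
    params.AddCatalog(FilterCatalogParams.FilterCatalogs.PAINS)
    catalog = FilterCatalog(params)

    def is_pains(smiles: str) -> bool:
        mol = Chem.MolFromSmiles(smiles)
        if mol is None:
            # Assumption: unparseable SMILES are flagged so they get reviewed.
            return True
        return catalog.HasMatch(mol)

    return is_pains


def split_by_flag(
    smiles_list: Iterable[str], is_flagged: Callable[[str], bool]
) -> Tuple[List[str], List[str]]:
    """Partition SMILES into (kept, flagged) using the supplied predicate."""
    kept: List[str] = []
    flagged: List[str] = []
    for smi in smiles_list:
        (flagged if is_flagged(smi) else kept).append(smi)
    return kept, flagged


if __name__ == "__main__":
    # Placeholder molecules; a real run would read the SMILES column of a dataset.
    example_smiles = ["CCO", "c1ccccc1", "CC(=O)Oc1ccccc1C(=O)O"]
    kept, flagged = split_by_flag(example_smiles, make_pains_checker())
    print(f"kept {len(kept)} molecules, flagged {len(flagged)}")
```

Keeping the flagged molecules in a separate list, rather than discarding them, would let us report how many compounds each dataset loses before we decide which ones to filter.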

CC @mufeili @PatWalters @lilleswing @peastman
