population as a mandatory column is unclear #21

Open
abhishek-ch opened this issue Jun 24, 2024 · 5 comments

Comments

@abhishek-ch
Collaborator

Requiring a column named population does not make sense for every use case; which column plays that role should be data driven.

@sprivite
Collaborator

Can you give me a use case in which population is not needed?

@abhishek-ch
Collaborator Author

If I create a dummy dataset on the impact of lifestyle choices such as Smoking, Exercise, and Blood pressure level, the target column would be something like Treatment or Is_Patient. I couldn't relate a population column to such a scenario.

Dataset Sample (image)

@sprivite
Collaborator

You're saying you want a different name for the column?

@abhishek-ch
Collaborator Author

Would it be possible, and would it make sense, to let any column serve as the population column? I can always make sure there is a column named population, but I'm not sure what values it should hold.

@sprivite
Collaborator

Does this solve your issue?

https://bayer-group.github.io/pybalance/03_api.html#pybalance.utils.MatchingData

Note that __init__ can take population_col as an argument, so the population column does not have to be named population.
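For the lifestyle dataset described earlier in the thread, the group a row belongs to is already encoded in a column such as Is_Patient. One option is to point population_col at that column, as the linked docs describe. Another is to derive an explicit population column from it. The snippet below is only a library-independent sketch of the second option: the rows, the label names ("patient", "control"), and the helper `add_population` are all hypothetical and are not part of pybalance.

```python
# Hypothetical rows mirroring the lifestyle dataset described in the thread.
# Column names and values are illustrative; they are not taken from the
# original screenshot.
rows = [
    {"Smoking": "yes", "Exercise": "low", "Blood_Pressure": 140, "Is_Patient": 1},
    {"Smoking": "no", "Exercise": "high", "Blood_Pressure": 118, "Is_Patient": 0},
    {"Smoking": "no", "Exercise": "low", "Blood_Pressure": 131, "Is_Patient": 1},
]


def add_population(rows, source_col, labels, population_col="population"):
    """Return copies of the rows with a population label derived from source_col.

    This helper is hypothetical (not a pybalance function). It leaves the
    input rows untouched and raises KeyError on an unmapped source value.
    """
    return [{**row, population_col: labels[row[source_col]]} for row in rows]


# Map the binary target onto two named groups to be compared.
labelled = add_population(rows, "Is_Patient", {1: "patient", 0: "control"})
populations = sorted({row["population"] for row in labelled})
print(populations)
```

Either way, the values in the population column are just the names of the groups being compared, so a binary target such as Is_Patient or Treatment already carries that information.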
