"Have a bias toward action—let’s see something happen now. You can break that big plan into small steps and take the first step right away." ~ Indira Gandhi
![UDQSS](https://private-user-images.githubusercontent.com/169787519/387583419-4738cfb2-a00d-40c1-80d4-80b505f3161c.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkxNzA1NjAsIm5iZiI6MTczOTE3MDI2MCwicGF0aCI6Ii8xNjk3ODc1MTkvMzg3NTgzNDE5LTQ3MzhjZmIyLWEwMGQtNDBjMS04MGQ0LTgwYjUwNWYzMTYxYy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjEwJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMFQwNjUxMDBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0xMzU3YTRhOTkxYTk0ZWZlY2VhY2NkOTc1OTRmNmFlZTVkNzNhNDBiN2VmN2UxNTUzOGM0MGRjNmJlMjEyMGYxJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.5JVr0WShqYcd27eV56XtqiOUxBoWkbMs1C0OGSgkL18)
UDQSS is an open-source standard for evaluating and scoring data quality across different domains. It provides a comprehensive framework to assess, measure, and benchmark the quality of datasets using standardized criteria.
- Establish a universal standard for data quality assessment
- Enable consistent quality benchmarking across datasets
- Provide tools and templates for easy quality evaluation
- Support customization for domain-specific requirements
- Foster transparency in data quality metrics
udqss/
├── docs/ # Documentation
│ ├── parameters.md # Quality parameters
│ ├── scoring-guide.md # Scoring methodology
│ └── customization.md # Customization guide
├── templates/ # Ready-to-use templates
│ ├── udqss-questionnaire.pdf
│ └── udqss-template.xlsx
├── examples/ # Implementation examples
│ ├── healthcare-data/
│ ├── financial-data/
│ └── more-domains/
└── .github/ # Project management
- Review the parameters documentation to understand quality metrics
- Choose either the PDF questionnaire or Excel template from the
templates/
directory - Follow the scoring guide to evaluate your dataset
- Check example implementations in the
examples/
directory for reference
UDQSS evaluates datasets across multiple dimensions including:
- Data Completeness
- Data Accuracy
- Data Consistency
- Data Timeliness
- Data Validity
- Documentation Quality
- Format Compliance
Each dimension is scored on a standardized scale, with detailed scoring criteria available in the scoring guide.
- PDF Questionnaire: Print-friendly assessment form for manual evaluations
- Excel Template: An automated scoring calculator with built-in formulas
- Example Datasets: Sample evaluations across different domains
UDQSS can be adapted for specific domains while maintaining compatibility with the core standard. See customization.md for guidelines.
We welcome contributions! Please see CONTRIBUTING.md for guidelines.
Key areas for contribution:
- Additional domain-specific examples
- Template improvements
- Documentation enhancements
- Scoring methodology refinements
This project is licensed under the [LICENSE] - see the LICENSE file for details.
Complete documentation is available in the docs/
directory:
UDQSS is a community-driven project aimed at improving data quality standards across industries. Originally initiated by Opendatabay as part of their mission to advance data quality standards, the project has grown into a collaborative, open-source effort.
We extend our gratitude to all contributors who continue to support and enhance this framework, ensuring its transparency, accessibility, and universal applicability.
For more information, visit udqss.org (Coming soon)