Skip to content

Registry of published datasets in the Datasets Project

Notifications You must be signed in to change notification settings

senegalouvert-datasets/registry

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

61 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Register (list) of Data Packages including the core datasets in the Frictionless Data Project.

Currently two registers maintained here:

  • catalog-list.txt - catalog of all the community data packages we can find (at the moment largely those found on github via automatic search)
  • core-list.txt - "Core" Datasets (hand-maintained)

Preparation

Catalog List

The main Catalog list is scraped using the python script scripts/scrape.py:

# install deps
pip install -r scripts/requirements.txt
# scrape data
python scripts/scrape.py

Note we'd prefer not to scrape and use the API but we can't do the relevant query via the API - see http://developer.github.com/changes/2013-10-18-new-code-search-requirements/

Core List

To add a dataset please add it to the core-list.txt - we recommend fork and pull.

Discussion of proposals for new datasets and for incorporation of prepared datasets takes place in the issues.

To propose a new dataset for inclusion, please create a new issue.

About

Registry of published datasets in the Datasets Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%