-
Notifications
You must be signed in to change notification settings - Fork 2
First_Quixote_Conference_ _22nd 23rd_March_2010
\ --> **A meeting to create the first Open distributed repository for electronic simulations** <-- --> **Sponsored by the Hartree Centre** <-- \
<<toc></toc>>
For a general overview of the Quixote Project, please follow the links on the Front_Page.
Meeting's EtherPad is at http://okfnpad.org/quixotemeeting.
* **Primary:** build and populate a Chempound repository for outputs from electronic codes. Repository should search for chemical concepts, job meta-data and core physico-chemical properties. * **Subsidiary:** build Core CML-XML parsers for 3- 8 commonly used codes in atomistic QM and planewave simulations. Use Lensfield to scrape directories for files.
* Chemistry (not QM methodology). * practical achievement (running code and a working repository). * Opensource/opendata without access control to the repository. Repoistry-agnostic (?[[http://swordapp.org/|SWORD]],?[[http://www.openarchives.org/ore/|OAI]])
The meeting will have two main strands:
* **use case:** searching the repository for given concept over several filetypes * **preparation:** several CML files (with index data) for several codes * Files for data set (final CML, original TXT, index.rdf, etc.) aggregated under ORE * Aggregation uploaded by SWORD (METS, ZIP, etc.. as agreed) * index in simple name-value search (?RDF, ?Mongo)
* Use-case: disk populated with calculations of given kind where (say) suffix = FOO . * Arbitrary directory structure. * Lensfield trawls disk for *.foo * JumboFooConverter/Parser converts to CML (failures => null files). * conversion supported by FooDict.cml
The parsing need not be complete. The CML is used for specific data and indexing. The results COULD (if all goes well) be fed into the Chempound repo. Small Parsers can be written on-the-fly for codes if the format is pre-explored. We do not have to do the whole job.
* The codes should all have mini-dictionaries (some will be large) * The indexes will be normalized (e.g. dates, units, formulae, etc.). This means that users search a single conceptual model but the original data is not affected and can be retrieved as part of the ORE aggregation.
* [[Lensfield2]] * [[JUMBO]] * Moka * [[http://avogadro.openmolecules.net|Avogadro]] * Chempound
- NB:** - this is a draft timetable and subject to change.
The meeting will take place at STFC Daresbury Laboratory.
The closest airports are Manchester and Liverpool.
For invited attendees, accommodation has been arranged for the nights of the 21st and 22nd at the Daresbury Park Hotel
For further information about the meeting, or if you are interested in attending, please contact Jens Thomas: jens.thomas@stfc.ac.uk