Releases · IMAGO-Catalogues-Jjanes/cataloguesSegmentationOCR

05 Sep 17:52

c5e0fa4

4.1 Latest

Latest

This release contains 363 xml files, and their corresponding images from a large corpus of 19th, 20th and 21th exhibition catalogs, manuscripts'fair catalogs and directories. The new catalogs added here were created using the HTR and segmentation models accessible in the repository.
It includes a csv file describing the xml files and various tools to create a training dataset: differents bash scripts, a python programm to divide the xml files into testing, training and evaluation dataset and several fixed tests. A xsl transformation sheet is also accessible to delete the Entry and EntryEnd zones from the xml files in order to have a SegmOnto-like dataset.
The xml files has been corrected since the 4.0 release thanks to the addition of a github action (SegmOntoKraken).

Assets 2

23 Jul 13:19

Juliettejns

CatIndep

6e5ee8c

4.0

Assets 2

21 Jul 13:56

Juliettejns

NewCatalogs2

e3a70a4

3.0

Assets 2

15 Jun 09:29

Juliettejns

Dataset274

f04e170

2.0

This release contains 274 xml files, and their corresponding images from a large corpus of 19th, 20th and 21th exhibition catalogs, manuscripts'fair catalogs and directories.
It also includes several bash scripts to create various datasets and a csv file describing the xml files.

Assets 2

09 Jun 14:30

Juliettejns

Data150

89de4b8

1.0

This release contains 150 xml files, and their corresponding images from a large corpus of 19th, 20th and 21th exhibition catalogs, manuscripts'fair catalogs and directories.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: IMAGO-Catalogues-Jjanes/cataloguesSegmentationOCR

4.1

4.0

3.0

2.0

1.0