Skip to content
#

data-archiving

Here are 8 public repositories matching this topic...

Language: All
Filter by language

This Python-based repository hosts a sophisticated service designed for scraping web articles and converting them into Markdown format. The core functionality of this service includes extracting the main content of articles, such as headlines, key paragraphs, and associated images, and then seamlessly transforming this content into well-structured…

  • Updated Feb 19, 2024
  • Python

FileArchiver is a robust tool designed to safely archive outdated data from very large datasets (Terabyte size) and efficiently filter geo-data for mapping purposes. Developed for Deutsche Bahn AG, it streamlines the management of extensive geographical data to optimize storage and enhance data processing efficiency.

  • Updated Sep 26, 2024
  • Java

Improve this page

Add a description, image, and links to the data-archiving topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-archiving topic, visit your repo's landing page and select "manage topics."

Learn more