Scalable data pre processing and curation toolkit for LLMs
-
Updated
Nov 8, 2024 - Jupyter Notebook
Scalable data pre processing and curation toolkit for LLMs
80+ CLI tools to build, browse, and blend your media library: an index for your archive.
Open source project for data preparation of LLM application builders
Wikidata-based scholarly profiles
An image + data web scraper build to crawl the CarMax website and store relevant information for vehicle identification projects.
Exploration and data curation of a dataset given by a Kaggle competition (https://www.kaggle.com/dansbecker/melbourne-housing-snapshot) related to properties that were sold in Melbourne in 2016 and 2017. The meaning of this project is to prepare a well-structured matrix, so it can be used to run a model in order to estimate their prices.
This is a capstone project for the course of Business Analytics and Business Data Management at IIT Madras. The project involves analyzing sales data of Uttam Supermarket in Indore, which has 5 franchises, collected over a year. The analysis includes store-wise and monthly sales, the effect of holidays on sales, and weekly sales analysis.
Acest repo conține materiale, seturi de date și soluții care au fost folosite în cadrul Școlii de vară Astra, prima ediție, 2021
PostgreSQL code for archaeological data management
Add a description, image, and links to the datacuration topic page so that developers can more easily learn about it.
To associate your repository with the datacuration topic, visit your repo's landing page and select "manage topics."