massive-datasets
Here are 24 public repositories matching this topic...
Permite abrir e manipular arquivos massivos de texto/dados cujo seria impossivel abrir em um computador, por exemplo um arquivo de texto de +20gb, permite manipular o arquivo pegando apenas as linhas necessárias sem travar o computador por falta de memória.
-
Updated
Feb 12, 2022 - Python
Building node2vec algorithm
-
Updated
Oct 7, 2021 - Jupyter Notebook
Command line tool to quickly generate a lot of files in a lot of directories
-
Updated
Feb 18, 2022 - C++
gipa -- compression/decompression tool to package compress and encode massive archive files with floating-point data
-
Updated
Sep 14, 2017 - Python
Map Reduce program to suggest new friends based on count of mutual friends
-
Updated
Mar 2, 2018 - Java
Lab assignments for the Analysis of Massive Data Sets course @ FER, University of Zagreb
-
Updated
Jun 30, 2018 - C#
Building a Bloom Filter on English dictionary words
-
Updated
Oct 7, 2021 - Jupyter Notebook
Algorithms for Massive Datasets (AMD) -- Market-baskets analysis project
-
Updated
Sep 3, 2024 - Jupyter Notebook
-
Updated
Dec 11, 2020 - Jupyter Notebook
📺 Content Recommendation System for the Netflix Prize Challenge with Collaborative Filtering.
-
Updated
Feb 17, 2024 - Jupyter Notebook
The project is based on the analysis of the "IBM Transactions for Anti Money Laundering" dataset published on Kaggle. The task is to implement a model which predicts whether or not a transaction is illicit, using the attribute "Is Laundering" as a label to be predicted.
-
Updated
Aug 12, 2024 - Jupyter Notebook
Stream, parse, manipulate and transform extremly large data ( can be 1 GB or 1TB ) in NodeJS without any process block, memory overflow or bottle neck with peak performance. And also show it in UI with the help of webStreams
-
Updated
Jul 21, 2024 - JavaScript
TF-Package: Multiple-Input Multiple-Output Keras Data-Generator for massive and complex datasets
-
Updated
Jan 2, 2023 - Python
This repository contains a LaTeX file that generates a PDF document comprising comprehensive notes for the course "Algorithms for Massive Datasets"
-
Updated
Aug 12, 2024 - TeX
Calculate statistical measures of one column in big data Datasets with these simply Hadoop Application
-
Updated
Feb 24, 2017 - Java
University lab exercises with processing big data.
-
Updated
Nov 19, 2018 - Python
word count in Spark
-
Updated
Oct 6, 2021 - Jupyter Notebook
Building PageRank algorithm on Web Graph around Stanford.edu using NetworkX python library
-
Updated
Oct 7, 2021 - Jupyter Notebook
Improve this page
Add a description, image, and links to the massive-datasets topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the massive-datasets topic, visit your repo's landing page and select "manage topics."