massive-datasets

Stream, parse, manipulate and transform extremly large data ( can be 1 GB or 1TB ) in NodeJS without any process block, memory overflow or bottle neck with peak performance. And also show it in UI with the help of webStreams

stream buffers transform node-js massive-datasets advance-nodejs

Updated Jul 21, 2024
JavaScript

INFJakZda / Processing-Massive-Data-Sets

Star

University lab exercises with processing big data.

data-processing massive-datasets star-schema

Updated Nov 19, 2018
Python

diem-ai / google-bigquery

Star

Series of SQL exercise working with databases, using Google BigQuery to scale to massive datasets taught by educators in Kaggle.com

python bigquery sql analytics kaggle massive-datasets

Updated Jul 9, 2019
Jupyter Notebook

Alex4gtx / Massive-Data-Handler

Star

Permite abrir e manipular arquivos massivos de texto/dados cujo seria impossivel abrir em um computador, por exemplo um arquivo de texto de +20gb, permite manipular o arquivo pegando apenas as linhas necessárias sem travar o computador por falta de memória.

big-data dictionaries python-script massive-datasets manipulacao-arquivos

Updated Feb 12, 2022
Python

rajeshidumalla / node2vec

Star

Building node2vec algorithm

python data-science machine-learning numpy pandas data-analysis matplotlib massive-datasets node2vec networkx-graph

Updated Oct 7, 2021
Jupyter Notebook

arhcoder / Netflix-Recommendation

Star

📺 Content Recommendation System for the Netflix Prize Challenge with Collaborative Filtering.

python jupyter-notebook collaborative-filtering netflix recommendation-system recommendation-engine recommender-system massive-datasets netflix-prize massive-data

Updated Feb 17, 2024
Jupyter Notebook

manuparra / hadoop-statistics

Star

Calculate statistical measures of one column in big data Datasets with these simply Hadoop Application

java hadoop bigdata max avg min standardeviation massive-datasets

Updated Feb 24, 2017
Java

FedericoBruzzone / algorithms-for-massive-datasets

Star

This repository contains a LaTeX file that generates a PDF document comprising comprehensive notes for the course "Algorithms for Massive Datasets"

deep-learning algorithms recommender-system massive-datasets unimi linkanalysis

Updated Aug 12, 2024
TeX

gmalik9 / floating_point_data_compressor

Star

gipa -- compression/decompression tool to package compress and encode massive archive files with floating-point data

compression data-visualization autoencoder compressor data-compression representation representation-learning floating-point massive-datasets

Updated Sep 14, 2017
Python

rajeshidumalla / PageRank

Star

Building PageRank algorithm on Web Graph around Stanford.edu using NetworkX python library

python data-science machine-learning spark numpy pagerank-algorithm pandas data-analysis massive-datasets networkx-library

Updated Oct 7, 2021
Jupyter Notebook

rajeshidumalla / Bloom-Filter

Star

Building a Bloom Filter on English dictionary words

python data-science machine-learning bloom-filter data-analysis nltk-library massive-datasets

Updated Oct 7, 2021
Jupyter Notebook

FedericoBruzzone / anti-money-laundering

Star

The project is based on the analysis of the "IBM Transactions for Anti Money Laundering" dataset published on Kaggle. The task is to implement a model which predicts whether or not a transaction is illicit, using the attribute "Is Laundering" as a label to be predicted.

machine-learning machine-learning-algorithms pyspark massive-datasets

Updated Aug 12, 2024
Jupyter Notebook

joshuaboud / gen-dataset

Star

Command line tool to quickly generate a lot of files in a lot of directories

linux benchmarking evaluation multithreading dataset dataset-generation massive-datasets cli-tool dataset-generator

Updated Feb 18, 2022
C++

Improve this page

Add a description, image, and links to the massive-datasets topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the massive-datasets topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

massive-datasets

Here are 24 public repositories matching this topic...

rajeshidumalla / Wordcount-in-Spark

KolwaBrad / massivedataset

dhruv3 / MRbasedFriendRecommender

pero5ar / FER.AVSP

nelsonstos / bulk-load-api-multivende

Sabaudian / AMD_Market_Basket_Analysis

miguel-kjh / Machine-Translation

SJ22032003 / massive-data-streaming-nodejs

INFJakZda / Processing-Massive-Data-Sets

diem-ai / google-bigquery

Alex4gtx / Massive-Data-Handler

rajeshidumalla / node2vec

arhcoder / Netflix-Recommendation

manuparra / hadoop-statistics

FedericoBruzzone / algorithms-for-massive-datasets

gmalik9 / floating_point_data_compressor

rajeshidumalla / PageRank

rajeshidumalla / Bloom-Filter

FedericoBruzzone / anti-money-laundering

joshuaboud / gen-dataset

Improve this page

Add this topic to your repo