Toolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
-
Updated
Oct 23, 2024 - Python
Toolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
COM6012 Scalable Machine Learning - University of Sheffield
Fork and custom implementation of LineUp Library for Visual Analysis of Multi-Attribute
A quotation-based Scala DSL for scalable data analysis.
Description of work done at Merck pharmaceutical company in the summer of 2018 as a Computational Drug Discovery Intern at West Point, PA. Information excludes all proprietary information belonging to Merck & Co.
Knowledge data processing
This repository contain projects completed during my graduate study in Data Science & Analytics at the J. Mack Robinson College of Business, Georgia State University. I worked as part of a team of 4 or 6 members and we equally contributed in completing tasks and preparing final documentations (code file, report & PowerPoint presentation).
Lecture notes and other materials for a one-semester course on data mechanics.
Automated Data Scientist: An intelligent, adaptive data analysis tool that leverages AI-driven automation to dynamically plan, execute, and refine data science workflows. Automatically handles data preparation, analysis planning, code generation, and result interpretation using advanced language models.
Spark GIS (Docker + Flask Webserver + SparkGIS)
A cloud-based tool for sentiment analysis in reviews about restaurants on TripAdvisor
deprecated use lineup.js develop branch instead
Add a description, image, and links to the scalable-data-analysis topic page so that developers can more easily learn about it.
To associate your repository with the scalable-data-analysis topic, visit your repo's landing page and select "manage topics."