Pinned
-
Unsupervised-Fraud-Algorithm-on-the-NY-Property-Tax-Submission-Data
(Public) Automated, unsupervised outlier identification from 1 million+ NYC properties. Two fraud models were constructed via z-score/PCA and autoencoders, respectively, and combined to identify 100+ instan…
Jupyter Notebook
-
Bayesian_Recommender_System
(Public) Used Probabilistic Matrix Factorization (PMF), implemented in PyMC3, to recommend movies and TV shows to Netflix users. Showed that the Bayesian method outperformed baseline models.
Jupyter Notebook 2
-
DS-Take-Home-Challenges
(Public) Used linear and tree-based models and visualization techniques to solve common data science problems, including calculating conversion rates, analyzing A/B tests, churn/retention prediction, fr…
Jupyter Notebook
-
Pricing-Analytics---Hotel-Pricing-through-Casual-Inference-Analysis
(Public) I used 28 relevant attributes to price hotel rooms using causal inference analysis between price and demand. PCA and K-Means clustering were used to compare prices only among rooms with similar eno…
Jupyter Notebook 1
-
Spark-Desmontration
(Public) A demonstration of using Spark, through PySpark and SparkR, to explore large datasets. The files cover loading data, data exploration, and clustering the words in Shakespeare's works.
Jupyter Notebook
-
time-series-sales-prediction
(Public) A data challenge to predict future store sales from past store sales. The data exceeded 10 gigabytes and spanned multiple years and retail stores across the United States. Unfortunately, due to an …
Jupyter Notebook 1