Final-year major project: big data analysis of the Instacart dataset, followed by product bundle recommendation using PySpark (for clustering) and bigrams (for recommendation)
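A bigram-based recommender of the kind this description mentions could be sketched as follows. This is only an illustration, not the project's actual code: the function names `build_bigram_counts` and `recommend`, the toy `orders` data, and the choice to treat each order as an ordered list of product names are all assumptions. The sketch counts how often one product directly follows another in an order and suggests the most frequent followers.

```python
from collections import Counter, defaultdict


def build_bigram_counts(orders):
    """Count adjacent product pairs (bigrams) across all orders.

    `orders` is assumed to be a list of orders, each an ordered list of
    product names (hypothetical format, e.g. add-to-cart order).
    """
    counts = defaultdict(Counter)
    for order in orders:
        # zip pairs each product with the one added right after it
        for first, second in zip(order, order[1:]):
            counts[first][second] += 1
    return counts


def recommend(counts, product, k=3):
    """Return up to k products that most often directly follow `product`."""
    followers = counts.get(product, Counter())
    return [item for item, _ in followers.most_common(k)]


# Toy data, invented for illustration only
orders = [
    ["bananas", "milk", "bread"],
    ["bananas", "milk", "eggs"],
    ["bread", "butter"],
    ["bananas", "yogurt"],
]
counts = build_bigram_counts(orders)
top = recommend(counts, "bananas", k=2)
print(top)
```

On a dataset the size of Instacart's, the same pair counting would more plausibly run as a distributed aggregation in PySpark; the pure-Python version here only shows the idea.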
MapReduce job development, RDD programming, medical data management, sales analysis, and efficient data integration for big data analysis. Spark: big data processing, Sqoop integration, and Spark Structured Streaming for real-time data.
Clean data, optimize SQL queries, and write advanced JOINs to enhance analysis performance. Use SQL DDL to create the data schemas and apply SQL DML to migrate data from a denormalized schema to a normalized
This repo contains an algorithm that identifies vendors whose customers have a higher willingness to pay. The inherent inelasticity of these vendors is utilized as part of a price differentiation strategy called "Loved Brands"