shrivastavasatyam/README.md

Hi! I'm Satyam Shrivastava 🌟 Turning Data into Insights 💡📈

About Me 👨‍💻

I'm a Data Science graduate student at Northeastern University with 6+ years of experience in machine learning, statistics, NLP, data engineering, data visualization, and advanced analytics. I build scalable, cloud-based solutions that turn data into measurable business outcomes.

Education 🎓

Northeastern University | Khoury College of Computer Sciences
Master of Science (MS) in Data Science
Boston, MA, US
Sep 2022 - Aug 2024

Coursework: Linear Algebra, Statistics & Probability, Data Structures & Algorithms, Data Management & Processing, Machine Learning, Data Mining, Natural Language Processing (NLP), Deep Learning, Information Visualization, Large Scale Parallel Data Processing, MLOps, Large Language Models (LLM)

Work Experience 💼

Fidelity Investments
Co-op, Data Analytics and Insights
Boston, MA, US
Jan 2024 - Aug 2024

Fractal Analytics
Senior Decision Scientist
Mumbai, MH, India
Mar 2020 - Aug 2022

Netlink Software Group America Inc
Senior Analytics Engineer
Bhopal, MP, India
Oct 2019 - Mar 2020

Tata Consultancy Services
Data Engineer
Pune, MH, India
Dec 2016 - Oct 2019

Academic Experience 🏫

D'Amore-McKim School of Business | Northeastern University
Graduate Teaching Assistant
MISM 6205: Data Wrangling for Business - EDGE for PwC
Boston, MA, US
Fall 2024 Semester
Sep 2024 - Dec 2024

Khoury College of Computer Sciences | Northeastern University
Graduate Teaching Assistant
DS 5110: Introduction to Data Management and Processing
Boston, MA, US
Fall 2023 Semester
Sep 2023 - Dec 2023

Khoury College of Computer Sciences | Northeastern University
Graduate Teaching Assistant
CS 3200: Database Design
Boston, MA, US
Spring 2023 Semester
Jan 2023 - Apr 2023

Featured Projects 📂🚀

Data Engineering & Big Data 💾🌐

  1. Twitter-Followers-Patterns-Spark - Analyzing Twitter graph patterns using Apache Spark, focusing on 2-hop paths and triangles.
  2. Twitter-Followers-Patterns-MapReduce - Analyzing Twitter graph patterns using Hadoop MapReduce and join techniques.
  3. Word-Count-Spark - Implementing word count for a large text corpus using Apache Spark.
  4. Word-Count-MapReduce - Performing word count using Hadoop MapReduce.
  5. Credit-Card-Management-System - Python-based system with a 3NF-compliant SQL database for credit card management, involving 10K+ realistic customer profiles.
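
For readers unfamiliar with the graph patterns named in the Twitter projects above, here is a minimal single-machine sketch in plain Python. It is not the code from those repositories, which use Spark and MapReduce joins; the function names and the sample edge list are made up for illustration. It assumes a directed follower graph given as `(follower, followed)` pairs with no self-loops.

```python
from collections import defaultdict


def two_hop_paths(edges):
    """Count directed paths a -> b -> c by joining edges on the middle node b.

    Paths where c == a (mutual follows) are included in the count.
    """
    in_deg = defaultdict(int)
    out_deg = defaultdict(int)
    for src, dst in set(edges):  # ignore duplicate edges
        out_deg[src] += 1
        in_deg[dst] += 1
    # Every incoming edge of b pairs with every outgoing edge of b.
    return sum(count * out_deg.get(node, 0) for node, count in in_deg.items())


def triangles(edges):
    """Count directed cycles a -> b -> c -> a (assumes no self-loops)."""
    edge_set = set(edges)
    successors = defaultdict(set)
    for src, dst in edge_set:
        successors[src].add(dst)
    closed = 0
    for a, b in edge_set:
        for c in successors.get(b, ()):
            if (c, a) in edge_set:  # the closing edge completes the cycle
                closed += 1
    # Each cycle is found once per starting edge, i.e. three times.
    return closed // 3


# Hypothetical sample graph: one triangle (1, 2, 3) plus a dangling edge.
sample_edges = [(1, 2), (2, 3), (3, 1), (3, 4)]
print(two_hop_paths(sample_edges))  # 4
print(triangles(sample_edges))      # 1
```

On a cluster the same idea is typically expressed as a self-join of the edge list on the shared middle node, followed by a second join to check for the closing edge.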

NLP, Machine Learning, and Deep Learning 🤖🧠

  1. NLP-Word-Embeddings - Assessing the quality of word embeddings with t-SNE projections and word analogies.
  2. NLP-Powered-Story-Generator - Generating stories based on prompts using NLP and deep learning models.
  3. Consumer-Complaint-Tagger - Automating complaint classification for improved handling using multi-level NLP classification.
  4. CIFAR-10-Classifier - Image classification on the CIFAR-10 dataset using machine learning and deep learning techniques.
  5. Book-Recommendation-System - Creating an advanced book recommendation system using unsupervised learning methods.

Data Analysis, Exploration, and Visualization 🔍📊

  1. Vietnam-Bombings - Analyzing and visualizing the impact and details of bombings during the Vietnam War through comprehensive data exploration.

Skills 🔧

| Category | Technologies |
| --- | --- |
| Languages | Python, R, SQL, PL/SQL, NoSQL, Java, Scala, DAX, Power Query (M), Unix/Linux Shell Script, Markdown, LaTeX |
| Databases | Snowflake, BigQuery, Oracle, PostgreSQL, MySQL, DB2, MS SQL Server, MongoDB, Cassandra |
| Frameworks | Pandas, NumPy, Matplotlib, Plotly, SciKit-Learn, NLTK, TensorFlow, PyTorch, Apache Hadoop, Apache Spark, Apache Hive, Apache Kafka, Apache Airflow |
| Developer Tools | Power BI, Tableau, Alteryx Designer, Google Cloud, AWS, Azure, Adobe Analytics, GitHub |
| Management Tools | Jira, Confluence, Alation, Microsoft Office, Microsoft Excel, Microsoft PowerPoint |

Connect with me 💬📬

Personal Email
shrivastava.sat@northeastern.edu
shrivastavasatyam@hotmail.com

LinkedIn
linkedin.com/in/shrivastavasatyam

Feel free to explore my repositories and reach out for collaborations or discussions!

Pinned

  1. NLP-Powered-Story-Generator (Public)

    This project aims to generate creative and coherent stories based on given prompts using natural language processing. It also explores the effectiveness of various deep learning models to generate …

    Jupyter Notebook 1

  2. Consumer-Complaint-Tagger (Public)

    Forked from Harshan1823/complaintTag

    Automating complaint classification for the Consumer Financial Protection Bureau (CFPB), using NLP to improve efficiency in handling complaints. Project focuses on multi-level classification of pro…

    Jupyter Notebook

  3. Text-to-SQL-Generator (Public)

    This project transforms natural language text into SQL queries, enabling users to interact with databases using plain language.

    Jupyter Notebook

  4. Twitter-Followers-Patterns-Spark (Public)

    This project analyzes patterns (2-hop paths and triangles) in a Twitter graph dataset of nodes and edges using Apache Spark with various combining and join techniques.

    Makefile