Skip to content
View urvi3012's full-sized avatar
🎯
Working Hard
🎯
Working Hard
  • California, USA
  • 07:41 (UTC -08:00)

Block or report urvi3012

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
urvi3012/README.md

Hello there, I'm Urvi Jain

"Data-savvy graduate (May 2023) seeking a Data Engineer/ Analyst role to solve the company's biggest business problems"

Download my Resume

              


Technology Stack 💻


Programming Languages 👨‍💻 :


        


Frameworks & Databases 📦 :


           


Libraries 🔣 :


            Matplotlib   Selenium   Request  


Cloud Platforms ☁️ :


     


Others ➕ :


   Tableau   Tkinter   BS4  


IDEs/Editors 👨‍🔧 :


           


Version Control 🔧 :


     



Hosting 🌎 :


        


Projects 💻+🖱️


• Scraped 24000+ products under 25+ categories from Walmart with meta data to develop a “Walmart Lens”.

• Eliminated data redundancies and skewness by pre-processing and cleaning.

• Analyzed around 25000 image input through CNN model to list all the products (<=10) recognized.

• Worked on GCP’s Vertex AI and Google Cloud Vision to generate labels and detect (<=10) texts in an image.

• Labeled the products using Amazon Rekognition Custom Label API and lambda functions.

Web App


• Performed data wrangling and EDA on real-world data of 84,000+ building units details sold in NYC over a year

• Created quantile regression model with area- sale price and applied Recursive Feature Elimination to get top 10 features

• Designed a Random Forest Regressor model to compare the results with RFE

• Built Neural Network and improved its accuracy with Adam to improve performance.


• Developed a script to scrape tweets (10,000+) in real-time

• Analyzed tweets data to find out the impression a tweet makes based on keyword and usernames.

• Visualized the results to find out the extent of correlation between keywords and user using Seaborn & Matplotlib.

• Improved the efficiency of script to scrape tweets related to multiple keywords synchronously.


• Performed data cleaning, wrangling, outlier detection and stop word removal for some columns in the data.

• Determined relationship between categorical data using Chi Square method and built Random Forest Regressor with different number of estimators. Calculated model score (R-Squared) for each estimator by fitting X and Y.


Real-Time Stock Market Data Fetching and Analysis

Fetched NOPE value associated with any US stock from Nopechart in real-time using CURL, bypassing the need for API.

Fetched Indian stock market options-chain data in real-time and calculated PUT and CALL value changes.

• Formulated Open Interest per minute per strike price for any given stock, NIFTY and Bank NIFTY.

Scraped brokers trading data from DayTradeScans to gain insights and facilitate trade calls in Vietnam stock market

BsScan Transaction Alert on Mail



         


Github Contribution Streak 🔥




Github Stats  📊




Most Used Languages 📚


Note : May/ May not indicate my skill level, it is just a GitHub metric of languages I have in my commits.



Github Contributions 📈




Learning while writing ✍️


Tic Tac Toe in Python

Handwritten Digit Recognition

Simple Calculator

Dice Rolling Simulator

Text to Speech

Image Cartoonifier

Language Translator

File Manager



Connect with me and support me by starring ⭐ some of my repositories


Popular repositories Loading

  1. Working-with-Python-Revision-notebooks Working-with-Python-Revision-notebooks Public

    Revision of Python from basic to intermediate

    Jupyter Notebook 6 6

  2. Computer-Graphics Computer-Graphics Public

    This repository contains all programs made to run in Turbo CPP. These programs cover Computer Graphics from basic to 1D, 2D, and 3d, with animation and details about file formats.

    3 2

  3. Operating_System Operating_System Public

    All the basic programs depicting scheduling techniques coded in java and c++, all with their respective outputs.

    C++ 1

  4. Speakulator Speakulator Public

    This is a small web app for voice based calculator in Python build using flask web framework.

    Python 1

  5. Grammerly-Tool Grammerly-Tool Public

    Grammer checker

    Python 1

  6. breast_cancer_KNN breast_cancer_KNN Public

    Forked from Vamshi399/breast_cancer_KNN

    Breast Cancer prediction and detection using KNN and SVM

    Jupyter Notebook 1