Skip to content
View davidzajac1's full-sized avatar
β›΅
Code, Ship, Repeat
β›΅
Code, Ship, Repeat

Sponsoring

@pallets
@numfocus

Block or report davidzajac1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
davidzajac1/README.md

Header



  • πŸ’ͺ Worked at multiple start-ups ranging from bootstrapped to Series B
  • βœ… AWS Certified Solutions Architect - Associate
  • ⭐ 100% Job Success Score and only 5-star client reviews as contractor on Upwork.com
  • πŸ‘· Domain knowledge in Upstream Oil and Gas Operations


RE Data - Open Source Maintainer & Top Contributor

Stars Forks PyPI - Downloads PyPI - Version MIT License Language Language

An open source data reliability framework for the modern data stack. RE Data is a DBT package, Python library and React UI. Adding the RE Data DBT package to a DBT project will run out of the box data observability SQL queries in the background when dbt run is called. These queries calculate and store metrics like standard deviation, mean, row count, etc. The RE Data Python library can be called from the CLI to read in the stored metrics and create and serve the RE Data UI. RE Data is hosted across two GitHub repos. I am a top contributor to both and manage reviewing/merging PRs and creating releases.


packages:
  - package: re-data/re_data
    version: 0.11.0


Zillacode - Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake

Stars Forks Apache License Language Language Language Language

Created Zillacode the only platform on the internet where you can practice LeetCode like questions using PySpark, Spark, DBT and Snowflake. Zillacode.com was once a live micro-serviced B2C SAAS platform with SSO and Automated Billing but is now Open Source on GitHub.

The entire platform spins up locally with one command using Docker Compose. When deployed to the Cloud Zillacode utilizes AWS Lambda Functions built from custom Docker images that package Spark and PySpark in a way that allows answers to be returned quickly.

$ git clone https://github.com/davidzajac1/zillacode.git
$ cd zillacode
$ docker-compose up


IAMScan - CLI tool checks code for AWS IAM Privileges

Language License PyPI - Version Code style: black

IAMScan is an open source command line tool that reads your code and generates an AWS IAM policy with your needed permissions. Keeping track of AWS IAM permissions is annoying and time consuming. How often have you seen an update deployed to the cloud followed by The provided execution role does not have permissions to call CreateSomething on SomeService? IAMScan solves this issue by generating a perfectly least privileged AWS IAM Policy for all Python Files, JavaScript Files and Shell Scripts from a single command line command.

IAMScan is hosted on PyPI and is installed using pip


$ pip install iamscan


ZOil - Generate random Oil and Gas Data

Language License PyPI - Version

ZOil is a python library used to generate random Oil and Gas data. Most Oil and Gas data is either proprietary or costly to acquire. ZOil lets you quickly generate an unlimited amount of production data that can be used to for testing, anonymization and much more. ZOil was inspired by the Faker library.

ZOil is hosted on PyPI and is installed using pip


$ pip install zoil

Pinned Loading

  1. re-data/re-data re-data/re-data Public

    re_data - fix data issues before your users & CEO would discover them 😊

    HTML 1.6k 121

  2. re-data/dbt-re-data re-data/dbt-re-data Public

    re_data - fix data issues before your users & CEO would discover them 😊

    Python 99 41

  3. zillacode zillacode Public

    Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake

    Python 98 22

  4. Reptoro Reptoro Public

    A Data Visualization and Analytics Platform for the Reptile Industry

    CSS 4

  5. iamscan iamscan Public

    Checks code for needed AWS IAM Privileges

    Python 2

  6. zoil zoil Public

    Generates random Oil and Gas Data

    Python 3