Skip to content
View hiwafeizi's full-sized avatar

Block or report hiwafeizi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
hiwafeizi/README.md

Hi, I’m Hiwa 👋 I’m a Data Engineer with a background in Cognitive Science and Artificial Intelligence (Tilburg University). I focus on designing, migrating, and operating scalable data systems, with strong attention to correctness, performance, and long-term maintainability.

My work sits primarily in data engineering, with applied AI and automation used where it adds real leverage. I enjoy building robust transformation layers, validation frameworks, and metadata-driven systems that make complex data pipelines reliable and understandable. I’m particularly interested in structure, abstractions, and system design rather than one-off solutions.

💻 What I Work With Languages: Python, SQL, JavaScript, C++ Data Engineering: dbt, Snowflake, Oracle SQL, PostgreSQL, pandas, Parquet Automation & AI: scikit-learn, PyTorch, HuggingFace, OpenAI API Web & APIs: Flask, SQLAlchemy, Jinja2, Tailwind CSS Cloud & DevOps: GCP (Cloud Run, Storage, Build, Functions), Docker, GitHub Actions

🚀 Selected Projects

Allianz – Data Engineering (Professional Experience) Worked on large-scale migrations from legacy pipelines to dbt on Snowflake, building reusable macros, validation frameworks, and metadata-driven automation. Validated full row- and column-level parity for daily data loads exceeding 50 million rows and built AI-assisted, multi-language documentation systems to improve data lineage and trust.

Thesis – Perception Modeling Designed and evaluated models predicting human impressions of Dutch company names using linguistic and semantic features. Trained 80+ models and published full results and feature importance analyses.

AV-HuBERT Pipeline – Research Workshop Built preprocessing and evaluation pipelines around a pretrained AV-HuBERT model, focusing on data preparation, alignment, and reproducible experimentation for multimodal inputs.

NeoCru – Startup Platform (private) Built a data-driven recruitment platform using Flask and GCP, handling structured job data, application pipelines, and secure recruiter dashboards with AI-assisted automation.

LinkInLead – Personal Tool (private) Created a data-centric automation tool for LinkedIn content generation and scheduling, integrating APIs and cloud workflows to support scalable publishing.

Pinned Loading

  1. half_stupid half_stupid Public

    An experimental project exploring how theories from cognitive science and neuroscience can be implemented in artificial agents inside Minecraft. The goal is to build the closest functional approxim…

    Python

  2. thesis thesis Public

    Jupyter Notebook

  3. SE4CSAI-Project SE4CSAI-Project Public

    Python 2

  4. AV-HuBERT-Pipeline AV-HuBERT-Pipeline Public

    Python 1 1

  5. AI4NE AI4NE Public

    Python

  6. Battle-C Battle-C Public

    C++