Hi, I’m Hiwa 👋 I’m a Data Engineer with a background in Cognitive Science and Artificial Intelligence (Tilburg University). I focus on designing, migrating, and operating scalable data systems, with strong attention to correctness, performance, and long-term maintainability.
My work sits primarily in data engineering, with applied AI and automation used where it adds real leverage. I enjoy building robust transformation layers, validation frameworks, and metadata-driven systems that make complex data pipelines reliable and understandable. I’m particularly interested in structure, abstractions, and system design rather than one-off solutions.
💻 What I Work With Languages: Python, SQL, JavaScript, C++ Data Engineering: dbt, Snowflake, Oracle SQL, PostgreSQL, pandas, Parquet Automation & AI: scikit-learn, PyTorch, HuggingFace, OpenAI API Web & APIs: Flask, SQLAlchemy, Jinja2, Tailwind CSS Cloud & DevOps: GCP (Cloud Run, Storage, Build, Functions), Docker, GitHub Actions
🚀 Selected Projects
Allianz – Data Engineering (Professional Experience) Worked on large-scale migrations from legacy pipelines to dbt on Snowflake, building reusable macros, validation frameworks, and metadata-driven automation. Validated full row- and column-level parity for daily data loads exceeding 50 million rows and built AI-assisted, multi-language documentation systems to improve data lineage and trust.
Thesis – Perception Modeling Designed and evaluated models predicting human impressions of Dutch company names using linguistic and semantic features. Trained 80+ models and published full results and feature importance analyses.
AV-HuBERT Pipeline – Research Workshop Built preprocessing and evaluation pipelines around a pretrained AV-HuBERT model, focusing on data preparation, alignment, and reproducible experimentation for multimodal inputs.
NeoCru – Startup Platform (private) Built a data-driven recruitment platform using Flask and GCP, handling structured job data, application pipelines, and secure recruiter dashboards with AI-assisted automation.
LinkInLead – Personal Tool (private) Created a data-centric automation tool for LinkedIn content generation and scheduling, integrating APIs and cloud workflows to support scalable publishing.

