As a passionate Junior Data Engineer, I specialize in leveraging data to drive impactful insights and solutions. I thrive on solving complex data challenges and enjoy working with diverse technologies to optimize data pipelines and workflows. With a strong foundation in data engineering tools and methodologies, I am particularly interested in exploring advancements in data science, cloud computing, and big data analytics.
My goal is to contribute to innovative projects that harness the power of data to create meaningful outcomes and drive progress in various industries.
-
π« How to reach me: swisetpakdeewong@gmail.com
-
π Know about my experiences: https://www.linkedin.com/in/siriwat-wisetpakdeewong-78b641256/
-
βοΈ View my resume: Click Here
- βΉοΈ Description: Big Data Migration from GCP to Azure.
- π Achievements: Successfully migrated over 19 million rows of data from Google Cloud Storage to Azure Data Lake, ensuring data integrity with zero data loss.
- π― Technologies used:
- Processing Tools: PySpark
- GCP Services: Google Cloud Storage (GCS), BigQuery
- Azure Services: Azure Data Factory, Azure Data Lake Storage Gen 2, Databricks, Key Vault
- Others: Docker
- βΉοΈ Description: Building a Data Lakehouse using the Medallion architecture.
- π Achievements: Developed a scalable Data Lakehouse architecture using the Medallion framework, facilitating efficient data storage, processing, and analysis with seamless integration across Azure services.
- π― Technologies used:
- Processing Tools: DBT (Data Build Tool)
- Azure Services: Azure SQL Database, Azure Data Lake Storage Gen 2, Databricks, Azure Key Vault
- βΉοΈ Description: Real-time data ingestion to Cassandra using Airflow, Kafka, and Spark.
- π Achievements: Engineered a robust real-time data streaming pipeline, enabling low-latency data ingestion into Cassandra and ensuring consistent data flow and processing across multiple platforms.
- π― Technologies used:
- Processing Tools: PySpark
- Orchestration Tools: Airflow, Kafka, Zookeeper
- Monitoring: Confluent
- Storage: Cassandra
- Others: Docker