I will build a production ready etl data pipeline using AWS, airflow, and pyspark

Pakistan

I speak English

Data Engineer, AWS, Apache Airflow, Spark, PostgreSQL, ETL

I am a Data Engineer and final-year Computer Science student with hands-on professional experience building scalable ETL pipelines and data architectures. I have worked at Cognetix.io on enterprise-gr...
About this Gig

Are you drowning in raw data with no reliable way to process it?

I build production-grade data pipelines that run automatically, scale with your data, and never break silently. No spaghetti scripts. No manual steps. Just clean, reliable data exactly where you need it.


What I Build

  • ETL pipelines using Python and PySpark extract, transform, load, done
  • Apache Airflow DAGs for fully automated, scheduled workflows
  • Medallion Architecture pipelines (Bronze Silver Gold) with data quality at every layer
  • AWS data platforms S3 data lake, Glue, EMR on EKS, IAM, Terraform
  • Cloud ingestion pipelines from any source into PostgreSQL, MySQL, ClickHouse, or Supabase
  • Fully containerised setups with Docker and Docker Compose
  • One-command deployments with CI/CD no manual SSH, no runbooks

Expertise:

Big data

Data extraction

Data flow

Data manipulation

Technology:

Amazon Redshift

Apache Kafka

Apache Spark

Python

SQL

My Portfolio