I will build custom data engineering projects, ETL, ELT, data pipelines
About this Gig
Hello there!
I specialize in building end-to-end ETL/ELT pipelines across Microsoft Azure, Microsoft Fabric, and AWS, turning raw datasets into a clean, high-performance single source of truth.
Along with being a Microsoft-certified data professional, I work with Python, Spark (PySpark, SparkSQL), SQL, ClickHouse, NoSQL, dbt, Airbyte, and Dagster to deliver scalable data solutions ready for Power BI or Tableau dashboards and reports.
What I offer:
- ETL/ELT pipelines with medallion architecture (Bronze / Silver / Gold)
- Microsoft Fabric: Lakehouses, Delta Lake, Dataflows & Notebooks
- Azure & AWS cloud data integrations
- Data modeling: Star Schema & relational design for BI
- Workflow orchestration with Dagster and transformations with dbt
- API integrations and incremental loading patterns
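To illustrate the Bronze / Silver / Gold idea from the list above, here is a minimal sketch in plain Python. The function names, the sample order rows, and the "revenue per country" output are all assumptions made for illustration; a real project would use Spark, dbt, or Fabric notebooks rather than in-memory lists.

```python
def bronze(raw_rows, load_date):
    # Bronze: land rows unchanged, adding only load metadata.
    return [{**row, "_loaded_on": load_date} for row in raw_rows]


def silver(bronze_rows):
    # Silver: drop rows without an order id, clean types and text,
    # and de-duplicate on order_id (the last occurrence wins).
    cleaned = {}
    for row in bronze_rows:
        order_id = row.get("order_id")
        if order_id is None:
            continue
        cleaned[order_id] = {
            "order_id": order_id,
            "country": row["country"].strip().upper(),
            "amount": float(row["amount"]),
        }
    return list(cleaned.values())


def gold(silver_rows):
    # Gold: aggregate into a reporting-ready table (revenue per country).
    totals = {}
    for row in silver_rows:
        totals[row["country"]] = totals.get(row["country"], 0.0) + row["amount"]
    return totals


# Hypothetical raw input with one duplicate and one invalid row.
raw = [
    {"order_id": 1, "country": " us ", "amount": "10.50"},
    {"order_id": 1, "country": "US", "amount": "10.50"},
    {"order_id": 2, "country": "de", "amount": "4.00"},
    {"order_id": None, "country": "fr", "amount": "9.99"},
]

report = gold(silver(bronze(raw, "2024-01-01")))
print(report)
```

Each layer only reads from the one before it, so a bad transformation can be fixed and replayed from Bronze without going back to the source system.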
Why choose me?
- Microsoft-certified data professional: DP-700, DP-600, PL-300
- Hands-on across Azure, Fabric, and AWS
- Clean, documented code optimized for speed and scalability
- I turn complex architecture into insights your business can act on
Have a project in mind and need some extra help?
Feel free to reach out. I am happy to talk through your requirements!
FAQ
What kinds of data pipelines can you build?
I build batch and incremental ETL/ELT pipelines that pull data from APIs, databases, flat files, and cloud storage, then transform and load it into warehouses or lakehouses such as Microsoft Fabric, Azure Data Lake, or SQL-based systems. I choose the tooling (for example dbt, Airbyte, Dagster, or Spark) based on the scenario.
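As a rough idea of what "incremental" means here, the sketch below shows one common pattern: a watermark on an `updated_at` column combined with an upsert by primary key. The column names, the sample rows, and the `incremental_load` helper are hypothetical, and the target is just a dictionary standing in for a warehouse table.

```python
def incremental_load(source_rows, target, watermark):
    """Upsert rows changed after `watermark` into `target`.

    Returns the new watermark and the number of rows written.
    Timestamps are ISO 8601 strings, which sort chronologically.
    """
    new_watermark = watermark
    written = 0
    for row in source_rows:
        if row["updated_at"] > watermark:
            target[row["id"]] = row  # upsert by primary key
            new_watermark = max(new_watermark, row["updated_at"])
            written += 1
    return new_watermark, written


# Hypothetical source table.
source = [
    {"id": 1, "name": "Ada", "updated_at": "2024-01-01T00:00:00"},
    {"id": 2, "name": "Bob", "updated_at": "2024-01-02T00:00:00"},
]
target = {}

# First run: an empty watermark loads everything.
watermark, first_run = incremental_load(source, target, "")

# One row changes in the source system.
source[0] = {"id": 1, "name": "Ada L.", "updated_at": "2024-01-03T00:00:00"}

# Second run: only the changed row is written.
watermark, second_run = incremental_load(source, target, watermark)
print(first_run, second_run, watermark)
```

The watermark is stored between runs, so each run reads only what changed since the previous one instead of reloading the full table.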
Do you work with Microsoft Fabric?
Yes. I am certified in both Fabric Data Engineering (DP-700) and Fabric Analytics Engineering (DP-600). I can set up lakehouses, build medallion architecture (Bronze/Silver/Gold), configure Dataflows, and connect everything to Power BI.
Do you connect pipelines to Power BI or Tableau?
Yes. I can model your data into a Star Schema and connect it directly to Power BI or Tableau so your dashboards refresh automatically.
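For readers unfamiliar with the term, a Star Schema splits flat records into a central fact table and surrounding dimension tables linked by keys. The sketch below is a simplified illustration with a single customer dimension; the table names, columns, and sample sales rows are assumptions, not taken from any client project.

```python
def build_star(flat_rows):
    """Split flat sales rows into a customer dimension and a sales fact table."""
    dim_customer = {}  # customer name -> surrogate key
    fact_sales = []
    for row in flat_rows:
        # Assign the next surrogate key the first time a customer appears.
        key = dim_customer.setdefault(row["customer"], len(dim_customer) + 1)
        fact_sales.append({"customer_key": key, "amount": row["amount"]})
    return dim_customer, fact_sales


# Hypothetical flat export, one row per sale.
flat = [
    {"customer": "Acme", "amount": 100},
    {"customer": "Globex", "amount": 50},
    {"customer": "Acme", "amount": 25},
]

dim_customer, fact_sales = build_star(flat)
print(dim_customer)
print(fact_sales)
```

BI tools such as Power BI or Tableau can then relate `fact_sales` to `dim_customer` on `customer_key`, which keeps the fact table narrow and the descriptive attributes in one place.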
How long does a typical project take?
It depends on complexity. A straightforward pipeline with one or two sources can take 3–5 days. A full medallion architecture with multiple sources, transformations, and a BI layer takes longer, anywhere from several more days to a few weeks. I will give you a clear timeline after reviewing your requirements.
What do you need from me to get started?
A description of your data sources (APIs, databases, files), where you want the data to land, and what the end output should look like. We can figure out the rest together.
