I will design big data models and ETL pipelines using PySpark and Databricks

India

I speak Hindi, Gujarati, English

34 orders completed

Data Engineering Expert and Cloud Solutions Architect

Experienced Azure data engineer with 13+ years building scalable data solutions using Microsoft Fabric, Azure Data Factory (ADF), Azure Data Lake, and Synapse Analytics. I also work across Snowflake, ...

About this Gig

Process petabyte-scale data quickly with optimized PySpark models and Databricks pipelines that scale out as your data grows.


Overwhelmed by massive datasets that crash traditional systems? Need real-time processing that handles billions of records effortlessly? You've found your big data architect.


What You'll Get:

  • Scalable PySpark data models and transformations
  • Optimized Databricks cluster configurations
  • Delta Lake architecture for ACID transactions
  • Real-time and batch processing pipelines
  • Performance-tuned Spark SQL queries
  • Cost optimization strategies and monitoring setup


My Big Data Expertise:

With 13+ years architecting Spark solutions, I've built pipelines processing 500+ TB daily for tech giants, achieving 10x performance improvements through advanced optimization techniques and cluster tuning.


Technologies I Master:

  • Platforms: Databricks, Apache Spark, Delta Lake, MLflow
  • Languages and APIs: Python (PySpark), Scala, Spark SQL
  • Optimization: Catalyst optimizer, partitioning, caching strategies

Language:

English

Technical expertise:

Apache Spark

Databricks

Snowflake

Expertise:

Data Pipelines

ETL Development

Data Warehousing

Industry:

Data analytics

Financial services

Other Data Engineering Services I Offer