Are you looking for an experienced Data Engineer to build your data pipeline in the cloud of your choice? You're in the right hands!
I'm a data engineer with 10 years of development experience in the industry. I have a proven track record of building and maintaining scalable, efficient data
pipelines using Databricks, PySpark, dbt, and AWS Glue.
My areas of expertise include:
- Creating data architecture and strategy for your organization
- Batch/Streaming ETL jobs in the cloud (AWS, Databricks)
- Data processing using PySpark
- Building data pipelines using Lakehouse architecture
- Building ELT data pipelines using dbt (Data Build Tool)
- Data quality with PyDeequ and Great Expectations
- Data Modelling (Dimensional Modelling)
- Data warehousing (Snowflake, BigQuery, Redshift)
- Orchestration using Apache Airflow and AWS Step Functions
- Data Lake (AWS S3, Google Cloud Storage)
- Apache Kafka
- Spark Streaming
- AWS Athena, AWS Kinesis, AWS Glue Data Catalog
- AWS DMS
- GCP Dataproc