· Build scalable, production-grade ETL/ELT pipelines using Databricks (PySpark, Spark SQL, Delta Live Tables, Workflows).
· Ingest structured, semi-structured, and streaming data into Bronze, Silver, and Gold layers.
· Develop optimized transformations, data quality rules, and reusable framework components.
· Strong usage / proven experience of leveraging inbuilt AI Genie code for faster delivery.
· Implement best practices for job orchestration, monitoring, alerting, and automation.
· Hands-on experience: Spark, Delta Lake, Workflows, Unity Catalog.
· Strong SQL programming and performance tuning skills.
· Experience with cloud environments (AWS/Azure/GCP).
· Experience with modern data lakehouse concepts and distributed systems.
Data Modeling:
· Design and implement dimensional models (star/snowflake) for analytical workloads.
· Apply normalization/denormalization strategies for performance and usability.
· Create physical data models aligned to Delta Lake and Medallion architecture.
· Ensure data quality, integrity, and alignment with enterprise governance policies.
Technology Stack:
· Advanced hands-on Databricks experience: Spark, Delta Lake, Workflows, Unity Catalog.
· Strong SQL programming and performance tuning skills.
· Experience with cloud environments (AWS/Azure/GCP).
· Experience with modern data lakehouse concepts and distributed systems.
Required Qualifications:
· 7–10+ years of experience in data engineering.
· Proven experience building and deploying large-scale Databricks pipelines.
· Strong understanding of Medallion architecture (Bronze/Silver/Gold).
· Proficiency with PySpark, SQL, ETL/ELT frameworks, and Delta Lake optimizations.
· Strong experience working with CI/CD, Git, and job orchestration tools.
· Deep understanding of data modeling principles for warehousing and analytics.
Preferred Skills:
· Knowledge of data governance, metadata management, and Unity Catalog.
· Experience with streaming technologies (Auto-Loader, Structured Streaming).
· Background in data security, compliance, and access models.
· Knowledge of performance tuning and cost optimization in cloud environments.
· Experience with tools like Airflow, Databricks Workflows, dbt, or similar.
Role Descriptions:
