82686
Permanent/Direct Hire
3 days ago
Job Title: Principal Data Engineer
Location: Toronto, ON (Onsite)
Estimated Duration: Fulltime
Principal Data Engineer
Lead the design, development, and implementation of scalable data pipelines on Azure Databricks and ADLS. This role requires expertise in Azure, PySpark, Azure SQL, CDC, and ADF to enable efficient and near real-time data movement from SQL Data Warehouses to an Enterprise Data Lake. The ideal candidate has experience architecting incremental data ingestion solutions, optimizing Databricks pipelines, and enforcing data governance best practices.
Responsibilities
- Architect and implement an end-to-end data platform on Azure Data Lake and Databricks
- Design incremental ingestion pipelines using ADF, CDC, and Databricks Auto Loader
- Build and optimize PySpark workflows for batch and streaming data processing
- Manage Delta Lake architecture (Bronze, Silver, Gold layers)
- Lead performance tuning and cost optimization of Databricks and Azure SQL workloads
- Enforce security, access controls, and data governance practices
- Collaborate across business, data science, and BI teams
- Mentor engineers and set best practices across the data team
Requirements
- 10+ years in data engineering with a strong Azure cloud focus
- Deep expertise in Databricks, PySpark, and Delta Lake
- Proven experience with ADF, Azure SQL, and CDC methods
- Skilled in both batch and streaming architectures
- Strong SQL capabilities and performance tuning
- Knowledge of cloud security, governance, and compliance
- Experience leading teams in Agile environments
Preferred Qualifications
- Terraform experience (nice to have)
- Certifications in Azure Data Engineering, Databricks, or Terraform
- Background in high-scale enterprise data environments
- Familiarity with Power BI integration with Databricks/ADLS