We are seeking a highly skilled Data Architect with a strong background in Azure Databricks , data modeling , performance optimization , and data security . The ideal candidate will be responsible for designing and implementing sophisticated data architectures and models while optimizing data flows and ensuring data security. This role also requires leadership skills, the ability to manage and mentor junior team members, and effective communication with cross-functional teams.
Key Responsibilities Data Architecture and Modeling:
Design and implement Medallion Architecture for data pipelines and solutions using Azure Databricks .
Develop and optimize data models including Star Schema , Snowflake Schema , and Data Vault , while understanding their advantages, disadvantages, and partitioning techniques.
Lead the design and implementation of data architecture solutions ensuring data security , scalability, and high performance.
Provide data security strategies for various Azure-based services ensuring row-level security and authorization management.
Performance Optimization:
Focus on performance optimization techniques for data processing and analytics in Azure Databricks , ensuring efficiency and scalability of solutions.
Perform query optimization using SQL , advanced SQL queries , and techniques for enhancing the overall performance of data models.
Development and Automation:
Work with Python and PySpark to create scalable, efficient, and performant data processing pipelines.
Utilize Python libraries such as Pandas , NumPy , Matplotlib , and SciPy for data manipulation, analysis, and visualization.
Collaborate on CI/CD implementation, managing branching strategies , and using Github for version control.
Leadership and Collaboration:
Lead a team of junior data engineers, providing mentorship and guidance to ensure project success.
Break down larger tasks into manageable work packages and estimate effort and time requirements accurately.
Collaborate with business teams to understand the requirements and translate them into actionable technical solutions, particularly in the areas of product supply and production .
Additional Responsibilities:
Leverage Azure Data Factory and other Azure services like Key Vault , Power Apps , and Container Apps as part of the data architecture solutions.
Ensure data governance and manage the implementation of row-level security in Azure environments.
Keep up-to-date with best practices, emerging trends, and technologies in the field of data architecture and cloud-based solutions.
Required Skills
Azure Databricks and experience in implementing solutions using it.
Data modeling techniques including Star Schema , Snowflake Schema , and Data Vault .
Strong expertise in SQL and advanced SQL query optimization .
Experience with Python , PySpark , and Python libraries like Pandas , NumPy , Matplotlib , and SciPy .
Knowledge of Azure Data Factory , Azure Key Vault , and other Azure services (such as Container Apps and Power Apps ).
Experience with CI/CD , branching strategies , and Github .
Strong understanding of data security and row-level security in Azure.
Excellent communication skills with the ability to lead teams and break down complex tasks.
Ability to estimate efforts and time required for data architecture projects accurately.
Business/Data knowledge in the product supply/production domain is highly preferred.
Nice-to-Have Skills
Experience with Snowflake (the platform, not just the data model).
Familiarity with Azure-based containerized solutions .
Job Classification
Industry: IT Services & Consulting Functional Area / Department: Engineering - Software & QA, Role Category: DBA / Data warehousing Role: Data warehouse Architect / Consultant Employement Type: Full time