Position Title: Data EngineerLocation: London / Hybrid
Job Summary:My client, an AI SaaS Platform are seeking a skilled and motivated Data Engineer to join the team. The ideal candidate will design, develop, and maintain robust data pipelines and systems that support our organization's data-driven decision-making. You will leverage your expertise in Azure, Python, SQL, Databricks, and data modeling to ensure efficient, scalable, and high-quality data delivery across various platforms.
Key Responsibilities:
- Design, develop, and optimize ETL/ELT pipelines for data integration, transformation, and migration using Azure Data Factory, Databricks, and other Azure services.
- Develop and implement data models (conceptual, logical, and physical) to support business intelligence, analytics, and operational needs.
- Collaborate with data analysts, data scientists, and business stakeholders to understand data requirements and deliver solutions that meet organizational objectives.
- Create and maintain robust SQL scripts for querying, data manipulation, and performance tuning.
- Implement and maintain data solutions using Azure Data Lake, Azure Synapse Analytics, and Azure Storage.
- Ensure the security, privacy, and governance of data through proper implementation of data policies and standards.
- Monitor, troubleshoot, and optimize existing data pipelines to ensure high performance and reliability.
- Develop reusable, efficient, and well-documented Python scripts for data manipulation, automation, and analysis.
- Work with large datasets and ensure scalability and efficiency in processing using Databricks and distributed computing frameworks.
- Provide technical support and mentorship to team members regarding best practices in data engineering.
Qualifications:
Required Skills and Experience:
- Bachelor's degree in Computer Science, Information Systems, or a related field (or equivalent experience).
- Proven experience as a Data Engineer or in a similar role.
- Expertise in Microsoft Azure services such as Azure Data Factory, Azure Databricks, Azure Synapse Analytics, and Azure Data Lake Storage.
- Proficiency in Python for data processing and automation tasks.
- Advanced knowledge of SQL, including query optimization and database management.
- Strong understanding of data modeling principles and experience in creating and implementing data models.
- Experience working with big data technologies and distributed processing.
- Knowledge of version control systems (e.g., Git) and CI/CD pipelines for data solutions.
Preferred Skills:
- Experience with Power BI or other visualization tools for reporting purposes.
- Familiarity with Spark for distributed data processing.
- Knowledge of cloud cost management and optimization strategies.
- Certifications in Azure (e.g., Azure Data Engineer Associate) are a plus.