SonicJobs Logo
Left arrow iconBack to search

Data Engineer

The Bridge IT Recruitment
Posted 17 hours ago, valid for 10 days
Location

Teversham, Cambridgeshire CB1, England

Salary

£40,000 - £48,000 per annum

info
Contract type

Full Time

By applying, a CV-Library account will be created for you. CV-Library's Terms & Conditions and Privacy Policy will apply.

Sonic Summary

info
  • We are looking for a Senior Bioinformatics Data Engineer with a strong background in Bioinformatics and a minimum of 5 years of relevant experience.
  • The successful candidate will support the cBioPortal operations by developing and maintaining ETL pipelines for bioinformatics data.
  • Key responsibilities include ensuring the reliability of data systems, troubleshooting data integration issues, and collaborating with various stakeholders.
  • Candidates should have extensive experience with Python and bioinformatics visualization systems, particularly cBioPortal.
  • The salary for this position is competitive and commensurate with experience.

Senior Bioinformatics Data Engineer – cBioPortal Specialist

We are seeking a highly skilled and experienced Senior Bioinformatics Data Engineer to join our clients dynamic team and support the operation and engineering needs of cBioPortal. In this role, you will significantly impact the delivery of bioinformatics data engineering and visualizations to the Oncology R&D organisation. Your work will be central to advancing our data stack, as well as our automation and observability capabilities.

*** NB we require candidates who have demonstrable experience in Bioinformatics / have a Pharmaceuticals industry background ***


Main Duties and Responsibilities
• Develop, execute, and maintain ETL pipelines for extracting, transforming, and loading data for use in cBio and other bioinformatics analysis and visualizations
• Ensure the reliability, scalability, and performance of ETL pipelines and data systems
• Troubleshoot and resolve issues related to data loading and integration into downstream systems
• Collaborate with bioinformaticians, data scientists and other stakeholders to understand and meet the data needs and requirements of the organization
• Stay up-to-date with new technologies and best practices in bioinformatics data engineering
Essential Requirements
• A background in Computer Science, Engineering, or Bioinformatics (Master level) with 5 years of relevant experience
• Familiar with bioinformatics visualizations in different omics domains including genomics, transcriptomics, proteomics, DNA methylation, etc
• Extensive experience with Python and Python data/scientific libraries like pandas, numpy/scipy, polars, etc
• Proven experience with bioinformatics visualization systems like cBioPortal, including data loading and troubleshooting
• Strong understanding of ETL processes and data pipeline development
• Ability to interact with various data sources, both structured and unstructured (e.g. HDFS, SQL, noSQL)
• Experience working across multiple scientific compute environments to create data workflows and pipelines (e.g. HPC, cloud, Unix/Linux systems)
Desirable:
• Experience with deploying data pipelines using orchestration services like Airflow, Prefect, AWS Glue, Dagster, etc
• Experience using AWS services such as S3/EBS, EC2, CloudWatch, SNS, and Lambda.
• Understanding of software development, testing and quality processes with experience with testing frameworks and documentation
• Expertise with biological/health data, especially genomics and other *omics technologies.
• Ability to understand, map, integrate, and document complex data relationship and business rules.

Apply now in a few quick clicks

By applying, a CV-Library account will be created for you. CV-Library's Terms & Conditions and Privacy Policy will apply.