
Senior Data Engineer - (Genetics) Maternity Cover - 12 months FTC
- London
- £74,000-78,000 per year
- Permanent
- Full-time
- Support the build of production-level data pipelines from data providers to our primary data store and Trusted Research Environment. Work closely with the Lead Data Engineer on key designs and features.
- Build and maintain pipelines which meets the requirements for our end users and builds well curated, accessible and quality controlled data for analysis.
- Keep abreast of best practice in data engineering across industry, research and Government and facilitating the adoption of standards. Work to promote the adoption of best practises across the squad (unit testing, CI/CD).
- Work with our Science team and Product to understand the data requirements and work with them to deliver the data needed for their projects.
- Experience working in an agile development team.
- Comfortable building and maintaining robust, scalable and efficient data pipelines that run in the cloud. Capable of processing very large amounts of data being received daily based on feeds from multiple systems using a range of different technologies.
- Can listen to the needs of technical and business stakeholders and interpret them, and effectively manage stakeholder expectations. Can write ODPs/RFCs equivalent and drive discussions within the squad and help the Lead Data Engineer supervise/drive specific initiatives of work.
- Strong experience working with genetic data (ideally genotype and imputation data). Detailed understanding of common bioinformatics file formats (VCF, BAM/CRAM, GTC, FastQ etc) and accompanying tools (bcftools, PLINK, QCtools etc)
- Experience in validating and QC'ing complex genomic datasets.
- Highly proficient in Python with solid command line knowledge and Unix skills.
- Highly proficient working with cloud environments (ideally Azure), distributed computing and optimising workflows and pipelines.
- Experience working with common data transformation and storage formats, e.g. Apache Parquet, Delta tables.
- Strong experience working with containerisation (e.g. Docker) and deployment (e.g. Kubernetes).
- Experience with Spark, Databricks, data lakes.
- Highly proficient in working with version control and Git/GitHub.
- Awareness of data standards such as GA4GH (
- Competitive salary starting from £74,000
- Generous Pension Scheme – We invest in your future with employer contributions of up to 12%.
- 30 Days Holiday + Bank Holidays – Enjoy a generous holiday allowance with the flexibility to take bank holidays when it suits you.
- Enhanced Parental Leave – Supporting you during life’s biggest moments.
- Career Growth & Development – £500 per year to spend on Learnerbly, our learning platform, plus regular appraisals and development opportunities.
- Cycle to Work Scheme – Save 25-39% on a new bike and accessories through salary sacrifice.
- Home & Tech Savings – Get up to 8% off on IKEA and Currys products, spreading the cost over 12 months through salary sacrifice
- £1,000 Employee Referral Bonus – Know someone amazing? Get rewarded for bringing them on board!
- Wellbeing Support – Access to Mental Health First Aiders, plus 24/7 online GP services and an Employee Assistance Programme for you and your family.
- A Great Place to Work – We have a lovely Central London office in Holborn, and offer flexible and remote working arrangements.
We are sorry but this recruiter does not accept applications from abroad.