Data Engineer
- Bath
- Permanent
- Full-time
- Design, build, and optimize data pipelines with Python.
- Streamline and scale data workflows using Metaflow, Prefect, AWS and proprietary data services
- Collaborate with data scientists, analysts, and developers to ensure seamless data flow and implementation of bespoke company methodologies
- Improve data quality, reliability, and accessibility across teams.
- Deliver high-quality, maintainable well-tested code that meets business requirements
- Enable a consistent approach to our data production pipeline
- A proven track record of developing in Python
- An ability to meet pipeline requirements through an applied understanding of good data acquisition, transformation and manipulation techniques
- Established experience of building ETL frameworks and tooling
- Knowledge of common Python data analysis libraries (numpy, pandas)
- Familiarity with Agile software development practices
- Good understanding of git and working collaboratively on team-level code base
- Familiarity working with geospatial data within Python (GDAL, rasterio, shapely)
- Knowledge of cloud technologies and platforms such as AWS
- Experience using docker or other container orchestration technologies
- Experience of the Linux command line and basic Linux server administration skills