
Principal Data Engineer
- London
- Permanent
- Full-time
- Design and lead the implementation of real-time data pipelines which transport billions of data points per day, with strong traffic variations around peak hours
- Design and deploy state-of-the-art ML (predominantly NLP and voice recognition) models that are optimised for both accuracy and throughput
- Transform prototypes into production-ready data and ML applications that meet throughput and latency requirements
- Deploy and manage data and ML infrastructure necessary for productionising code (Kafka, Docker, Terraform, etc)
- Build efficient data pipelines between on-premise and cloud environments to handle text and audio data processing loads for ML models
- Deploy NLP models in cloud environments (AWS SageMaker) through Jenkins
- Design and implement MLflow and other ML Ops applications to streamline ML workflows which adhere to strict data privacy and residency guidelines
- Communicate your work throughout the team and related departments
- Mentor and guide junior members of the team, establish and champion best practices and introduce fresh ideas and concepts
- 10+ years of experience working on data processing and engineering for ML models, with 6+ years developing large-scale data and ML systems twhich receive billions of requests per day
- Expert level know-how of designing and implemention synchronous, asynchronous and batch data processing operations
- Expert level programming skills in Python, along with experience in using relevant tools and frameworks such as PyTorch, FastAPI and Huggingface; strong programming skills in Java are a plus
- Expert level know-how of ML Ops systems, data pipeline design and implementation, and working with ML platforms (preferably AWS SageMaker)
- Strong analytical and problem-solving abilities, with a keen eye for detail and accuracy
- Curiosity and a strong growth mindset with a demonstrable history of learning quickly in a loosely structured, rapidly changing environment
- Excellent collaboration and communication skills
- At least a bachelor's degree in computer science or other relevant fields
- Fosters a culture of collaboration, communication, performance and learning
- Drives innovation and creativity within and between teams
- Introduces employees to priorities outside of their immediate realm
- Ensures important interpersonal relationships and connections with one another and our community!