Software Engineer, Applied ML
Brave
- London
- £100,000-125,000 per year
- Permanent
- Full-time
- Evaluate, integrate, and deploy state-of-the-art language models for Leo and other browser AI capabilities, including both cloud-based and on-device deployment scenarios
- Design, optimize, and maintain ML inference pipelines for browser-integrated AI features, with focus on reducing deployment costs and improving model performance
- Develop and train custom ML models for browser-specific use cases such as content classification and search optimization using techniques like LoRA and DPO, including distributed training setups
- Generate synthetic data for training data augmentation and model evaluation
- Collaborate with browser engineering teams to seamlessly integrate AI capabilities into core product features while maintaining performance and privacy standards
- Collaborate with product and design teams to define, prototype, and ship new AI-powered features including text-to-speech, image generation, and enhanced tool calling capabilities
- Implement and optimize model serving infrastructure using frameworks like vLLM, ONNX Runtime, and Nvidia Triton to achieve production-scale performance requirements
- Collaborate with DevOps teams on MLOps infrastructure including model monitoring, load testing, caching optimization, and automated CI/CD pipelines for model deployments
- Contribute to privacy-preserving ML approaches and on-device model implementations that align with Brave's privacy-first mission
- 2 to 5 years of experience optimizing and deploying ML models in production environments
- Strong software engineering background with production experience
- Extensive experience with PyTorch or other modern ML frameworks
- Experience training custom models from scratch
- Experience with model optimization and inference frameworks (e.g., vLLM, ONNX Runtime, Nvidia Triton)
- Familiarity with MLOps practices & Kubernetes and ability to collaborate with DevOps teams on model monitoring, load testing, and CI/CD pipelines
- Experience shipping ML-powered features or systems (consumer applications preferred)
- Master's degree in Computer Science, Machine Learning, or related field
- Familiarity with LLM serving frameworks (vLLM, TGI, Ray Serve) and GPU optimization
- Experience with embeddings, vector databases, semantic search implementations, model training workflows, and data pipeline development
- Experience integrating LLMs with tool calling/MCP
- Knowledge of privacy-preserving ML techniques and on-device model deployment
- Previous work on cost optimization and performance tuning of ML systems at scale
- Deep curiosity about emerging AI models and their practical applications
- Strong problem-solving skills with ability to work in ambiguous environments
- Excellence in cross-functional collaboration and technical communication
- Drive to make AI technology more accessible through the browser
- Pragmatic approach to balancing innovation with shipping products
- Opportunity to shape the future of AI-powered browsing experiences
- Work with cutting-edge technology and state-of-the-art ML tools
- Competitive compensation with room for growth
- Great international exposure and team atmosphere
- Flexible work location with preference for London office