Ca...
Senior Machine Learning Engineer - Agents data
By CanvaDescription
Canva seeks a Machine Learning Engineer to build data pipelines and tooling for multimodal agent research. Collaborate with researchers to turn ideas into trainable reality.
Experience Level
Senior
Responsibilities
- Design and build data pipelines for agent training
- Develop tooling for dataset construction
- Own data quality and build validation frameworks
- Create evaluation datasets and benchmarks
- Build and maintain infrastructure for data loading and storage
- Collaborate with research scientists to translate requirements
- Document datasets thoroughly
- Profile and optimize research code for efficiency
- Elevate codebase quality through reviews and best practices
- Contribute to team roadmaps by identifying data bottlenecks
Requirements
- Strong software engineering skills in Python
- Experience building production-grade data pipelines and ML DevOps
- Practical experience with prompt engineering
- Experience with ML data workflows and large-scale data processing
- Hands-on experience with data pipelines for distributed ML training
- Familiarity with annotation tooling and human-in-the-loop data collection
- Understanding of ML training requirements
- Experience with cloud infrastructure and distributed storage systems
- Strong communication skills
- Collaborative approach and ownership
Nice to Have
- Experience with preference data collection for RLHF
- Familiarity with multimodal data
- Experience building synthetic data generation pipelines
- Background in data quality metrics and monitoring systems
- Contributions to dataset releases or benchmarks in the ML community
Opportunity Details
job
London