Founding Machine Learning Engineer
Research & Engineering • Full-time • Remote / Hybrid
About the Role
We are looking for a Founding Machine Learning Engineer to design and train state-of-the-art models for voice conversion and accent transformation. You will work on the bleeding edge of audio generative AI, solving challenges in prosody preservation, low-latency inference, and few-shot learning.
Responsibilities
- Research and implement novel architectures for fine-grained voice conversion (VC) and text-to-speech (TTS).
- Optimize model inference for low-latency real-time applications using ONNX, TensorRT, or quantization techniques.
- Build data pipelines for processing large-scale audio datasets.
- Collaborate with backend engineers to deploy models into production environments.
- Stay up to date with the latest research in audio synthesis and generative models.
Requirements
- Strong demonstrated experience with PyTorch or JAX.
- Background in audio signal processing and speech synthesis (TTS/VC).
- Experience training large models on multi-GPU clusters.
- Familiarity with model optimization and deployment for production.
- Published research or significant open-source contributions in the field are a plus.