Quantum Insider

About the Role

We are looking for a Founding Machine Learning Engineer to design and train state-of-the-art models for voice conversion and accent transformation. You will work on the bleeding edge of audio generative AI, solving challenges in prosody preservation, low-latency inference, and few-shot learning.

Responsibilities

Research and implement novel architectures for detailed voice conversion (VC) and text-to-speech (TTS).
Optimize model inference for low-latency real-time applications using ONNX, TensorRT, or quantization techniques.
Build data pipelines for processing large-scale audio datasets.
Collaborate with backend engineers to deploy models into production environments.
Stay up-to-date with the latest research in audio synthesis and generative models.

Requirements

Strong demonstrated experience with PyTorch or JAX.
Background in audio signal processing and speech synthesis (TTS/VC).
Experience training large models on multi-GPU clusters.
Familiarity with model optimization and deployment for production.
Published research or significant open-source contributions in the field is a plus.

Founding Machine Learning Engineer

About the Role

Responsibilities

Requirements

Apply for this position