Speech AI Engineer

Speech AI Engineer

Speech AI Engineer

Upwork

Upwork

Remoto

3 hours ago

No application

About

Hello, We’re hiring a Speech AI Engineer to build a voice-to-voice NLP system that can adapt to accents, preserve tone/emotion, and output in any 200+ languages. This contract, if successfully completed, is a stepping stone for you to be offered a full-time position with our AI startup with UK Visa sponsorship once we secure investment by end of the year. What You’ll Do - Build a speech-to-speech translation pipeline: - ASR (Automatic Speech Recognition) with strong accent handling. - Tone/context-aware translation. - TTS (Text-to-Speech) with natural output that adapts to a user’s voice (via short sample). - Deliver a working demo within 1 week of contract start, with daily/regular review sessions with the founder for feedback. - Support our engineering team with app integration (Flutter/web). - Demo the technology internally and in up to 3 investor meetings. - Deliver a comparative report showing how this system outperforms existing APIs (Google Speech, Amazon Polly, Whisper, etc.). Post-funding: opportunity to continue full-time with UK sponsorship. What We’re Looking For - Strong track record with speech AI systems (ASR, TTS, speech-to-speech). - Experience with accent adaptation, prosody modeling, and voice cloning (Whisper, ESPnet, Coqui TTS, VALL-E, YourTTS, etc.). - Skilled in PyTorch/TensorFlow, Hugging Face, Fairseq, ESPnet. - Experience building scalable APIs for production apps. - Ability to communicate technical work clearly and present to investors. - Startup-ready mindset: comfortable working fast and iterating daily. Why Join Us - Build groundbreaking tech: your work will power the voice of our global platform. - Shape the future: direct path to full-time role + UK sponsorship post-funding. - Confidential project: NDA required before starting contract. Thanks, Mark