Machine Learning Researcher Engineer
Date Posted
05 Aug, 2025
Work Location
Salary Offered
$140000 — $200000 yearly
Job Type
💪🗣 About BoldVoice
BoldVoice helps the 1 billion global non native English speakers speak English with clarity and confidence, so they can advance their careers and lives.
The app gives users instant pronunciation feedback from speech AI, and then teaches them how to improve with video lessons and training exercises, developed by Hollywood accent coaches.
Today, BoldVoice is one of the top Education apps on the App Store and serves non-native speakers of 100+ different language backgrounds all over the world.
💻 About the Role
As a Machine Learning Engineer / Researcher at BoldVoice, you’ll play a critical role in driving the development and optimization of our AI systems. Your work will directly enhance the user experience by creating new machine learning-enabled capabilities, and improve the accuracy and efficacy of our existing machine learning systems. Specifically, you’ll work on:
Model Development and Deployment
- Designing, training, and fine-tuning machine learning models for AI coaching, pronunciation feedback, and accent detection. This will include working on LLMs, speech models like Wav2Wec2.0, and multi-modal models like speech to speech models.
- Deploying these models into production environments for real-time and batch inference.
Pipeline Development and Optimization
- Building reusable and organized data preprocessing pipelines for various data, including audio data, text data and more.
- Setting up automated evaluation systems to monitor model performance.
- Optimizing training workflows to reduce time-to-deployment.
You will be joining a top-notch machine learning team, who are striving to push forward what’s possible in speech and audio AI, but also care about creating practical uses for their work. An example of our team’s research can be found here: Accents in Latent Spaces. Our team is also behind the viral hit: BoldVoice Accent Oracle, which has been tried more than 40m times, by users all over the world.
😀 Who you are
- You have a strong foundation in machine learning, statistics and mathematics, and take pride in building systems that work well and make it into production.
- You thrive on solving challenging problems, and bring equal parts creativity and focus to methodically try out both proven and unproven techniques.
- You care about user experience and are driven to create technologies that make a real difference.
- You want to work fast, you want to not get interrupted by meetings, and you want to not need to ask for permission to do things.
✨ Requirements
- You have at least 3 years of experience working on machine learning models in production environments, specifically training, fine-tuning, evaluating and directly implementing machine learning models, in the fields of Speech, NLP, and/or Vision
- Proficiency in Python and frameworks like TensorFlow, PyTorch, or similar. Experience with speech or audio processing is a plus but not required.
🎁 What we offer
- You will be compensated in salary and generous stock options -- we want you to feel like an integral part of the success and growth of the company
- Benefits include excellent fully paid health/vision/dental insurance and 401K
- We’re an in-person team and we work out of our office in downtown Manhattan in NYC. If you’re not in NYC, we would like to help you move here and can help with your relocation.
- Access to exclusive startup events, conferences and networks
📲 How to Apply:
- Reach out here or email us at engineering [at] boldvoice [dot] com to start the conversation!