Principal ML Research Engineer

Lanesurf | San Francisco, California, United States | 2mo ago

This role has closed. Here are similar open builder roles:

1.	AI Builder Intern - Agentic AI (Mondee) Austin, Texas, United States \| on-site \| internship \| internship \| ai, agentic ai, llms \| 3w ago
2.	AI Systems Weirdo (UniteGPS) South Portland, ME, United States \| on-site \| full-time \| mid \| ai systems design, route optimization, gps tracking \| 3w ago
3.	Software Engineering Intern (Maritime Technology Startup (Stealth)) El Segundo, California, United States \| $40 – $48/hr \| on-site \| internship \| internship \| python, go, javascript \| 3w ago
4.	AI-Native Founding Engineer (Jobright.ai) San Francisco, California, United States \| $130,000 – $170,000/yr \| on-site \| full-time \| lead \| typescript, react, sql \| 3w ago
5.	MLE @ Krnel (NYC, Full-Time) (krnel.ai) New York, New York, United States \| on-site \| full-time \| mid \| machine learning, devops, ci/cd \| 3w ago
6.	AI Builder Intern - Agentic AI (Tabhi) Austin, Texas, United States \| on-site \| internship \| internship \| agentic ai, llms, agent frameworks \| 3w ago
7.	Forward Deployed Engineer (Legion Intelligence) Washington DC, United States \| $185,000 – $260,000/yr \| on-site \| full-time \| mid \| python, javascript, typescript \| 3w ago
8.	Senior Founding Engineer (Ambral) New York City, New York, United States \| $185,000 – $245,000/yr \| on-site \| full-time \| senior \| typescript, nuxt, postgres \| 3w ago
9.	Full-Stack Engineer- Series B Ai · $200-300K + equity (Benchstack Ai) San Francisco, California, United States+1 \| $200,000 – $300,000/yr \| on-site \| full-time \| senior \| react, typescript, python \| 3w ago
10.	Marketing Productivity Engineer (Sigma Computing) San Francisco, United States+2 \| $130,000 – $165,000/yr \| on-site \| full-time \| senior \| performance marketing, growth engineering, marketing operations \| 3w ago

browse all open builder jobs →

Original posting (closed) below

$120,000 – $225,000/yr| full-time | on-site | lead | 3+ years

skills: python, pytorch, llm, speech models, transcriber, transcription accuracy, interruption models, voice agents, machine learning, mlops

Build AI that talks, negotiates rates, and enables autonomous movement of trucks from pickup to delivery

Demo of AI booking a shipment in 10 minutes by speaking to 96 trucking companies simultaneously

The problem

If Walmart needs to move a truck of avocados from California to Chicago, today they must:

Speak with 50+ trucking companies
Check weight and temperature requirements
Negotiate price and availability
Do it one call at a time

This process takes hours and thousands of phone calls every day across the industry.

What we’re building

We’re building AI agents that do this work automatically.

Calls and emails dozens of trucking companies at once
Checks requirements (weight, temperature, lanes)
Negotiates prices in parallel
Books a truck in minutes, not hours

Proof it works

👉 In this demo, our AI spoke to 96 trucking companies simultaneously and booked a shipment in under 10 minutes - https://www.linkedin.com/feed/update/urn:li:activity:7394069447327555584

Why this is exciting

You’ll work on AI that handles real-world transactions through phone calls
Real-world, high-stakes work enabling autonomous logistics - think moving a truck from Chicago to Texas, fully coordinated by AI
Small team, high ownership, fast iteration
Hard problems that don’t exist in benchmarks

What we’ll work on

Train & Tune Models

Fine-tune transcribers and speech models for real-time voice agents operating on live phone calls.

Enable real time transcriber fine-tuning based on caller context
Improve transcription accuracy for domain-specific language under noisy conditions
Fine-tune interruption models on domain-specific conversations
Post-Train speech models for intonations, pacing and naturalness and avoiding robotic cadence

LLM optimization

Structuring modules, and policies that compose cleanly
Optimizing LLM outputs for brevity, correctness, and timing
Reducing drift across long, multi-turn conversations
Evaluating changes against real call outcomes, not just text metrics

Evaluation & iteration

You’ll help define how we measure quality across:

Transcription accuracy where it actually matters
Voice naturalness as judged by listeners
Conversation efficiency and completion

You can be a great fit, if:

ML Engineer with Real-World Experience – You’ve trained and shipped models in production. Bonus if you’ve worked with LLMs or audio models.
Fluent in Modern ML Stack – You know your way around Python, PyTorch, and today’s ML tools - from training pipelines to evaluation benchmarks.
Execution-Oriented – You move fast, take ownership, and focus on solving real problems over perfect ones.
Startup-Ready – You’re adaptable, resilient, and energized by ambiguity and fast-changing priorities.
Clear Communicator & Team Player – You collaborate well across functions and push decisions forward.

Details

Cash + Equity
Location: San Francisco, CA, US

Benefits

cash · equity

Get new builder jobs daily: