AI/ML Backend Engineer (LLMs / Agentic AI)
| 1. | AI Builder Intern - Agentic AI (Mondee) Austin, Texas, United States | on-site | internship | internship | ai, agentic ai, llms | 2w ago |
| 2. | AI Systems Weirdo (UniteGPS) South Portland, ME, United States | on-site | full-time | mid | ai systems design, route optimization, gps tracking | 2w ago |
| 3. | Software Engineering Intern (Maritime Technology Startup (Stealth)) El Segundo, California, United States | $40 – $48/hr | on-site | internship | internship | python, go, javascript | 2w ago |
| 4. | AI-Native Founding Engineer (Jobright.ai) San Francisco, California, United States | $130,000 – $170,000/yr | on-site | full-time | lead | typescript, react, sql | 2w ago |
| 5. | MLE @ Krnel (NYC, Full-Time) (krnel.ai) New York, New York, United States | on-site | full-time | mid | machine learning, devops, ci/cd | 2w ago |
| 6. | AI Builder Intern - Agentic AI (Tabhi) Austin, Texas, United States | on-site | internship | internship | agentic ai, llms, agent frameworks | 2w ago |
| 7. | Forward Deployed Engineer (Legion Intelligence) Washington DC, United States | $185,000 – $260,000/yr | on-site | full-time | mid | python, javascript, typescript | 2w ago |
| 8. | Senior Founding Engineer (Ambral) New York City, New York, United States | $185,000 – $245,000/yr | on-site | full-time | senior | typescript, nuxt, postgres | 2w ago |
| 9. | Full-Stack Engineer- Series B Ai · $200-300K + equity (Benchstack Ai) San Francisco, California, United States+1 | $200,000 – $300,000/yr | on-site | full-time | senior | react, typescript, python | 2w ago |
| 10. | Marketing Productivity Engineer (Sigma Computing) San Francisco, United States+2 | $130,000 – $165,000/yr | on-site | full-time | senior | performance marketing, growth engineering, marketing operations | 2w ago |
AI/ML Backend Engineer (LLMs / Agentic AI)
$150,000 – $250,000 + Equity + 401(k) + Full Benefits + PTO + Relocation Support + Progression
Miami, Florida – Fully Onsite (Paid Relocation)
Are you a Backend Engineer with strong Python experience, already building with LLMs and AI agents in production, and looking to step into a role where you’ll own systems end-to-end in a true 0-1 environment?
This is a rare opportunity to join a well-funded, AI-native startup at a critical growth stage, already powering millions of real-world conversations per month across enterprise customers.
Backed by top-tier investors and experienced operators, the business has achieved strong product-market fit and is now scaling aggressively they are building production-grade AI systems that deliver real commercial impact, not prototypes.
You’ll be one of the key engineers shaping how these systems are architected, deployed, and scaled, with direct ownership over core backend infrastructure and AI systems. You will design, build, and scale high-performance backend systems that power AI agents and conversational workflows. This includes backend services, LLM integrations, retrieval systems, and infrastructure focused on speed, reliability, and scalability at scale.
If you’re already building with LLMs and want to own how real-world AI systems are designed, scaled, and deployed in production, this is a standout opportunity to do it in a high-growth, well-funded environment.
The Person:
* Strong Python skills (FastAPI or similar)
* Experience building with LLMs in production (agent loops, tool calling, prompting)
* Deep understanding of embeddings, vector search, and retrieval systems
* Experience designing agent workflows / agent frameworks
* Building integrations against third-party APIs (CRMs, webhooks, OAuth flows)
* Thinking in reusable, multi-tenant patterns:
* Looking for a fast paced start-up culture
The Role:
* A fast-growing AI-native conversational AI company in Miami
* Team of 5 other highly skilled developers
* Design and build core backend systems for AI-driven voice and chat platforms
* Develop scalable services using Python / FastAPI in production
* Build and optimize LLM-powered systems (tool calling, multi-step agents, structured outputs)
* Architect real-time retrieval systems (embeddings + vector search)
* Build integrations with third-party systems (CRMs, APIs, OAuth, webhooks)
* Develop multi-tenant, scalable architecture across multiple customers
* Own performance, reliability, and system scalability end-to-end