Staff Engineer
VLM Run | Santa Clara, California, United States | 1mo ago
This role has closed. Here are similar open builder roles:
| 1. | AI Builder Intern - Agentic AI (Mondee) Austin, Texas, United States | on-site | internship | internship | ai, agentic ai, llms | 1mo ago |
| 2. | AI Systems Weirdo (UniteGPS) South Portland, ME, United States | on-site | full-time | mid | ai systems design, route optimization, gps tracking | 1mo ago |
| 3. | Software Engineering Intern (Maritime Technology Startup (Stealth)) El Segundo, California, United States | $40 – $48/hr | on-site | internship | internship | python, go, javascript | 1mo ago |
| 4. | AI Builder Intern - Agentic AI (Tabhi) Austin, Texas, United States | on-site | internship | internship | agentic ai, llms, agent frameworks | 1mo ago |
| 5. | Senior Founding Engineer (Ambral) New York City, New York, United States | $185,000 – $245,000/yr | on-site | full-time | senior | typescript, nuxt, postgres | 1mo ago |
| 6. | Marketing Productivity Engineer (Sigma Computing) San Francisco, United States+2 | $130,000 – $165,000/yr | on-site | full-time | senior | performance marketing, growth engineering, marketing operations | 1mo ago |
| 7. | Software Engineer (Cognition) San Francisco, California, United States | From $260,000/yr | on-site | full-time | mid | python, distributed systems, ai | 1mo ago |
| 8. | Intelligence Architect (Basis) New York, New York, United States | $150,000 – $225,000/yr | on-site | full-time | senior | applied machine learning, natural language processing, system design | 1mo ago |
| 9. | Senior GNC Engineer (Inversion) Playa Vista, California, United States | $139,000 – $199,000/yr | on-site | full-time | senior | kalman filtering, sensor fusion, state estimation | 1mo ago |
| 10. | Forward Deployed Engineer (Stuut) New York City, New York, United States | $150,000 – $240,000/yr | on-site | full-time | senior | python, apis, etl | 1mo ago |
Original posting (closed) below
full-time | on-site | lead
skills: machine learning, product engineering, ml engineering, rust, python, inference, orchestration, vision-language models, cli, benchmarking
We're building the inference and orchestration layer for production Vision-Language Models. We care deeply about fast and ergonomic visual inference, reliable structured outputs, and the observability to iterate on them.
A few things we've shipped recently you can poke at:
1. Orion: our visual agent that reasons and acts over images, video, and documents. Chat at https://chat.vlm.run.
2. mm-ctx: a Unix-style multimodal CLI (find, cat, grep, wc) that gives coding agents real context over images, video, and PDFs. Rust core, Python devex.
3. vlmbench: single-file CLI for benchmarking VLM inference (TTFT, TPOT, throughput) across vLLM, Ollama, and SGLang.
Apply: https://app.dover.com/jobs/vlm-runEmail hiring "at" vlm.run with your GitHub + a couple recent projects.
[2] https://pypi.org/project/mm-ctx | https://www.vlm.run/open-source/mm
[3] https://github.com/vlm-run/vlmbench | https://www.vlm.run/open-source/vlmbench
Get new builder jobs daily: