J
Jobs Base
999 active jobs

Senior Infrastructure Engineer, Applied AI Engineer

Mixpeek | United States | 1mo ago
This role has closed. Here are similar open builder roles:
1.
United States | remote | full-time | executive | full-stack engineering, ai, llms | 2w ago
2.
United States+8 | $165,000 – $200,000/yr | remote | full-time | senior | backend services, data pipelines, api design | 2w ago
3.
Mexico+1 | remote | full-time | senior | scala, gcp, react | 2w ago
4.
United States | remote | full-time | executive | salesforce, hubspot, conversation intelligence | 2w ago
5.
Senior Software Engineer (PlayOn Sports)
United States | remote | part-time | senior | python, ai, computer vision | 2w ago
6.
United States+1 | $148,000 – $275,000/yr | remote | full-time | lead | python, typescript, go | 2w ago
7.
United States | remote | full-time | senior | typescript, react, aws lambda | 2w ago
8.
United States | remote | full-time | senior | sql, python, rag | 2w ago
9.
United States+1 | $190,000 – $280,000/yr | remote | full-time | lead | react, swift, kotlin | 2w ago
10.
United States+1 | $190,000 – $280,000/yr | remote | full-time | senior | react, swift, kotlin | 2w ago
Original posting (closed) below
full-time | remote | senior
skills: python, ray, s3, fastapi, react, typescript, vector databases, distributed systems, ai, machine learning
Mixpeek is multimodal AI infrastructure — we turn unstructured content (video, images, audio, documents) into searchable, programmable assets through a unified API. Think of us as the data infrastructure layer that sits between your raw media and your AI applications. If you've ever tried to build a pipeline that extracts features from video, makes them retrievable, and serves them at scale — you know this is a 12-18 month infrastructure project. We compress that to an afternoon.

We're currently building MVS (Mixpeek Vector Store) — a distributed vector database built on Ray + S3, designed to handle 100B+ vectors at a fraction of the cost of existing solutions. Architecture details: shard-level WAL, LIRE-based adaptive search, replica sets, and agent-native query primitives. If you've ever wanted to rethink how vector search works from the storage layer up, this is that project.

Some things we're shipping right now:

- IP safety for media & sports — our copyright detection platform (https://copyright.mixpeek.com) helps brands and leagues detect unauthorized use of visual IP at scale. We're working with partners in the media/sports ecosystem including Backblaze for storage-native integration.

- Healthcare pipelines — multimodal extraction for clinical trial recruitment and SNF/MDS coding workflows, working with enterprise partners in the space.

- Ad verification — we contribute to the IAB Tech Lab ARTF working group and power contextual intelligence for ad safety.

Our core primitives: feature extractors, retrievers, taxonomies, clusters. Decompose with extractors, recompose with retrievers. Docs: https://docs.mixpeek.com

Stack: Python, Ray, S3, FastAPI, React/TypeScript. We also maintain amux [https://github.com/mixpeek/amux], an open-source tmux multiplexer for running parallel Claude Code agent sessions — if you're into agentic dev workflows, check it out.

I'm Ethan (founder/CEO, previously led search at MongoDB). Small team, high ownership, real problems. We're preparing for NAB Show next week and scaling enterprise pipeline work across healthcare, adtech, and media.

Reach out: ethan [at] mixpeek [dot] com — mention HN.

Get new builder jobs daily: