J
Jobs Base 0-to-1 builder jobs
2,506 active jobs 43 new today

Head of Applied AI

Niural AI | New York City, New York, United States | Yesterday
full-time | on-site | lead | 6+ years
skills: python, rag, prompt engineering, llm evaluation, model serving, vector databases, embedding pipelines, nlp, langgraph, autogen, crewai, multi-agent orchestration

NIURAL

Head of Applied AI

New York City · Onsite · Full-Time · Reports to CEO & CTO

The Short Version

Niural is the AI-native platform that unifies payroll, compliance, HR, and financial operations across 150+ countries. Our AI agent layer, EMMA, doesn’t sit on top of the product, it is the product. We process high-stakes, multi-jurisdictional payroll and compliance data where a single error can mean regulatory penalties, missed paychecks, or broken trust.

We’re hiring a Head of Applied AI to own the entire AI engineering function. This is a hands-on technical leadership role for someone who ships production systems, not someone who manages from a distance. You will design, build, and iterate on the AI systems that make Niural’s platform intelligent, and you will set the engineering standard for the team that grows around you.

Backed by Marathon, M13, and Inspired Capital.

Why This Role Is Different

Most “Head of AI” roles at startups mean wrangling a chatbot wrapper or running prompt experiments. This one is different because the underlying domain is genuinely hard and unsolved:

  • Multi-jurisdictional document understanding across 150+ countries, each with distinct payroll rules, tax codes, and employment regulations, in dozens of languages.

  • Structured data extraction from payroll and compliance inputs where accuracy isn’t aspirational, it’s a regulatory requirement. You will build systems where the cost of a hallucination is a compliance violation, not a bad search result.

  • Agentic workflows (EMMA) that need to execute autonomously on financial transactions while remaining explainable and auditable. This isn’t a chatbot that just answers questions, it’s autonomous payroll execution.

  • Real-time anomaly detection on financial data across currencies, jurisdictions, and tax regimes simultaneously.

If you want to do serious applied AI work in a domain where the output actually matters, where model performance translates directly to whether people get paid correctly, this is the role.

What You Will Own

You report to the CEO and CTO. Your focus is on delivering, getting AI systems into production, iterating on model performance, and building engineering practices that scale.

  • Production AI Systems. Design, build, and ship LLM-powered features end-to-end: document understanding, structured data extraction from payroll and compliance inputs, multi-step reasoning workflows, and intelligent automation across the Niural platform. You write and review code. You deploy to production. You own the outcome.

  • Evaluation & Reliability. Construct rigorous evaluation frameworks with ground-truth datasets, regression tracking, and clear production-readiness criteria. In regulated financial and employment contexts, “works most of the time” is not good enough. Define what “good” means, measure it, and hold the bar.

  • AI Infrastructure. Build and own the core stack: RAG pipelines, vector stores, embedding systems, fine-tuning workflows, model serving, and observability tooling. Make principled build-vs-integrate decisions based on accuracy, cost, latency, and data privacy.

  • Responsible AI in Practice. Implement hallucination controls, confidence scoring, human-in-the-loop review flows, and audit trails for model-driven decisions. This is a domain where responsible AI isn’t a slide deck, it’s an engineering requirement with regulatory consequences.

  • Cross-Functional Translation. Work with product and engineering to translate business requirements into concrete model specifications, data requirements, and acceptance criteria. Bridge the gap between “we need AI to do X” and a shipped, measurable feature.

  • Team & Engineering Culture. Set the engineering foundations, code standards, review processes, documentation practices, that will support a growing AI team. You’ll be the first dedicated AI hire; the team grows around the standard you set.

What We’re Looking For

We care about what you’ve built and shipped, not where you trained. Show us the production systems, the eval frameworks, the hard tradeoffs you’ve made.

  • 6+ years hands-on in ML or AI engineering, with at least 2 years building and shipping production LLM systems, not notebooks, not prototypes, production.

  • Strong Python engineering skills. You write production-quality code that other engineers can review, test, and extend.

  • Direct experience with RAG architectures, prompt engineering, fine-tuning, and LLM evaluation, including building eval datasets and tracking model performance over time.

  • Solid command of ML infrastructure: model serving, vector databases, embedding pipelines, and monitoring in production environments.

  • Experience working with structured and unstructured data in a high-accuracy, low-tolerance-for-error context. Fintech, HR tech, legal tech, or healthcare strongly preferred.

  • Demonstrated ability to translate ambiguous business requirements into concrete technical approaches and ship them cross-functionally.

  • Comfortable with a high degree of ownership in an early-stage environment where the right process needs to be created, not followed.

Bonus Points

  • Experience with agentic systems and multi-agent orchestration in production- LangGraph, AutoGen, CrewAI or similar (not just toy examples) .

  • Background in NLP applied to legal, financial, or regulatory document processing, especially multilingual.

  • Familiarity with global payroll, employment law, or multi-jurisdictional compliance data.

  • Prior experience as an early AI hire at a Series A - C company where you built the function from scratch.

This Role Is Probably Not For You If…

  • You prefer managing engineers to writing code. This role is at least 70% hands-on for the first 12–18 months.

  • You want to do pure research. We selectively incorporate techniques that are genuinely relevant; we don’t publish papers.

  • You need established processes and clear handoffs. At this stage, you’ll often be defining the process, not inheriting it.

  • You’re uncomfortable with regulated, high-stakes domains where shipping fast and shipping correctly are both non-negotiable.

What Niural Offers

  • Competitive base + significant equity at a high-growth, venture-backed company approaching Series B.

  • Onsite at Niural’s New York headquarters, working directly with a focused, technical team.

  • Direct access to the CEO and CTO. Your technical decisions will have immediate, visible product impact.

  • Well-resourced AI infrastructure budget and access to frontier model APIs from day one.

  • A domain with genuine complexity, real data, and problems that have not been solved yet, across 150+ countries.

  • Comprehensive health, dental, and vision coverage. Learning and development budget.

Niural is an equal opportunity employer. We value diverse perspectives and believe your potential should only be limited by the size of the problem you want to solve.

Benefits

health insurance · dental insurance · vision insurance · learning and development budget · equity
Get new builder jobs daily: