J
Jobs Base 0-to-1 builder jobs
317 active jobs 4 new today

Staff AI Engineer

MLabs | New York, United States | Today
$175,000 – $250,000/yr| full-time | remote | senior
skills: python, go, typescript, ml engineering, reinforcement learning, online learning, closed-loop systems, data pipelines, distributed systems, llm, peft, lora, vllm, tgi, inference optimization, financial ml, alpha research, portfolio optimization, execution, multi-agent systems, on-chain data, defi
Location: Need to be able to work EST timezone.
Remote | Full-time
Compensation: $175K - $250K
We are hiring on behalf of our client who is developing a cutting-edge autonomous agent runtime focused on high-frequency financial environments. While current agents operate effectively as independent units, the next phase of evolution involves building a sophisticated intelligence layer where the entire fleet learns autonomously from real-time market outcomes.
The Staff AI Engineer will be responsible for moving beyond manual propagation of insights to a system where the fleet gets smarter with every trade. This is a high-stakes production role, not a research position. The feedback loop is immediate and measurable: the work produced either enhances agent profitability or it does not. The successful candidate will own the intelligence layer that turns thousands of daily trading decisions into compounding, autonomous growth.
Key Responsibilities:
Learning & Optimization
  • Feedback Loop Implementation: Design and implement systems that connect trade outcomes back to strategy improvement, specifically focusing on signal selection, risk parameters, position sizing, and timing
  • Evaluation Frameworks: Build frameworks to quantify which signals and market conditions accurately predict profitable trades versus noise
  • Automated Strategy Generation: Develop systems to explore new configurations, backtest them against real fleet data, and surface candidates for deployment autonomously
  • Market Adaptation: Build mechanisms to detect shifts in market conditions (e.g., trending vs. choppy) and adapt fleet behavior in real-time
Autonomous Fleet Intelligence
  • Fleet Monitoring: Create higher-order agents for automated monitoring to catch configuration errors and performance degradation across all concurrent agents
  • Performance Attribution: Decompose trades into component drivers—signal accuracy, execution efficiency, and exit timing—to feed insights back into strategy design
  • Coordination & Risk: Manage concentration risk and capital allocation across the fleet, balancing the exploration of new approaches with the exploitation of proven strategies
Model & Inference
  • Infrastructure Ownership: Transition from external LLM dependence to controlled intelligence, evaluating hosting strategies ranging from proxied external models to fine-tuned, domain-specific models
  • Data Capture: Build the telemetry and data capture layer to ensure every decision and outcome is structured and queryable
  • Domain-Specific Training: Determine the efficacy of domain-specific training over general-purpose prompting and build the necessary pipelines for implementation
  • Inference Optimization: Optimize inference for many concurrent agents, ensuring structured decision outputs and cost-efficiency at scale
Requirements
Essential Qualifications
  • Production ML Engineering: Proven experience training, deploying, and maintaining models that run in production and directly impact business outcomes
  • Reinforcement/Online Learning: Deep understanding of the practical challenges of learning from real-world outcomes rather than static datasets
  • Closed-Loop Systems: A track record of building systems where predictions lead to actions that generate outcomes, which then feed back into improved predictions
  • Software Engineering: Proficiency in Python is required, with additional comfort in Go or TypeScript for production services. Experience building data pipelines and distributed systems is essential
Preferred Experience
  • Financial ML: Background in signal generation, alpha research, portfolio optimization, or execution
  • LLM Specialization: Experience with fine-tuning and serving (PEFT/LoRA, vLLM, TGI) or custom inference pipelines
  • Multi-Agent Systems: Experience designing environments where autonomous agents coordinate or learn from one another
  • Domain Knowledge: Background in on-chain data, DeFi protocols, or sectors where agents make sequential decisions under uncertainty (e.g., robotics, game AI)
Benefits
  • Base Salary: $175,000 - $250,000 USD (dependent on location and experience)
  • Equity: Approximately 1% initial stock grant, with significant valuation growth potential
  • Performance Incentives: Eligibility for salary increases and bonuses tied directly to revenue and usage
  • Token Participation: Pro-rata participation in the client's planned 2026 token launch
  • Ownership: High-impact role with meaningful upside tied directly to the success of the autonomous fleet
Interview Process:

  • Initial Interview: Discussion with the Founder/CEO
  • Technical Assessment: A take-home test to evaluate practical application
  • Technical Interview: Deep dive into engineering and ML capabilities
  • Final Interview: Comprehensive final review
Due to the high volume of applications we anticipate, we regret that we are unable to provide individual feedback to all candidates. If you do not hear back from us within 4 weeks of your application, please assume that you have not been successful on this occasion. We genuinely appreciate your interest and wish you the best in your job search.
Commitment to Equality and Accessibility:
At MLabs, we are committed to offer equal opportunities to all candidates. We ensure no discrimination, accessible job adverts, and providing information in accessible formats. Our goal is to foster a diverse, inclusive workplace with equal opportunities for all. If you need any reasonable adjustments during any part of the hiring process or you would like to see the job-advert in an accessible format please let us know at the earliest opportunity by emailing human-resources@mlabs.city.
MLabs Ltd collects and processes the personal information you provide such as your contact details, work history, resume, and other relevant data for recruitment purposes only. This information is managed securely in accordance with MLabs Ltd's Privacy Policy and Information Security Policy, and in compliance with applicable data protection laws. Your data may be shared only with clients and trusted partners where necessary for recruitment purposes. You may request the deletion of your data or withdraw your consent at any time by contacting legal@mlabs.city.

Benefits

base salary · equity · performance incentives · bonus · token participation · ownership