Software Engineer Jobs New York NY 2026

Baseten — the AI inference and training platform trusted by frontier companies including Cursor, Notion, Abridge, Clay, Gamma, and Writer — is hiring a Software Engineer in New York, NY. Backed by a $1.5 billion Series F led by Altimeter Capital, Conviction Partners, and Spark Capital, Baseten powers mission-critical AI workloads at scale. You will own features like multi-node training and serverless reinforcement learning from concept through to production — earning a salary of $165,000 to $330,000 plus meaningful equity.

Compensation Range: $165,000 – $330,000 per year + Meaningful Equity + 100% Medical, Dental & Vision Coverage for Employee & Dependents

About Baseten — Frontier AI Infrastructure Leader

Funding: $1.5 Billion Series F — led by Altimeter Capital, Conviction Partners, and Spark Capital

Mission: Power mission-critical AI inference and training for the world’s most dynamic AI companies

Clients: Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, Writer — frontier AI builders

Platform: Applied AI research, flexible infrastructure, and seamless developer tooling in one unified stack

Growth: Rapidly scaling team building the platform engineers turn to when shipping AI products at speed

Career Opportunity & Why Baseten New York

Market Position: The go-to inference and training platform for frontier AI companies globally

Compensation: $165K–$330K base salary plus meaningful equity in a $1.5B Series F company

Tech Depth: Build across the full stack — API, backend, infrastructure, and ML training layers

Career Impact: Ship products used by the world’s most ambitious AI teams at production scale

Position Overview

The Software Engineer at Baseten in New York is a high-ownership, full-stack role at the frontier of AI infrastructure development. You will own complex features — from multi-node training pipelines and serverless reinforcement learning to checkpointing systems and developer experience tooling — taking them from initial conception through MVP and all the way to general availability. Working through the complete technical stack from API and UI down to the infrastructure layer, you will partner with world-class research engineers, fine-tune models to develop genuine user workflow intuition, and help build the training platform that the most ambitious AI companies on the planet trust to ship their most demanding models into production.

 Why This Role Matters: As Software Engineer at Baseten, you directly build the infrastructure that lets frontier AI companies like Cursor, Notion, and Abridge bring cutting-edge models into production. You will architect and ship multi-node training systems, serverless RL environments, and seamless model checkpointing pipelines used at real scale, work alongside research engineers applying the latest training techniques including FSDP, DeepSpeed, LoRA, and Reinforcement Learning, earn top-of-market compensation ($165K–$330K) plus meaningful equity in a $1.5B Series F company, and define the future of how AI developers train, evaluate, and deploy their most important models.

Key Responsibilities

Product Engineering & Feature Ownership

  • Own end-to-end features like multi-node training and serverless reinforcement learning from conception all the way through to MVP and general availability
  • Design ergonomic, developer-friendly APIs and clean abstractions that model complex distributed training resources and lifecycle states
  • Iterate rapidly on product hypotheses, ship working features early, and refine based on real user feedback and production data
  • Drive long-term reliability improvements and accelerate development velocity across Baseten’s training platform infrastructure

Full-Stack Technical Implementation

  • Work throughout the complete technical stack — API layer, backend services, database implementation, and deep infrastructure layer
  • Build and maintain the checkpointing pipeline including automated model versioning, cloud backup, and seamless inference server deployment
  • Develop Kubernetes-layer solutions for multi-node training job scheduling, startup coordination, inter-node communication, and clean shutdown
  • Contribute to CLI and UI tooling experiences that enable users to manage, monitor, and iterate on training jobs with ease and speed

ML Training Workflow & Research Collaboration

  • Fine-tune and deploy AI models yourself to develop genuine, hands-on intuition around real user training workflow pain points
  • Partner closely with world-class research engineers to deeply understand requirements across post-training and model development workflows
  • Collaborate with model developers leveraging state-of-the-art techniques to build product experiences that meaningfully accelerate model development
  • Enhance training observability from pod-level GPU metrics to per-GPU and per-node performance visibility for users

Reliability, Quality & Customer Impact

  • Investigate and fix bugs with urgency, resolving customer-reported issues quickly to maintain platform trust and reliability
  • Drive architectural decisions that improve the long-term scalability and resilience of distributed training systems at Baseten’s scale
  • Support product releases across the full release lifecycle including MVP, Beta, and General Availability stages with clear quality standards
  • Bring strong customer obsession to every product decision — building tools that genuinely help AI engineers ship faster and smarter

Requirements & Technical Skills

Core Requirements

  • 5+ years of professional experience building scalable, production-grade software applications in engineering roles
  • Deep knowledge of the full web stack including databases, distributed systems architecture, and backend service design
  • Experience developing developer tooling or infrastructure products for external engineers or internal platform users
  • Strong product taste — particularly for developer-facing tools — and the ability to make opinionated, well-reasoned product decisions
  • High agency, strong ownership mindset, and the drive to take complex problems from ambiguous to shipped without heavy direction
  • Strong communication skills bridging deep technical depth and clear business context across engineering and product teams

Nice to Have

  • Experience shipping features and products through structured release cycles — MVP, Beta, GA — in a fast-moving engineering environment
  • Familiarity with model development paradigms including Supervised Fine-Tuning, Reinforcement Learning, LoRA, and Synthetic Data Generation
  • Hands-on experience with open source training frameworks such as PyTorch, Megatron, NemoRL, VeRL, Axolotl, or Hugging Face Trainer
  • Experience with distributed training techniques including FSDP, DeepSpeed, and NCCL inter-node communication systems
  • Frontend development fluency for contributing to UI experiences that complement backend and infrastructure work

Benefits & Compensation at Baseten

  • Competitive base compensation of $165,000 to $330,000 per year plus meaningful equity in a $1.5B Series F company
  • 100% employer-paid medical, dental, and vision insurance coverage for the employee and all dependents
  • Flexible PTO policy plus a company-wide Winter Break — offices closed from Christmas Eve through New Year’s Day
  • Paid parental leave and a fertility and family-building stipend through Carrot for eligible employees
  • Company-facilitated 401(k) retirement savings plan
  • Direct exposure to a rich ecosystem of ML startups, offering unparalleled learning and career networking opportunities

 About Baseten

Baseten powers mission-critical AI inference and training for the world’s most dynamic AI companies — including Cursor, Notion, Abridge, Clay, and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, Baseten enables companies operating at the frontier of AI to bring cutting-edge models into production with confidence and speed. The company recently completed its $1.5 billion Series F led by Altimeter Capital, Conviction Partners, and Spark Capital — and is growing rapidly to build the definitive platform that engineers trust when shipping AI products at real scale.

Career Excellence: Join one of the most exciting and well-funded AI infrastructure companies in the world and help build the training platform that powers the next generation of frontier AI — right from New York City.

 Who Should Apply?

  • Senior Software Engineers: With 5+ years of distributed systems or infrastructure experience and genuine interest in AI/ML platforms
  • AI Infrastructure Engineers: Seeking a high-ownership role at a well-funded ($1.5B Series F) frontier AI company in New York
  • ML Platform Engineers: With PyTorch, Kubernetes, or model training pipeline experience wanting to work at the cutting edge
  • Developer Tooling Engineers: Passionate about building elegant, powerful APIs and CLI experiences used by top-tier AI engineers
  • Full-Stack Engineers: With strong backend depth who want to work across the entire stack from infrastructure to user-facing product features
Software Engineer Jobs New York NY 2026

Recently opened JOBS 👇🏻

AI Engineering Lead Jobs UAE 2026 AI Engineer Jobs Dubai UAE 2026

Leave a Comment

Select Your Degree:
Please select an option.
Select Your Experience:
Please select an option.
Select Currently Your Location:
Please select an option.