R

Software Engineer, ML Infrastructure

Recruiting From Scratch
Full-time
On-site
San Francisco, California, United States
Software / Technology / IT
Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients.

Senior Machine Learning Infrastructure Engineer

  • Location: Redwood City, CA
  • Company Stage of Funding: Series A ($46M raised)
  • Office Type: Hybrid (4 days in office)
  • Salary: $180K - $250K (Higher for exceptional candidates) + Significant Equity

Company Description

We are representing an exciting AI startup revolutionizing how deep learning models are trained. Our client builds cutting-edge tools that automatically select optimal data for training deep learning models, eliminating redundant, noisy, or harmful data points. Their modality-agnostic algorithms don't require labels, making them ideal for next-generation large deep learning models. Their technology enables customers across industries to train better models more cost-effectively.

What You Will Do

  • Design, build, and maintain robust training infrastructure for in-house ML research and validation efforts
  • Develop core infrastructure for running curation pipelines delivered to customers
  • Partner closely with founders to influence product direction and drive business-critical technical decisions
  • Create scalable systems that impact the company's ability to deliver, scale, and deploy their product
  • Collaborate with researchers and other stakeholders to understand requirements and implement effective solutions
  • Help build a world-class engineering culture as an early senior team member

Ideal Candidate Background

  • 5-15 years of experience as a software engineer working with infrastructure for Machine Learning models
  • Senior, Staff, or Principal level engineering experience
  • Experience on compute teams at major tech companies (e.g., Google, Amazon, Meta) or AI startups training machine learning models
  • Track record of leading and building production ML infrastructure and platforms that deliver on major product initiatives
  • Proficiency in Python and common infrastructure tools: Linux, Kubernetes, Terraform/Pulumi, etc.
  • Strong knowledge of cloud-native systems, Kubernetes workloads, and distributed systems
  • Effective communication skills for collaborating with researchers and various stakeholders
  • Computer Science degree

Preferred Qualifications

  • Experience with AWS and GCP cloud platforms
  • Background in scaling ML systems from prototype to production
  • Familiarity with ML research and latest developments in the field
  • Previous startup experience
  • Passion for advancing machine learning technology

Compensation and Benefits

  • Competitive salary range: $180K - $250K (higher for exceptional candidates)
  • Significant equity package
  • Comprehensive health benefits
  • Visa sponsorship available (TN, H1-B transfer, and O1)
  • Opportunity to work with cutting-edge ML technology
  • Chance to make significant impact at an early-stage, well-funded AI startup
  • Collaborative team of 22 talented professionals