Infrastructure Engineer

Remote from
USA
Salary, yearly, USD
150 - 175
Employment type
Full Time,
Job posted
Apply before
21 Jul 2026
Experience level
Midweight
Views / Applies
154 / 38

About Roboflow

Everything you need to build and deploy computer vision models, from automated annotation tools to high-performance deployment solutions.

Actively Hiring
Verified job posting
This job post has been manually reviewed for authenticity and compliance.

AI Summary

Roboflow is seeking an Infrastructure Engineer to secure, scale, and maintain the infrastructure powering their computer vision platform. This role involves working with Kubernetes, infrastructure-as-code, cloud services, and machine learning pipelines. The ideal candidate has production experience with Kubernetes, Terraform, and scaling large applications, along with programming skills in Node.js and Python. The position offers high autonomy and ownership in a fast-paced startup environment. The team is small but impactful, and the company values curiosity, ownership, and a track record of improving systems.

Role DNA

Job Complexity
Easy Hard
Pace & Pressure
Relaxed Fast-paced
Autonomy Level
Guided Full Ownership
Communication Load
Independent Highly Collaborative
AI Insight The role requires deep expertise in Kubernetes, infrastructure-as-code, and scaling large-scale applications, along with cross-team collaboration and ML pipeline knowledge, making it a challenging position for experienced engineers.

Salary Analysis

Median Highly Competitive
USD162,500
US Market
USD120k – 200k
0 USD220k
AI Insight The offered salary range of $150,000 to $175,000 is competitive and above the typical US market median for Infrastructure Engineers, which is around $140,000. The midpoint of $162,500 aligns well with the specialized skills required, including Kubernetes, Terraform, and ML infrastructure.

Key Skills

Kubernetes Terraform AWS GCP Python Node.js CI/CD Machine Learning Security Infrastructure as Code

Dear Hiring Manager,

I am writing to express my strong interest in the Infrastructure Engineer position at Roboflow. With extensive experience in Kubernetes, Terraform, and cloud infrastructure at scale, I have successfully designed and maintained systems that handle millions of requests per day. At my previous company, I led the migration to a microservices architecture on Kubernetes, improving deployment frequency by 80% while maintaining 99.99% uptime.

I am particularly drawn to Roboflow's mission of making the world programmable through computer vision. I have used Roboflow in a side project to train a custom object detection model, and I was impressed by the developer experience. I thrive in fast-paced environments that value ownership and am eager to contribute to securing and scaling the infrastructure that powers your platform.

I look forward to discussing how my skills align with Roboflow's needs. Thank you for your consideration.

Describe your experience with Kubernetes in production. How did you handle scaling and security?
I have managed Kubernetes clusters for a SaaS platform serving over 10 million users. I implemented Horizontal Pod Autoscaling and Cluster Autoscaling to handle traffic spikes, and used network policies and RBAC for security. I also set up Prometheus and Grafana for monitoring and alerting.
How do you approach infrastructure as code? Can you give an example of a Terraform project you worked on?
I use Terraform with modules for reusable components. For example, I built a module for provisioning EKS clusters that included VPC, subnets, node groups, and IAM roles. This allowed the team to deploy consistent environments in minutes.
Explain a time you had to troubleshoot a critical infrastructure issue. What was your process?
Once, a database replication lag caused read latency for users. I quickly analyzed the logs and found that a recent deployment had increased write load. I temporarily scaled up the primary and added read replicas, then worked with the product team to optimize queries.
How do you ensure security in cloud infrastructure? What best practices do you follow?
I follow the principle of least privilege for IAM roles, encrypt data at rest and in transit, use network segmentation with security groups and VPCs, and regularly conduct vulnerability scans. I also implement Infrastructure as Code scanning in CI/CD pipelines.
Describe your experience with machine learning infrastructure. Have you worked with GPU clusters or ML pipelines?
I have set up GPU-enabled Kubernetes clusters for training models using NVIDIA GPU Operator. I also built CI/CD pipelines for ML workflows using Kubeflow and Argo, automating data preprocessing, training, and model deployment.

Roboflow – Infrastructure Engineer

Who We Are

Our mission is to make the world programmable. Sight is one of the key ways we understand the world, and soon this will be true for the software we use, too.

At Roboflow, we’re building the tools, community, and resources needed to make the world programmable with artificial intelligence. Roboflow simplifies building and using computer vision models. Today, over 1M+ developers, including those from half the Fortune 100, use Roboflow’s machine learning open source and hosted tools. That includes counting cells to accelerate cancer research, improving construction site safety, digitizing floor plans, preserving coral reef populations, guiding drone flight, and much more.

Our team is small relative to our impact, and we believe our user success is our success (not the inverse). A team member summarized: “Roboflow is a company full of giant brains and tiny egos.” We find software has a multiplier effect on all roles (not only product and engineering), so Roboflow employs developers across the company in design, sales, customer support, marketing, and beyond.

We’re supported by great customers and investors, having raised over 63 million from Y Combinator, Google Ventures, Craft Ventures, Sam Altman, Lachy Groom, amongst other leading software investors.

What We’re Looking For

Primarily, you like to make great things with passionate colleagues. You are someone that likes to own outcomes, not only inputs. You’re motivated by having responsibility and accountability. You’re eager to ‘do the work,’ big and small.

You’re curious and learning about new technologies, perhaps an early tinkerer with MLOps products. You show more than you tell.

You’re motivated by the question, “How can I improve this?” and have a track record of doing so, even in ways adjacent to your role. Much of our current team is made up of former founders and thrive in the level of autonomy at Roboflow. Maybe you had a side hustle in high school or college.

Many Roboflowers have used our tools before joining. One of the best ways to stand out amongst other applicants is to write about something you have built with Roboflow or contribute to one of our open source projects. Likewise we highly value users with meaningful contributions to successful open source devtool and security projects.

What You’ll Do

The focus of this role is on securing, scaling, and maintaining the infrastructure that powers our product backend, including: our cloud architecture, databases, file storage, search cluster, micro-services, and machine learning pipelines.

You’ll be working alongside our existing infrastructure team along with doing cross-team work spanning product, operations and customer-facing projects and should have the ability to context switch across a wide range of infrastructure, security and systems engineering work in a fast-paced startup environment.

Skillset

Some or all of the following would be helpful:

  • Production experience with Kubernetes
  • Infrastructure-as-code – Terraform, Kubernetes Helm charts, bash scripting and Python-based automation in production environments
  • Scale – operating infrastructure for large scale applications, especially in the machine learning/AI space
  • Site reliability – alerting, monitoring, scaling services in AWS and GCP clouds
  • Node.js and Python programming skills; ability to work with full-stack developers on designing, developing, and operating SaaS applications
  • Experience with machine learning/big data at scale (GPU, Docker and Kubernetes)
  • Experience with CI/CD automation (for example Github actions, Spacelift)
  • Prior experience with machine learning libraries and stacks (Pytorch, PyTorch, Tensorflow, OpenCV, Supervision) etc. is a plus.
  • Awareness of security best practices and tightening infrastructure for highly secure cloud operations; ideally experienced in a GDPR, ISO 27001 and/or SoC2 certification for SaaS applications
  • Implement engineering security and reliability best practices in roboflow applications and infrastructure

Examples of tasks

  • Running a high availability machine learning inference service
  • Work with customer security teams to securely integrate Roboflow with their systems
  • Develop infrastructure-as-code solutions to scale Roboflow in a cost-effective manner
  • Work with the engineering team to define SLAs and SLOs, and participate in addressing security and reliability incidents across the platform
  • Diving into cost optimization opportunities across the Roboflow stack
  • Be part of teams designing and deploying new product features, including hands on coding in Python, Javascript and other related technologies
  • Work with SoC2, HiPAA and GDPR requirements by improving security across systems and processes at Roboflow, making Roboflow audit-ready for the highest security standards in the industry
  • Participate in on-call rotations

📅 Within one week, you will…

  • Learn all about computer vision, our product, company, customers, and vision.
  • Ship something substantial to an end user
  • Start learning our infrastructure and security practices.

📅 Within one month, you will…

  • Onboard in person with your manager
  • Build your first computer vision project with Roboflow (if you haven’t already)
  • Start contributing to infra-as-code
  • Start working with customers to help with their security questions and onboarding
  • Understand the architecture of Roboflow

📅 Within six months, you will…

  • Attend your first all company onsite
  • Be ramped up on other relevant parts of the Roboflow product.

Who You’ll Be Working With

Our team of ~60 attracts talent like executives that wanted to return to building, founders with a 100M+ exit, Roboflow users turned team members, open source contributors, a cyclist who biked across the United States, prolific high school hackers, a CTO from 100+ engineering organization, amongst many exceptional others.

You will directly be working with our Engineering Lead and a team of product, infrastructure and security engineers.

Where You’ll Work

Roboflow is distributed across the US and Europe. We currently have Hubs in New York City and San Francisco (and plan to open more as we grow density in new cities). We provide opportunities (like team on-sites in different cities) and resources (like a $4000/yr travel stipend) to work in person with other team members as much as you’d like, while also supporting remote team members. You can work from one of our Hubs (we offer a relocation bonus), work from home, work at co-working spaces, etc. We want you to work where you work best!

When You’ll Work

Roboflow primarily operates during the daytime hours in the US and there are some synchronous meetings you’ll be expected to attend each week. Apart from that, we have a flexible schedule that allows you to work collaboratively with other team members and asynchronously when needed.

What You’ll Receive

To determine your salary, we use a number of market and data-driven salary sources. We review all salaries every six months to ensure we stay in line with the market.

💰 The target compensation for this role is USD $165,000 base.

📈 In addition to our cash compensation, we offer generous perks and benefits. Below are some of the highlights:

  • $4000/yr Travel Stipend to travel anywhere anytime to work alongside other Roboflowers
  • $350/mo Productivity stipend to spend on things that make your work environment more productive, like high-speed internet at home or a co-working space
  • Cover up to 100% of your health insurance costs for you and your partner or family
  • Equity in the company so we are all invested in the future of computer vision

Interview Process (~5 hours)

Below is the interview process you can expect for this role. We are all motivated to work with an exceptional team and don’t currently have in-house recruiters. You will be speaking directly with our team about what it’s like to work and thrive at Roboflow. We like to be decisive and work fast, so don’t be surprised if all the below conversations happen over a day or two.

Before the Interview:

  • We’ll review your application, LinkedIn, Github, etc.
  • The best way to stand out is to write about something you’ve built with Roboflow or contribute to one of our open source projects, or highlight your contributions to devtools/infrastructure/security engineering open-source projects.
  • We may send you a technical screen if applicable.

Introduction Phase:

  • [45m] Meet with hiring manager for introduction, Sachin Agarwal, to assess overall mindset and skillset. This first interview is a time to get to know more about the role, allow us to get to know you better, and ensure it’s a good fit for both parties to continue moving forward in the process

Team Interview Phase:

  • [45m] Meet with our CTO, Brad Dwyer
  • [90m] Meet with hiring manager and team for a technical infrastructure hands-on interview

Ask questions!

Final Interview Stage:

  • [45m] Meet with Kate Wagner, Head of Operations for a culture discussion
  • [60m] Meet with Joseph Nelson, CEO
  • We check references and conduct a background check

Note: you are welcome to request additional conversations with anyone you would like to meet and we will accommodate as best we can.

Not sure if this is you?

We want a diverse, global team with a broad range of experience and perspectives. If this job sounds great, but you’re not sure if you qualify, look into our Former Founders role or subscribe to our career newsletter by emailing “Subscribe” to operations[at]roboflow.com. We carefully consider every application and will either move forward with you, find another team that might be a better fit, keep in touch for future opportunities, or thank you for your time.

Learn More About Us

We are building a diverse Distributed team that is distributed across the globe. Roboflow is an equal opportunity workplace; we welcome people from all backgrounds, communities, and experiences.

We provide competitive compensation and stellar benefits to accelerate your personal and work life. Learn more about what it is like to work at Roboflow by reading these blog posts.

See our careers page for all open listings.

Apply now >

This job listing has been manually reviewed by the Jobicy Trust & Safety Team for compliance with our posting guidelines, including verification of the company's legitimacy, accuracy of job details, clarity of remote work policy, and absence of misleading or fraudulent content.

How to apply

Did you apply? Let us know, and we’ll help you track your application.

See a few more

Similar Software Engineering remote jobs

Job Search Safety Tips

Here are some tips to help you search and apply for jobs safely:
Watch out for suspicious jobs Don't apply for jobs that offer high pay for little work or offer to hire you without an interview. Read more ›
Check the employer's profile Make sure you're applying for a trustworthy job by visiting the employer's profile and learning more about them. Read more ›
Protect your information Don't share personal details like your bank account or government-issued ID on suspicious websites or messengers. Read more ›
Report jobs that feel unsafe If you see a job that seems misleading, inappropriate or discriminatory, report it for going against our policies and we'll review it.

Share this job

Jobicy+ Subscription

Jobicy

617 professionals pay to access exclusive and experimental features on Jobicy

Free

USD $0/month

For people just getting started

  • • Unlimited applies and searches
  • • Access on web and mobile apps
  • • Weekly job alerts and digest
  • • Access to additional tools like Bookmarks, Applications, and more

Plus

USD $8/month

Everything in Free, and:

  • • Ad-free experience
  • • Daily job alerts and digest
  • • Personal career consultant
  • • AI-powered job advice
Go to account ›