About [Astronomer](https://jobicy.com/company/astronomer)

The Apache Airflow Company

* [Computer Software](https://jobicy.com/company-category/software)
* 2015

Actively Hiring. Verified job posting: this job post has been [manually reviewed](https://jobicy.com/tools/help-center/employee/how-does-jobicy-verify-the-legitimacy-of-remote-job-listings) for authenticity and compliance.

### AI Summary

Astronomer is seeking a Customer Reliability Engineer (CRE) to ensure the reliability and success of their managed Apache Airflow service. This role involves operating and maintaining cloud infrastructure and Kubernetes clusters, responding to incidents, and working directly with customers to troubleshoot issues. The ideal candidate has 5+ years of experience with large-scale cloud infrastructures, strong Kubernetes skills, and a customer-facing mindset. This is a fully remote position with on-call rotation, offering an opportunity to impact customer experience and contribute to product improvements.

### Role DNA

* Job Complexity: Easy to Hard
* Pace & Pressure: Relaxed to Fast-paced
* Autonomy Level: Guided to Full Ownership
* Communication Load: Independent to Highly Collaborative

AI Insight: This role requires a high level of technical expertise in Kubernetes, cloud infrastructure, and incident response, along with strong customer communication skills, which makes it challenging, though not at the hardest level.

### Salary Analysis

* Median: USD 127,500
* US market rate: USD 100k – 180k

AI Insight: The offered salary range of $125,000-$130,000 is competitive and aligns well with the market for a Customer Reliability Engineer role with 5+ years of experience. The median of $127,500 is slightly above the market median for similar roles, reflecting the specialized skills required.

### Key Skills

Kubernetes, Cloud Infrastructure, Incident Response, Customer Success, Python, DevOps, Monitoring and Alerting, Distributed Systems, Linux, Site Reliability Engineering

### Cover Letter Sample

Dear Hiring Manager,

I am writing to express my strong interest in the Customer Reliability Engineer - Infrastructure role at Astronomer. With over 5 years of experience managing large-scale cloud infrastructures and deep expertise in Kubernetes, I am confident in my ability to ensure the reliability and success of your managed Airflow service. I have a proven track record of incident response, automation, and direct customer engagement, which aligns closely with the responsibilities outlined in the job description.

In my previous role, I successfully maintained distributed systems across AWS and GCP, reducing incident resolution time by 30% through improved monitoring and automation. I also collaborated closely with customers to troubleshoot complex issues, ensuring high satisfaction and adherence to SLAs. My strong communication skills and technical proficiency make me an ideal fit for this customer-facing position.

I am excited about the opportunity to contribute to Astronomer's mission of empowering data teams and would welcome the chance to discuss how my skills and experience can benefit your team. Thank you for your consideration.

Sincerely,
Your Name

### Possible Interview Questions

Q: Describe your experience with Kubernetes in a production environment. How have you managed scaling and reliability?

A: I have worked with Kubernetes for over 3 years, managing clusters with hundreds of nodes. I implemented autoscaling using Horizontal Pod Autoscaler and cluster autoscaler, and ensured reliability through proper resource limits, readiness probes, and rolling updates. I also used Prometheus and Grafana for monitoring.

Q: How do you approach incident response and on-call rotations? Can you give an example of a critical incident you resolved?

A: I follow a structured incident response process: first, stabilize the system to reduce impact, then diagnose the root cause. For example, we once had a database crash due to a memory leak; I quickly scaled up resources to restore service, then worked with the dev team to fix the leak and added monitoring alerts to prevent recurrence.

Q: What strategies do you use to communicate technical issues to non-technical customers?

A: I focus on clear, jargon-free language and provide status updates in terms of impact on their business. I also use visual aids like timelines and dashboards. For example, during a latency issue, I explained the problem as 'a slowdown in data processing' and gave regular updates on recovery progress.

Q: How would you improve the observability of a managed Airflow service?

A: I would start by ensuring all metrics (CPU, memory, network) are captured, along with application-specific metrics like DAG success rates and task durations. I'd implement distributed tracing for workflow steps and set up dashboards with alerts for anomaly detection. I'd also work on log aggregation for quick debugging.

Q: Describe a time you automated a routine operational task. What was the outcome?

A: I automated the process of scaling Kubernetes nodes by writing a Python script that integrated with cloud APIs to adjust node groups based on usage patterns. This reduced manual intervention by 80% and improved cost efficiency by ensuring resources matched demand closely.

Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow®. Astro accelerates building reliable data products that unlock insights, unleash AI value, and power data-driven applications. Trusted by more than 800 of the world’s leading enterprises, Astronomer lets businesses do more with their data. To learn more, visit [www.astronomer.io](http://www.astronomer.io).

## About this role

The Astronomer Customer Reliability Engineering (CRE) team is responsible for the success of our customers’ usage of our managed Airflow service.

The CREs are responsible for operating, monitoring, and maintaining the platform to ensure availability, predictability, and reliable operations.

As an infrastructure specialist within the team, you will focus on the reliability of the underlying cloud infrastructure and Kubernetes clusters. This entails responding to incidents raised either by a customer or by our monitoring system, and then taking further steps to ensure problems are permanently resolved or monitored. As owners of the observability platform, CRE has unlimited potential to improve the reliability of the product and deliver the best possible outcome for our customers.

This role is directly customer-facing and gives exposure to very diverse problems and requirements. CREs get the opportunity to interface with customers from a variety of industries and across different cloud providers, all with different expectations. Your contributions will directly impact customers’ success using the Astronomer products, and you will be able to help make meaningful improvements to the customer experience.

## What you get to do:

* Provide solutions to customers to make them successful using our products.
* Troubleshoot customer environments and engage in active triaging with customers.
* Participate in on-call rotation for weekend coverage.
* Provide feedback to the product development teams on customer needs and pain points.
* Build out our monitoring and alerting systems.
* Build and maintain automation to ensure daily operational tasks are handled as efficiently as possible.
* Help direct the architecture of the products and contribute where possible.
* Own the customer experience, working directly with customers to prioritize and solve issues, meet SLAs, and provide “white glove” guidance on the path to production.
* Participate remotely within a fully distributed team.
* Enhance and enrich customer documentation.
* Work with the latest technology and multi-cloud implementations.

## What you bring to the role:

* 5 years of experience, preferably with large, complex cloud infrastructures operating at scale
* 3 years of experience with Kubernetes
* Experience managing a production distributed system with at least one major cloud provider (one or all: AWS, GCP, Azure)
* Strong Linux experience
* Knowledge of how to operate distributed systems and monitor them for issues
* Previous experience handling customer issues (internal or external)
* Strong communication skills
* DevOps or CI/CD experience
* Python scripting
* Good troubleshooting skills

## Bonus points if you have:

* Experience as a Site Reliability Engineer
* Worked with Kubernetes Custom Resources
* Depth of knowledge with Azure
* Airflow/Big Data Orchestration experience
* IaC experience

The estimated total compensation for this role ranges from $125,000 to $130,000 based on leveling and geography, along with an equity component and a comprehensive benefits package. This range is merely an estimate; actual compensation may deviate from this range based on skills, experience, and qualifications.

At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

[Apply now >](https://jobicy.com/jobs/147812-customer-reliability-engineer-infrastructure)

This job listing has been manually reviewed by the Jobicy Trust & Safety Team for compliance with our posting guidelines, including verification of the company's legitimacy, accuracy of job details, clarity of remote work policy, and absence of misleading or fraudulent content.
