Senior Customer Reliability Engineer – Infra (EST)

Annual salary, USD
165,000 - 185,000
Job function
Technical Support
Job type
Full Time,
Job posted
Apply before
22 Oct 2024
Industry
Computer Software

About Astronomer

The Apache Airflow Company

The Astronomer Customer Reliability Engineering (CRE) team is responsible for the success of our customers’ usage of our managed Airflow service.

The CREs are responsible for operating, monitoring, and maintaining the platform to ensure availability, predictability, and reliable operations.

As an infrastructure specialist within the team, you will focus on the reliability of the underlying cloud infrastructure and Kubernetes clusters. This entails responding to incidents either raised by a customer, or from our monitoring system and then taking further steps to ensure problems are permanently resolved or monitored. As owners of the observability platform, CRE has unlimited potential to improve the reliability of the product and deliver the best possible outcome for our customers.

This role is directly customer-facing and gives exposure to very diverse problems and requirements. CRE get the opportunity to interface with customers from a variety of industries across different cloud providers, and all with different expectations. Your contributions will directly impact customers’ success with using the Astronomer products, and you will be able to help make meaningful improvements to the customer experience.

This position includes a requirement to work from 9AM to 3PM EST, Monday to Friday. Your remaining work time is flexible.

What you get to do:

  • Provide solutions to customers to make them successful using our products.
  • Troubleshoot customer environments and engage in active triaging with customers
  • Participate in on-call rotation for weekend coverage
  • Provide feedback to the product development teams on customer needs and pain points.
  • Build out our monitoring and alerting systems.
  • Build and maintain automation to ensure daily operational tasks are handled as efficiently as possible.
  • Help direct the architecture of the products and contribute where possible.
  • Own the customer experience, working directly with customers to prioritize and solve issues, meet SLAs, and provide β€œwhite glove” guidance on the path to production.
  • Participate remotely within a fully distributed team.
  • Enhance and enrich customer documentation
  • Work with the latest technology and multi-cloud implementations

What you bring to the role:

  • 6 years of experience, preferably with large, complex cloud infrastructures operating at scale
  • 4 years of experience with Kubernetes
  • Experience managing a Production distributed system with at least one major cloud provider (one or all: AWS, GCP, Azure)
  • Strong Linux experience
  • Knowledge of how to operate and monitor issues for distributed systems
  • Previous experience in handling customers issues (internal or external)
  • Strong communication skills
  • DevOps or CI/CD experience
  • Python scripting
  • Good troubleshooting Skills

Bonus points if you have:

  • Experience as a Site Reliability Engineer
  • Worked with Kubernetes Custom Resources
  • Depth of knowledge with Azure
  • Airflow/Big Data Orchestration experience
  • IaC experience

The estimated salary for this role ranges from $165,000-185,000, along with an equity component. This range is merely an estimate, and the width of the range reflects willingness to consider candidates with broad prior seniority. Actual compensation may deviate from this range based on skills, experience, and qualifications.

Apply now >

Megaphone

Personalised job alerts

Set up personalised e-mail alerts about similar remote jobs

FacebookTwitterLinkedIn

How to apply

Did you apply? Let us know, and we’ll help you track your application.

See a few more

Similar remote jobs in Technical Support

Job Search Safety Tips

Here are some tips to help you search and apply for jobs safely:
Watch out for suspicious jobs Don't apply for jobs that offer high pay for little work or offer to hire you without an interview. Read more β€Ί
Check the employer's profile Make sure you're applying for a trustworthy job by visiting the employer's profile and learning more about them. Read more β€Ί
Protect your information Don't share personal details like your bank account or government-issued ID on suspicious websites or messengers. Read more β€Ί
Report jobs that feel unsafe If you see a job that seems misleading, inappropriate or discriminatory, report it for going against our policies and we'll review it.

Share this job

FAQ

What position is Astronomer hiring for?

Astronomer is hiring a remote Senior Customer Reliability Engineer – Infra (EST) from πŸ‡ΊπŸ‡Έ USA

What type of employment does Astronomer offer?

This is a Full Time role.

Network