Staff Site Reliability Engineer – Infrastructure Platform

Remote from
Anywhere 🌎
Job type
Full Time
Opening date
Closing date
11 Apr 2023

About Chainlink

Chainlink is a decentralized blockchain oracle network built on Ethereum.

All roles with Chainlink Labs are globally remote based. We encourage you to apply regardless of your location.
The Infrastructure Platform team enables Chainlink development and empowers engineers to continue building and supporting crucial products and services that have a profound impact on the blockchain industry. Recently, Chainlink crossed $7 trillion TVE (total value enabled) as an undisputed leader in the oracle space. Reliability is vital to the success of our company. As a Staff SRE, you will help us accelerate and enable other engineering teams by increasing self-service and decreasing cognitive load. Key initiatives surrounding our mission include architecting and building a services catalog and an internal developer platform.
This job would be perfect for someone who has a strong DevOps mentality, is passionate about building and maintaining a mature GitOps environment, and has experience building and growing an internal developer platform. The entire engineering team is expanding, and you would have plenty of opportunities to build, learn, and grow.
We are distributed across time zones and continents, and we embrace remote work. Our on-call rotation uses the follow-the-sun pattern: you will be on call some of the time, but your shifts will be during your day, and our team is large.
We all have different backgrounds and are determined to help you succeed no matter where you are or who you are. If you think you would do a great job at Chainlink, we are looking forward to speaking with you, even if you don’t match 100% of the job requirements: those describe people we’ve usually had a great time working with, but they’re not a tick-box exercise.

Your Impact

  • Build and orchestrate large, distributed infrastructure
  • Ensure reliability, security, and performance exceed our defined SLAs
  • Understand what a successful internal developer platform looks like and continue to build and expand upon it from a product- and customer-focused mindset
  • Work with engineers from across the company to help troubleshoot issues, deploy new products and services, and increase velocity while decreasing cognitive load
  • Provide technical leadership across numerous engineering teams
  • Champion reliability and security by taking the time to do your work right the first time

Requirements

  • 7+ years of relevant professional experience. You have probably worked on a DevOps, infrastructure, SRE, and/or platform team before
  • Ability to develop software outside of the scope of typical infrastructure requirements and configurations
  • Have led large cross-team initiatives and can demonstrate a successful track record with quantifiable metrics that impact the business
  • Experience programming in C, C++, Java, Python, Go, Perl, or Ruby
  • Expert knowledge in all aspects of designing, developing, and managing large real-time systems
  • Experience with monitoring and logging. You know how to export metrics using Prometheus, have built a Grafana dashboard or two, and have experience with a centralized logging solution like an ELK Stack, Splunk or LogDNA
  • Experience with distributed systems and container orchestration. You have maintained or even built Kubernetes clusters before and feel comfortable deploying complete new services on them
  • Strong communication skills. You can give and receive constructive feedback, and you do not shy away from planning meetings and code reviews
  • Familiar with most tools from our stack (see below)

Desired Qualifications

  • Excitement for blockchain, Web 3.0, and similar decentralized technologies
  • Experience running any infrastructure in the blockchain/web3 space
  • Ability to scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Experience with internal developer platforms and service catalogs
  • Experience with setting team priorities (OKRs) and aligning business processes required to get a product/service from ideation to production (PRD, RFC, etc.)
  • Experience working remotely in a distributed team
  • A strong desire to grow and challenge yourself. We would expect you to constantly find ways to improve and automate services to reduce toil
