At Thumbtack, engineers at every level build products and systems that directly impact our customers and professionals. Our challenges span a wide variety of areas, ranging from building search and booking experiences, to optimizing pricing systems, to building tools to help professionals grow their businesses. We believe in tackling these hard problems together as a team, with strong values around collaboration, ownership and transparency. To read more about the hard problems that our team is taking on, visit our engineering blog.
About the Role
The Site Reliability Team’s mission is to design the next version of the core operations infrastructure that enables the success of all Thumbtack engineers. In this role you will be an integral part in designing high performance, high availability, high throughput, and reliable systems with the goal of minimizing service interruption, downtime, and latency. You will be part of a collaborative culture that succeeds together and requires bringing your authentic self.Challenge
The platform team at Thumbtack is building the next iteration developer experience in the business that enables the entire engineering organization to thrive. Site Reliability Engineering plays a key role up and down the stack to enable this transition as we strive to boost organizational productivity and velocity.Responsibilities
- Design, create, and maintain software to improve the availability, scalability, and efficiency of Thumbtack’s services
- Set the architectural direction of infrastructure and platform teams while supporting the engineering organization
- Design and implement tools and processes used for deployment, change management, and incident response
- Own, maintain, and continuously improve all systems provided as a service, such as monitoring, log aggregation, and datastores
- Engage in service capacity planning and demand forecasting, anticipating performance bottlenecks
- Participate in rotating on-call duties
What you’ll need
If you don’t think you meet all of the criteria below but still are interested in the job, please apply. Nobody checks every box, and we’re looking for someone excited to join the team.- Minimum of 5 years of industry experience in systems engineering, software engineering, or equivalent
- Experience writing, shipping, and operating production impacting services in a high level programming language like but not limited to: Python, Go, PHP, Java, Javascript, or Scala
- Demonstrated knowledge of AWS fundamentals
- Expertise in designing, analyzing, and troubleshooting large-scale distributed systems across web technologies like: DNS, TLS, HTTP/S, TCP/IP
- Demonstrable knowledge of docker, dockerfiles, container management and orchestration, and service mesh
- Track record of managing projects while communicating with stakeholders
Bonus points if you have
- Experience working with GCP
- Understanding of usage and benefits of Infrastructure and configuration as Code
- Passion for reducing toil and improving developer experience
Benefits & Perks
- Virtual-first working model coupled with in-person events
- 20 company-wide holidays including a week-long end-of-year company shutdown
- Library (optional use collaboration & connection hub) in San Francisco
- WiFi reimbursements
- Cell phone reimbursements (North America)
- Employee Assistance Program for mental health and well-being