About Guidewire Software
Navigate what's next.
Summary
At Guidewire, we make software that offers Property and Casualty (P&C) Insurance companies the tools to take care of their customers when they need it the most, whether thatβs a time of crisis, a natural disaster, an accident, or exposure to cyber risks. We build the core applications that insurance companies use to sell and underwrite policies, settle claims, and bill their customers. We also have a portfolio of innovative products serving the needs of P&C insurance companies in areas such as data management, digital online portals, and predictive analytics. We run these products on the Guidewire Cloud Platform, and we help hundreds of insurance providers all over the world to handle billions of dollars of business.
We are proud to be voted a Top Cloud Employer on Glassdoor by our own employees and positioned as a market leader by industry experts like Gartner. We have a fun work environment and a culture that lives by our core values of integrity, rationality, and collegiality.
Weβre searching for people who are as passionate about working together to deliver quality products and support as we are. Join us and enjoy a career where you can make an impact. Youβll be inspired by those around you, and youβll be trusted and empowered to go further.
About the Role
As a Site Reliability Engineer at Guidewire, youβll join a passionate team dedicated to automating every process to ensure our systems run efficiently. Our Platform team is fully committed to developing and managing software that enhances the reliability of production systemsβsystems that serve hundreds of customers and support millions of transactions every day. You will play a key role in ensuring the stability of our flagship cloud platform products while building the tooling necessary for efficient operations and optimal availability of our SaaS multi-tenant, customer-focused systems. In close collaboration with our core product developers, youβll help ensure our cloud products meet both functional and non-functional requirements including availability, performance, observability, and maintainability.
If you thrive on teamwork, embrace responsibility, and have a passion for solving problems at scale with technologies like AWS, Kubernetes, and Aurora, then weβd love to hear from you. Weβre looking for someone who lives by the mantra, “If you have to do something more than once, automate it,” and who is eager to learn and master new tools and concepts. Bonus points if you have experience in production support for a SaaS platform and are comfortable working with cutting-edge, highly containerized, cloud-native environments in AWS.
Job Description
Drive Reliability & Automation:
Take a dedicated SRE approach to managing shared multi-tenant infrastructure for resilient SaaS microservice-based systems and customer-centric applications.
Oversee and continuously enhance our teamβs presence in AWS by automating deployment and operational tasks.
Innovate and Improve Core Systems:
Contribute to the development of our core infrastructure systemsβadding features, fixing bugs, and implementing reliability enhancements.
Engineer and maintain a complex single sign-on (SSO) authentication platform based on SAML/OAuth to ensure secure, seamless access for our users.
Enhance Observability & Incident Management:
Build and maintain comprehensive observability tooling, metrics, and dashboards to support our global platform infrastructure.
Improve our incident management lifecycle by identifying, mitigating, and learning from reliability risks, while helping to create a self-healing environment.
Empower the Team:
Develop system documentation and training materials to educate and empower your teammates.
Collaborate with various engineering teams, providing valuable feedback and contributing code when needed to enhance our products.
Technically Skilled:
You hold a Bachelorβs Degree in Computer Science or a related field.
You have proven software engineering and automation skills using Bash, Python, and/or Go.
Youβre well-versed in agile development methodologies (Scrum, Kanban, etc.) and have a deep background in Linux systems.
Cloud & DevOps Savvy:
You bring significant experience in automating and managing systems on Amazon Web Services (AWS) and supporting live production environments (Java/Apache/Tomcat).
You are proficient with Infrastructure as Code (IaC) tools such as Terraform, Terragrunt, or Terraspace, and have used devops/gitops tools (Git, Bitbucket, Flux CD, TeamCity) for smooth code promotions.
You have hands-on experience in containerization (Docker, Helm, Kubernetes/EKS, CNI, and Ingress networking) and a strong understanding of Single-Sign On, SAML, and OAuth (bonus if youβve worked with Okta).
Observability & Database Knowledge:
You are experienced with observability tools (Datadog, CloudWatch, PagerDuty) and familiar with event store/stream-processing technologies like Kafka or AWS SQS.
You have worked with relational databases such as Aurora Postgres or Oracle RDS and possess advanced exposure to application development, web UI design, JSON, and overall application architecture.
Exposure to Open Application Model systems like KubeVela or Crossplane is a plus.
A Collaborative Problem Solver:
You prefer writing robust code over clicking through a GUI and enjoy mentoring others.
Your outstanding troubleshooting skills, analytical mindset, and process-driven approach enable you to solve complex problems effectively.
You are a proactive team player with excellent communication skills, capable of explaining complex technical concepts to a varied audience.
You champion a culture of reliability by promoting practices such as blameless postmortems, SLO tracking, and continuous learning from incidents.
About Guidewire
Guidewire is the platform P&C insurers trust to engage, innovate, and grow efficiently. We combine digital, core, analytics, and AI to deliver our platform as a cloud service. More than 540+ insurers in 40 countries, from new ventures to the largest and most complex in the world, run on Guidewire.
As a partner to our customers, we continually evolve to enable their success. We are proud of our unparalleled implementation track record with 1600+ successful projects, supported by the largest R&D team and partner ecosystem in the industry. Our Marketplace provides hundreds of applications that accelerate integration, localization, and innovation.
For more information, please visit www.guidewire.com and follow us on Twitter: @Guidewire_PandC.
Guidewire Software, Inc. is proud to be an equal opportunity and affirmative action employer. We are committed to an inclusive workplace, and believe that a diversity of perspectives, abilities, and cultures is a key to our success. Qualified applicants will receive consideration without regard to race, color, ancestry, religion, sex, national origin, citizenship, marital status, age, sexual orientation, gender identity, gender expression, veteran status, or disability. All offers are contingent upon passing a criminal history and other background checks where it’s applicable to the position.
Annual salary information is not provided for this position. Explore salary ranges for similar roles in our Salary Directory βΊ
This job listing has been manually reviewed by the Jobicy Trust & Safety Team for compliance with our posting guidelines, including verification of the company's legitimacy, accuracy of job details, clarity of remote work policy, and absence of misleading or fraudulent content.
For safety tips, see our guides, and please let us know if you need any assistance.
Create a free account with us to save a history of all jobs you've shown interest in.
You can also continue as a guest if you prefer.
Similar Software Engineering remote jobs
Jobicy
578 professionals pay to access exclusive and experimental features on Jobicy
Free
USD $0/month
For people just getting started
Plus
USD $8/month
Everything in Free, and: