Site Reliability Engineer – Senior

Time zone
Anywhere 🌎
Full Time
Opening date
Closing date
28 Oct 2021

As a Site Reliability Engineer – Senior, you will focus on developing solutions to complex business monitoring problems in Splunk ITSI, directly supporting the efforts of other SREs and Enterprise Command Center (ECC) monitoring initiatives. A successful candidate will be able to lead business and technical system owners through the identification of Key Performance Indicators (KPIs) that will be used for service mapping and the generation of system health scores. A background in statistics and mathematics is required to leverage Splunk’s machine learning capabilities, and the candidate must understand which models and techniques should be used to instrument the given applications and set appropriate alerting thresholds. This position will be dedicated to Splunk support and will also support the migration from Splunk SaaS to a GovCloud-based Splunk instance.


  • Translate business requirements, service level agreements (SLAs), and service level objectives (SLOs) into monitoring requirements.
  • Apply technical expertise to develop solutions to business problems as an organic part of the organization’s operational and functional baseline.
  • Develop a template-based approach to service mappings in ITSI.
  • Utilize Splunk ITSI to create dynamic thresholds and interface with data scientists if a more advanced statistical model is required.
  • Support Major Incidents by adjusting existing or instrumenting new monitoring to address monitoring deficiencies.
  • Support Problem Management’s enterprise root cause analysis (RCA) processes in collaboration with appropriate Office of Information and Technology (OIT) organizations.
  • Capture technical information from the relevant stakeholders and synthesize it into useful information in various formats for OIT senior management and other VA components.


Education and Experience:

  • Master’s degree preferred in Business Administration, Business Management, Computer Science, Information Systems, Information Resource Management, Industrial Engineering, Operations Research, or a related field
  • 2+ years of experience with Splunk ITSI
  • 5+ years of relevant experience
  • Certifications in relevant software development or analytics plus 3-5 years of relevant experience
  • 8 to 10 years of relevant experience may be substituted for education (13-15 years total)


  • Ability to develop and implement service dependencies, service maps, KPIs, and thresholds in Splunk ITSI Service Analyzer and Glass Tables.
  • Experience designing and implementing orchestration and automation
  • Experience with other modern performance monitoring and diagnostics tools (for example, AppDynamics, Dynatrace, Wireshark)
  • Technical expertise across multiple technology areas and the ability to diagnose complex issues that span many technologies.
  • Must be able to identify risks to applications being instrumented and monitored
  • Must be able to provide oral and written discussion of analytical findings using narrative and graphic forms.
  • Must be able to use qualitative and quantitative analytical skills to assess the effectiveness of system operations.
  • Identifying symptoms for process improvement.
  • Analytical, investigative, and organizational skills
  • Communication skills, including the ability to craft content for executive-level presentations.
  • IT background and ability to understand technical content.

We are an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or veteran status.
