This is a remote position.
Our GenAI platforms provide mission-critical insight for our clients, enabling them to identify and seize opportunities to grow their brands. We are seeking a talented Senior DevOps Engineer to enrich our Central DevOps practice, to provide thought leadership in driving the future of our DevOps centre of excellence.- Responsible for designing, building, and optimising the automation processes for provisioning infrastructure and applications – via infrastructure-as-code (IaC)
- Standardise and streamline build and release pipelines – CICD
- Configure and administer platforms and services
- Support operational activities by ensuring platforms and infrastructure for pipelines are optimal, recoverable, and easily scalable to meet the capacity demands
- Provide incident management oversight – root cause analysis, stakeholder communications, post-mortems, and manage preventative measures and resolutions
- Actively involved in regular deployments of platform and pipeline artefacts to ensure frequent and incremental value delivery while avoiding unnecessary accumulation of technical or operational debt
- Ensure security, high availability, and disaster recovery are always front of mind
- Continuous monitoring of cost and risk – to reduce and mitigate
- Ensure production and non-production environments stay in sync through the alignment of stable standards, code, and configurations
- Identify relevant emerging trends and build compelling cases for adoption, e.g., tool selection
- Involved in PoCs, prototypes, and innovation spikes to seek directional outcomes
- Engage with peer reviews of your code, and peer review other engineers’ code
- Help define and champion strategies and best practices to support best-in-class engineering practices and operational efficiency for all services
- Mentor and support other engineers, helping to grow our collaborative environment
- Identify and assess risk, reliability, and/or uncertainty in the design, system, or process, including the impact of change
Requirements
- 6+ years of experience in a similar role is essential
- Proficient with CICD toolchains (e.g., Azure DevOps, Jenkins, Git, Artefactory, etc.)
- Proficient in one or more scripting languages for automation (e.g., Linux Bash, PowerShell, Python)
- Proficient in provisioning platforms via IaC techniques (e.g., Terraform, YAML, Azure Resource Manager (ARM))
- Working experience configuring, securing, and administering platforms in Azure; knowledge in Cloud infrastructure and networking principles (e.g., Azure PaaS, IaaS)
- Demonstrable knowledge of working with distributed data platforms (e.g., Azure ADLS, Data Lakes)
- Experience working with vulnerability management and code-inspection tooling (e.g., Snyk, SonarQube)
- Possess an “automation-first” mindset when building solutions; considerations for self-healing and fault-tolerant methods to minimize manual intervention and downtime
- Ability to work autonomously in an environment of constant change.
- Experience building/maintaining an API-led event-driven architecture, e.g., using Azure Event Grid, Azure Functions, is desirable
- Cluster image container management (e.g., Azure Container Registry) is desirable
- Good understanding of network configuration – DNS, Routing, VPN, Firewalls, Endpoint management is desirable
- Experience in including vulnerability management within the deployment pipelines, for package dependencies and containers, is desirable
- Experience in implementing custom Data Observability, capturing telemetry to understand the health of data and pipelines better, is desirable
- Familiarity with Microsoft Azure DevOps is desirable