I am a Site Reliability Engineer (SRE) with almost a decade of hands-on experience delivering scalable, secure, and observable infrastructure. I have proven expertise in Azure, AWS, Terraform, AKS, Azure DevOps, GitLab CI/CD, and monitoring and alerting tools. My strong background in infrastructure as code, incident response, self-healing systems, and release automation has allowed me to excel in my field. I am skilled in cross-functional collaboration, root cause analysis, and continuous service improvement. I am adept with PowerShell, Azure CLI, and modern DevOps practices, including SDLC and CI/CD. I am known for reducing operational toil, enabling developer velocity, and mentoring infrastructure teams.
Built and maintained scalable CI/CD pipelines using Azure DevOps, GitLab CI, PowerShell, and Docker; integrated automated testing and monitoring into the pipeline lifecycle. Led incident response for critical outages, cutting MTTR by nearly 50% through automation and streamlined escalation procedures. Implemented automated monitoring and alerting. Designed, configured, deployed, and maintained infrastructure for over 100 applications and services, ensuring 99.982% yearly uptime. Developed and maintained infrastructure as code, improving deployment efficiency and consistency. Conducted post-mortem analysis to identify root causes and prevent recurrence of incidents. Introduced structured release pipelines, versioning, and rollback plans across core systems. Led DevOps transformation by introducing infrastructure-as-code (Terraform) and automation with Ansible and PowerShell. Managed Kubernetes clusters and implemented self-healing systems with alert-based remediation. Collaborated with teams to define Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets for applications. Mentored a 10-member infrastructure team; introduced automation strategies that reduced manual workload.
Managed on-premises and cloud-hosted services, including Active Directory, Exchange, and Microsoft 365 stack. Developed internal automation scripts to patch and monitor critical services. Collaborated with system owners, key vendors, and MSPs to ensure seamless service delivery. Monitored infrastructure health β verifying hardware, server resources, and virtualization platforms β and conducted log reviews. Led upgrade and hardening efforts of Exchange, Windows & Linux operating systems and other services. Deployed and maintained SQL Server Always On availability groups, implementing backup and recovery using Veeam Backup & Replication. Provided on-call support for production systems.
Installed, configured, maintained, and supported MS Windows Server and Linux operating systems family. Maintained virtualization platforms such as VMware, Hyper-V. Led upgrade and hardening efforts of Exchange, Windows & Linux, and endpoint protection services. Managed and applied software patches and updates across various systems regularly.
Provided IT support to over 1,500 internal staff by responding to support calls and resolving technical issues. Managed and configured Windows 7/8.1/10 client systems and a variety of software applications for end users. Handled user support requests through a ticketing system, ensuring timely resolution and documentation of incidents and service requests.
Jobicy
571 professionals pay to access exclusive and experimental features on Jobicy
Free
USD $0/month
For people just getting started
Plus
USD $8/month
Everything in Free, and: