I am a Site Reliability and DevOps Engineer with hands-on experience managing containerized workloads and Kubernetes-based platforms. I specialize in cloud infrastructure on AWS and other providers, focusing strongly on observability tools such as Prometheus, Grafana, and CloudWatch. My expertise includes incident response and Infrastructure as Code using Terraform and Ansible.
I have built and maintained CI/CD pipelines, automating operational tasks with Python and Bash to improve deployment efficiency and reliability. I collaborate effectively with cross-functional teams to enhance system scalability and uptime in distributed systems.
Currently, I work as a Cloud DevOps Engineer at Ismile Technologies, where I administer cloud infrastructure and containerized workloads, improving system reliability significantly. I operate Kubernetes environments and optimize observability through dashboards and alerts.
Previously, I interned as a Cloud Engineer and DevSecOps Intern, where I gained experience in infrastructure management, security automation, and threat detection. I have developed automated response playbooks and integrated security tools into CI/CD pipelines to ensure secure deployments.
I am passionate about leveraging cloud technologies and automation to reduce downtime and improve operational workflows. I continuously seek opportunities to enhance system reliability and security while driving efficiency through automation and best practices.
CGPA: 8.3/10
Administer and support cloud infrastructure and containerized workloads across AWS and Linode, improving system reliability and reducing downtime by 89%. Operate and optimize Kubernetes-based environments and Docker deployments, configuring cAdvisor, Prometheus, Grafana, and CloudWatch dashboards and alerts. Develop and maintain CI/CD pipelines using Jenkins, Git, GitHub, and Azure DevOps. Own and implement Infrastructure as Code using Terraform and Ansible. Automate operational workflows using Python and Bash, improving efficiency by 75%. Participate in incident handling, monitoring logs, alerts, and dashboards, escalating issues and contributing to post-incident reviews.
Managed Linode-based production infrastructure, ensuring uptime and performance monitoring. Implemented proactive monitoring and alerting using Prometheus and Grafana, reducing downtime incidents by 35%. Maintained CI/CD pipelines across AWS and Microsoft Azure, improving deployment throughput by 30%. Troubleshot and optimized Docker containers, refined Dockerfiles, managed logs, and integrated Nginx. Automated Blockchain node operations with Bash scripts, reducing manual overhead.
Built Python-based automation for log analysis and threat detection, reducing manual effort by 31.2% and improving incident response time by 40%. Streamlined security data processing using Pandas and NumPy, increasing analysis efficiency by 25.6%. Performed security investigations using Matplotlib and Splunk, identifying critical threat indicators and reducing mean time to detect by 20%. Integrated SAST and DAST tools into CI/CD pipelines to enforce 100% code coverage for security checks. Developed automated response playbooks with Python and Ansible, reducing remediation time by 35%.
Jobicy
592 professionals pay to access exclusive and experimental features on Jobicy
Free
USD $0/month
For people just getting started
Plus
USD $8/month
Everything in Free, and: