Experience
View ResumeAssociate Site Reliability Engineer
2025 – Present• Participating in an on-call rotation to support production and non-production AWS EKS environments, restoring production availability and unblocking Dev/QA teams• Led a production deployment of an AI application on AWS EKS, provisioning infrastructure (VPC, RDS, MongoDB, Bedrock, S3), observability (Redis metrics, Grafana dashboards, alerts), DNS records in Cloudflare, and a Jenkins job for new releases• Configured NVIDIA GPU runtime and KEDA autoscaling for AI inference worker pods on AWS EKS• Developed a Terraform module for Asterisk deployments to eliminate manual provisioning and enable fast, consistent setup of EC2, load balancing, and auto-scaling resources• Refactored a Python CLI tool to report EBS volume usage across our AWS accounts and regions, enabling identification of unused volumes and cost-saving opportunities• Created ServiceMonitors for production microservices and deployed Elasticsearch and Redis exporters via Helm charts to monitor service health and performance• Unified environment-specific Grafana dashboards to reduce configuration drift and maintenance overhead
Sr. Associate DevOps Engineer
2024 – 2025• Led rotation of team and client Certificate Authorities, coordinating with stakeholders to define validation processes and ensure uninterrupted certificate issuance; mentored team members on key steps• Built Grafana dashboards to monitor F5 pool health, identifying a missing pool member
Associate DevOps Engineer
2023 – 2024• Provided HashiCorp Vault Enterprise as a critical security service to our infrastructure organization in 6 on- premise Linux environments, each handling millions of daily requests• Developed a Prometheus metric to track live Vault cluster nodes, integrated it into alerts & Grafana dashboards to automate observability updates during failovers, and cluster changes; saving our engineers 20 minutes per maintenance cycle by eliminating dashboard and alert regeneration• Developed a Threat Model for our Vault service using the STRIDE Framework, identifying risks and defining mitigation plans to improve the security posture of our clusters• Led internal client consultations for Vault integration, providing guidance on secure access patterns• Wrote and deployed secure configurations for our clients and servers using Terraform and Chef• Defined Availability and Latency SLIs to track performance insights of our Vault service
DevOps Engineer Intern
2022 – 2022• Automated the Terraform deployment workflow for onboarding internal clients to Vault by deploying Atlantis in Workday’s Private Cloud, integrating it with GitHub PRs, and enforcing security requirements via custom Conftest policies