Shubh Malhotra

Software Engineer — DevOps / SRE

Software Engineer (DevOps/SRE) with 5+ years of experience in cloud infrastructure automation, CI/CD, observability, and reliability engineering. Reduced deployment time by 60%, cut cloud costs by ~20%, and implemented observability for large-scale microservices.

Experience

Software Engineer (DevOps) — Superhero Tech Pvt. Ltd., Bengaluru

Jul 2025 – Present

  • Engineered highly available EKS workloads for 50+ microservices; secured multi-account AWS infrastructure (IAM, WAF, CloudFormation).
  • Automated Jenkins CI pipelines and implemented Grafana/Prometheus/Loki observability, lowering MTTR by ~35%.

Software Engineer (Senior SRE) — Superhero Tech Pvt. Ltd., Bengaluru

Apr 2022 – Jul 2025

  • Administered AWS infrastructure (100+ EC2 instances, RDS, Lambda, EKS); led blue-green deployments and scaling optimizations that cut spend by 20%.

Software Engineer (SRE) — Treebo Hotels, Bengaluru

Dec 2021 – Apr 2022

  • Built Jenkins pipelines for ECS/EC2/S3 and monitoring dashboards with Grafana and CloudWatch.

Advanced DevOps & SRE Skills

Curated set of practical skills, playbooks and examples you can reference in interviews or docs.

Kubernetes (EKS) — Production Patterns

Multi-tenant namespaces, network policies, PodDisruptionBudgets, HorizontalPodAutoscaler with custom metrics, VerticalPodAutoscaler for stateful workloads, and GitOps-driven manifests (Helm + Kustomize).
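
As an interview-ready illustration of the HorizontalPodAutoscaler item above, here is a minimal sketch of the replica calculation that Kubernetes documents for the HPA: desired replicas = ceil(current replicas × current metric / target metric), with no scaling while the ratio stays within a tolerance (0.1 by default). The function name, the parameter names, and the min/max bounds are assumptions chosen for illustration, not taken from any production setup.

```python
import math


def desired_replicas(
    current_replicas: int,
    current_metric: float,
    target_metric: float,
    min_replicas: int = 1,      # hypothetical bound, mirrors spec.minReplicas
    max_replicas: int = 10,     # hypothetical bound, mirrors spec.maxReplicas
    tolerance: float = 0.1,     # Kubernetes' documented default tolerance
) -> int:
    """Approximate the documented HPA scaling rule for a single metric."""
    if target_metric <= 0:
        raise ValueError("target_metric must be positive")

    ratio = current_metric / target_metric

    # Within tolerance of the target: keep the current replica count.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas

    # Scale proportionally, round up, then clamp to the configured bounds.
    desired = math.ceil(current_replicas * ratio)
    return max(min_replicas, min(max_replicas, desired))


# Example: 4 pods averaging 200 requests/s each against a 100 requests/s target.
print(desired_replicas(4, 200, 100))  # → 8
```

The real controller also handles missing metrics, pods that are not yet ready, and stabilization windows; this sketch covers only the core ratio.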

CI/CD — Declarative, Secure Pipelines

Jenkins & GitHub Actions templates implementing PR gating, canary & blue-green releases, automated rollbacks, artifact signing, and secrets via HashiCorp Vault / AWS Secrets Manager.

Infrastructure as Code

Modular CloudFormation + Terragrunt patterns; immutable infra; policy-as-code with OPA/Gatekeeper for compliance checks.

Observability & SLO-driven Engineering

Prometheus metrics + Loki logs + Grafana dashboards; SLOs & error budgets tied to alerting and release decisions. Automated runbooks (PagerDuty + Opsgenie integrations).
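
To make "error budgets tied to release decisions" concrete, the following is a minimal sketch of the arithmetic under stated assumptions: a request-based availability SLO, request counts already aggregated for the SLO window (for example from Prometheus), and a hypothetical rule that releases pause once less than 10% of the budget remains. All names and the 10% threshold are illustrative.

```python
from dataclasses import dataclass


@dataclass
class ErrorBudget:
    allowed_failures: float    # failures the SLO tolerates in the window
    consumed_fraction: float   # share of the budget already spent
    remaining_fraction: float  # share of the budget still available
    release_allowed: bool      # result of the hypothetical release gate


def error_budget(
    slo_target: float,                        # e.g. 0.999 for "99.9% of requests succeed"
    total_requests: int,
    failed_requests: int,
    min_remaining_to_release: float = 0.10,   # hypothetical gate threshold
) -> ErrorBudget:
    if not 0 < slo_target < 1:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")

    # The budget is the fraction of requests that is allowed to fail.
    allowed = (1 - slo_target) * total_requests
    consumed = failed_requests / allowed
    remaining = 1 - consumed

    return ErrorBudget(
        allowed_failures=allowed,
        consumed_fraction=consumed,
        remaining_fraction=remaining,
        release_allowed=remaining >= min_remaining_to_release,
    )


# Example: a 99.9% SLO over 1,000,000 requests tolerates about 1,000 failures.
report = error_budget(0.999, 1_000_000, 250)
print(report.release_allowed)
```

A negative remaining fraction means the budget is overspent; in that case the gate stays closed until the window rolls forward.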

Cost & Capacity Optimization

Automated rightsizing pipeline, spot instance strategies, cluster autoscaler tuning, and scheduled non-prod shutdowns to reduce costs without impacting SLAs.
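
The scheduled non-prod shutdown idea can be sketched as a pure decision function that a scheduler (a cron job or a Lambda, for instance) could call before starting or stopping resources. The environment names, the weekday 08:00 to 20:00 window, and the function name are assumptions for illustration; the actual schedule is not described here.

```python
from datetime import datetime, time

# Hypothetical policy: production is never stopped by the schedule.
ALWAYS_ON_ENVIRONMENTS = {"prod"}

# Hypothetical working window for non-prod environments (local time).
WORK_START = time(8, 0)
WORK_END = time(20, 0)


def should_be_running(environment: str, now: datetime) -> bool:
    """Return True if the environment should be up at the given moment."""
    if environment in ALWAYS_ON_ENVIRONMENTS:
        return True

    # datetime.weekday(): Monday is 0, Sunday is 6.
    is_weekday = now.weekday() < 5
    in_window = WORK_START <= now.time() < WORK_END
    return is_weekday and in_window


# Example: a staging environment checked on a Monday morning and a Saturday.
print(should_be_running("staging", datetime(2025, 1, 6, 10, 0)))  # → True
print(should_be_running("staging", datetime(2025, 1, 4, 10, 0)))  # → False
```

Keeping the decision separate from the AWS calls makes the schedule easy to unit test and to override per environment.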

Security & Compliance

IAM least-privilege policies, ECR vulnerability scanning, AWS WAF rules, automated infra scanning (tfsec, checkov), and runtime security with Falco.
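
One simple least-privilege check is to flag IAM policy statements that allow wildcard actions on all resources. The sketch below works on a policy document that is already parsed into a Python dict; it relies only on the standard IAM JSON policy shape (Statement, Effect, Action, Resource, each of which may be a single value or a list). The function names and the flagging heuristic are assumptions for illustration and are far narrower than what tools such as checkov evaluate.

```python
from __future__ import annotations


def _as_list(value) -> list:
    """IAM allows a single value or a list; normalize to a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def find_wildcard_statements(policy: dict) -> list[int]:
    """Return indexes of Allow statements with wildcard actions on Resource "*"."""
    flagged = []
    for index, statement in enumerate(_as_list(policy.get("Statement"))):
        if statement.get("Effect") != "Allow":
            continue

        actions = _as_list(statement.get("Action"))
        resources = _as_list(statement.get("Resource"))

        # Hypothetical heuristic: "*" or "service:*" actions count as wildcards.
        has_wildcard_action = any(a == "*" or a.endswith(":*") for a in actions)
        if has_wildcard_action and "*" in resources:
            flagged.append(index)
    return flagged


# Example policy: the first statement is scoped, the second is overly broad.
example_policy = {
    "Version": "2012-10-17",
    "Statement": [
        {"Effect": "Allow", "Action": "s3:GetObject",
         "Resource": "arn:aws:s3:::example-bucket/*"},
        {"Effect": "Allow", "Action": ["ec2:*"], "Resource": "*"},
    ],
}
print(find_wildcard_statements(example_policy))  # → [1]
```

A check like this could run in CI next to tfsec or checkov as a fast first filter; it does not evaluate conditions, NotAction, or permission boundaries.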

Pro Tips: Implement robust dependency tracking, enforce measurable SLOs, and continuously refine operational workflows.

Selected Projects

Cost Optimization Dashboard

Cost observability using Grafana + CloudWatch + AWS Cost Explorer APIs with automated rightsizing recommendations (reduced monthly cloud costs by ~25%).

Database Migration Automation

Python-based framework using AWS DMS for zero-downtime migrations, provisioning RDS and orchestrating schema syncs.

Contact

Email: shubhmalhotra07@gmail.com • Phone: +91-9654910542
