Jatin Sharma
DevOps,MLOps & Cloud Engineer passionate about automation, scalability, Observability and building reliable infrastructure that empowers development teams.
About
I specialize in DevOps, Cloud, Infrastructure Automation and MLops with hands-on expertise in tools like Kubernetes, Terraform, Ansible, Docker, CI/CD pipelines and Observability. I design and implement cloud-native architectures that are scalable, secure, and resilient with a strong focus on efficiency and reliability. As an open-source contributor, I actively collaborate with the global tech community, sharing solutions and tackling real-world challenges together. I'm driven by a mission to bridge development and operations through automation, helping teams ship faster, safer, and smarter in the cloud.
Work Experience
Opstree SolutionsRemoteLead
Lead DevOps & MLOps Engineer
- ▸Architected MLOps foundations using Python, DVC, and MLflow for secure data processing, reproducible training pipelines, and model lifecycle management.
- ▸Cars24 (Client): Executed migration of 200+ production services from AWS ECS to EKS, saving ~$20,000/month by replacing Datadog with a custom SigNoz observability stack.
- ▸Swiggy (Client): Managed 50+ AWS accounts using Terraform, ensuring 99.9% uptime during high-traffic national events like Diwali and New Year.
- ▸Nomupay (Client): Designed multi-account AWS structures with Terragrunt and migrated EKS networking to Cilium/Hubble for enhanced performance and security.
- ▸Barq Fintech (Client): Engineered resilient OKE (Oracle Kubernetes) environments, implementing Istio Service Mesh and Keycloak for centralized AuthN/AuthZ.
- ▸Drove innovation by creating a custom Cloud Map Kubernetes Controller and implementing Karpenter for intelligent EKS autoscaling.
- ▸Introduced Netbird VPN as the organizational standard.
- ▸Mentored 50+ engineers through the Ninja, Sanatak, and Ronin programs.
- ▸Designed and implemented production-ready MLOps pipelines using Python, scikit-learn, DVC, MLflow, and AWS S3 for reproducible model training and data versioning.
NEXTEON SolutionsOn-Site
Jr. DevOps Engineer
- ▸Managed and monitored 100+ websites running on Adobe Experience Manager (AEM) across on-premise data centers (RSDC & BSDC).
- ▸Led the successful migration of Rajasthan Government websites and AEM (6.1) infrastructure between data centers in Jaipur.
- ▸Collaborated with cross-functional teams to resolve infrastructure bottlenecks and ensure high availability for mission-critical government services.
- ▸Automated routine deployment tasks, reducing manual intervention and minimizing release-window downtime.
Education
RPSGOI, Balana
Bachelor of Technology, Computer Science & Engineering
- • Built a strong foundation in core engineering principles including Linux, Networking.
- • Explored interdisciplinary interests that led to a growing passion for automation and cloud infrastructure.
- • Worked on academic and practical projects involving design, analysis, and simulation of computer systems.
- • Participated in workshops and tech fests, developing early skills in problem-solving and teamwork.
- • This journey eventually sparked a shift toward DevOps, cloud computing, and open-source collaboration.
Skills
Other Skills
Impact
Volunteering & Mentorship
Trainer / Mentor
2025Opstree Global
Designed and delivered a foundational training program titled 'Journey of a DevOps Engineer — From Day 1 to Serving a Million' for new engineers, covering real-world DevOps practices from setup to large-scale production systems.
Awards & Recognition
ECS to EKS Migration Excellence
May 2025
Successfully migrated 200+ services for client with near zero downtime.
Opstree Ninja/Ronin Mentor
April 2023
Recognized for providing high-impact training and mentorship in DevOps programs.
AEM Data Center Migration
Feb 2020
Successfully migrated all Rajasthan Govt Websites to a new Data Center.
Appreciation
Reciedved appreciation multiple times from clients.
Certifications
Projects
Self-hosted VPN server setup & multi-cloud links.
Sync headless Services to AWS Cloud Map/Route53 with TTLs & audit logs.
This project provides a ready-to-use advanced monitoring platform for DevOps engineers and beginners. With just one command, you get Prometheus, Grafana, Loki, Alertmanager, Node Exporter etc
Menu-driven admin utility to manage Netbird resources.
Amazon EKS Cluster with Terraform
Terraform-based provisioning of an Amazon EKS Cluster for Kubernetes deployments.
CI/CD on EKS using GitHub Actions
CI/CD pipeline for deploying a Node.js app on Amazon EKS using GitHub Actions, Terraform, and Kubernetes.