Niranjan DevOps and SRENiranjan DevOps & SRE
Menu

$ ls ./projects --detailed

Projects

Real cloud, DevOps, Kubernetes, and DevSecOps projects with problems, architecture, tools, implementation, and measurable outcomes.

10+

Detailed Projects

AWS + GCP

Multi-Cloud Focus

DevSecOps

Security Included

All Projects

July 2024 - Present

DS Edify: AWS-Based Learning Platform

$ kubectl get pods -n ds-edify

Click to open / close details

Problem:Need to deploy a scalable, high-availability learning platform with rapid auto-scaling and real-time monitoring.

Implementation:Architected multi-tier AWS infrastructure using EC2, App Runner, RDS, S3, ALB, and Route 53. Added CloudWatch dashboards, alarms, and release automation.

AWS EC2App RunnerRDSS3ALBRoute 53CloudWatchDocker

Result / Impact:

  • Reduced incident detection time by 60%
  • Implemented high-availability scaling model
  • Delivered 99.9% uptime SLA
Aug 2024 - Present

GCP Multi-Application Infrastructure (Amrize)

$ gcloud run services list --platform managed

Click to open / close details

Problem:Manage 12+ microservices across GCP with automated deployments and secure service-to-service access.

Implementation:Built infrastructure on Cloud Run, App Engine, Cloud SQL, and Firestore. Implemented GitLab CI pipelines and IAM least-privilege policies.

GCP Cloud RunApp EngineCloud SQLFirestoreGitLab CI/CDIAMCloud Monitoring

Result / Impact:

  • Improved release frequency by 35%
  • Cut manual operations by 50%
  • Standardized IAM security controls
Sep 2024 - Dec 2024

DS Jobby: Dockerized Job Portal

$ docker compose up -d --build

Click to open / close details

Problem:Containerize a legacy job portal and deploy securely with isolated networking and identity controls.

Implementation:Built containerized services, pushed images to AWS ECR, and deployed with VPC isolation, security groups, NAT, and IAM policies.

DockerAWS ECRVPCIAM RolesSecurity GroupsEC2

Result / Impact:

  • Reduced deployment time by 45%
  • Hardened network boundary with isolated VPC
  • Enforced secure IAM-based integrations
Jan 2025 - Mar 2025

Automated DevSecOps Pipeline

$ trivy image myservice:latest

Click to open / close details

Problem:Integrate security scanning into CI/CD without slowing developer velocity.

Implementation:Engineered Jenkins shared libraries to run Terraform checks, Trivy image scans, SonarQube code analysis, and Wiz posture validation.

JenkinsTrivySonarQubeWizTerraformDocker

Result / Impact:

  • Detected 90% critical issues before production
  • Applied security guardrails across 20+ repos
  • Reduced audit turnaround by 80%
Nov 2024 - Jan 2025

EKS Cluster Optimization

$ kubectl top pods --all-namespaces

Click to open / close details

Problem:High cloud costs caused by inefficient scaling and overprovisioned Kubernetes workloads.

Implementation:Optimized EKS node groups with Spot + Karpenter, tuned HPA thresholds via Prometheus/Grafana metrics, and right-sized pods.

AWS EKSKarpenterPrometheusGrafanaKubernetesHelm

Result / Impact:

  • Reduced compute cost by 45%
  • Eliminated recurring OOM events
  • Improved autoscaling response time
Feb 2025 - Apr 2025

Zero Trust Cloud Networking Blueprint

$ sudo iptables -L -n

Click to open / close details

Problem:Multi-cloud workloads needed stronger east-west traffic control and access governance.

Implementation:Designed segmented VPC/VNet patterns, least-privilege IAM roles, workload identity federation, and private service endpoints.

AWS IAMGCP IAMVPCPrivate EndpointsWizTerraform

Result / Impact:

  • Reduced unnecessary network exposure by 70%
  • Improved compliance posture for production services
  • Standardized identity policy templates
Apr 2025 - Jun 2025

GitOps Kubernetes Platform Delivery

$ kubectl rollout status deploy/platform-api

Click to open / close details

Problem:Deployment drift and inconsistent environments across staging and production clusters.

Implementation:Implemented GitOps workflow with pull-based deployments, policy checks, and environment overlays for reliable rollout control.

KubernetesGitHub ActionsHelmArgo CDPolicy as Code

Result / Impact:

  • Eliminated manual drift across environments
  • Improved rollback speed during incidents
  • Reduced failed releases by 30%
Jun 2025 - Aug 2025

Observability & SRE Command Center

$ journalctl -u kubelet --since '1 hour ago'

Click to open / close details

Problem:Teams lacked centralized visibility for latency, saturation, errors, and security events.

Implementation:Built unified dashboards, alert routing, and error budgets using Prometheus, Grafana, CloudWatch, and log correlation pipelines.

PrometheusGrafanaCloudWatchAlertmanagerLokiSLO

Result / Impact:

  • Reduced MTTR by 45%
  • Improved alert quality with SLO-aligned rules
  • Enabled proactive incident prevention workflows
Sep 2025 - Nov 2025

Python Platform Automation Suite

$ python3 tools/platform_automation.py --env prod

Click to open / close details

Problem:Manual infrastructure and release tasks were creating delays and configuration drift across teams.

Implementation:Developed a Python automation toolkit for environment bootstrapping, compliance checks, backup scheduling, and release guardrails integrated with CI pipelines.

PythonBashGitHub ActionsTerraformLinux

Result / Impact:

  • Reduced repetitive operations effort by 55%
  • Standardized deployment checks across services
  • Improved release confidence with automated validation
Dec 2025 - Feb 2026

Cloud Disaster Recovery Readiness Program

$ bash dr-validate.sh --region ap-south-1

Click to open / close details

Problem:Critical services needed improved resiliency and tested recovery workflows across cloud regions.

Implementation:Designed cross-region backup and recovery runbooks, automated failover checks, and monitoring-based recovery triggers for production workloads.

AWSGCPCloudWatchTerraformPostgreSQLLinux

Result / Impact:

  • Lowered RTO/RPO with tested DR simulations
  • Automated backup validation and restore checks
  • Improved business continuity readiness