Naveen Kumar Reddy

Senior Software Engineer | Platform Architecture | Distributed Systems

Saved a $100M+ contract. Cut $560K+/year in costs. Built platforms for 8+ teams.

What I've Built

  • Async job orchestration — sole architect, 10M+ events/sec, 1000+ concurrent jobs, 8 teams build on it
  • 7-system data reconciliation — Neptune graph lineage, sub-hour deviation detection, billions of records
  • Real-time analytics — Druid, petabyte-scale, ML-powered routing, 99.99% availability target
  • ETL platform redesign — 97 Scala files analyzed, 85% cost reduction, 70% code elimination

What I've Recovered

  • $100M+ Disney contract — Led 28-day crisis recovery, 10TB data, 7 pipelines, 100% accuracy
  • 3-year dead CI/CD — Docker→Kaniko, Java 8→17, security scanning restored
  • Production clusters — EKS failures, database exhaustion, certificate expirations, silent corruption

How I Debug

  • cgroups v2 OOMKill — traced to Linux kernel memory accounting change, unblocked 4 teams
  • EFS martian packets — asymmetric routing + rp_filter + VPC CIDR overlap. One node in hundreds.
  • NFS loopback deadlock — D-state process analysis, hard mount + dead server = infinite hang
  • EKS 1.23→1.33 — dockershim→containerd, reverse-engineered NodeConfig, became org standard

How I Optimize

  • O(n⁴)→O(1) — 29 billion operations eliminated, 2hr→5min, 97% reduction
  • 573,562× faster — stream filters → indexed data structures
  • $560K+/year saved — autoscaling $90K, ETL $50K, governance $360K
  • 90% deployment improvement — 2-3hr→10-15min, 30+ jobs→6 dynamic

How I Decide

  • Chose database over SQS for orchestration — simpler, cheaper, equally reliable
  • Rejected "add more memory" for OOMKill — traced to kernel behavior instead
  • Pushed back on sub-300ms latency target — honest analysis showed 500-850ms realistic

How I Lead

  • 30+ engineers mentored — reviews, pairing, knowledge transfer
  • 100+ MRs reviewed — "Naveen-approved" = organizational quality signal
  • 18 interviews, 83% rejection — bar raiser, 3 quality hires
  • 7MB+ documentation — 90+ page gold standards, research-level depth

Experience

Nielsen | Senior Engineer | 2024–present
Flexcar | Senior Engineer | 2022–2024
OYO | Backend Engineer | 2021–2022

Stack

Java, Python, Scala, Go | Kafka, Spark, Druid, Airflow | AWS, Kubernetes, Terraform | PostgreSQL, Redis, DynamoDB