Current
Oracle - Senior Member of Technical Staff
Leading a 5-engineer team delivering scalable systems for lifecycle operations of 1M+ NICs, including provisioning, observability, failure triage, and remediation.
PROJECT HIGHLIGHTS
-
CAPS - Card Provisioning Service
- Built an automated provisioning platform from scratch to install operating systems on raw NICs.
- Eliminated manual ticket-based onboarding, reducing hardware provisioning time from ~2 weeks to real time.
- Authored requirements and architecture designs; led cross-team technical discussions.
- Implemented parallel provisioning for multiple NICs per host, reducing provisioning time by 50% (1 hr to 30 min).
-
TRS - Triage & Repair Service
- Developed an automated failure detection and remediation system for provisioning issues.
- Replaced manual triage workflows, improving response time and operational efficiency.
-
Cerebro - Real-Time Health Monitoring Service
- Re-architected a legacy monitoring platform supporting 1M+ NICs.
- Built real-time health detection pipelines, replacing slow polling cycles and significantly improving failure detection latency.