Sr. Site Reliability Engineer
Sr. Site Reliability Engineer based in Denver, Colorado. Eight years building and maintaining the infrastructure that keeps software honest. Container orchestration, cloud architecture, CI/CD pipelines, safe deployments and managed rollbacks, observability and visualization. I take change risk seriously and let data drive the decision.
I take pride in performant, readable, maintainable code. My experience spans early-stage startups, a Fortune 50 media and telecommunications company, and regulated cybersecurity environments through FedRAMP audit.
Math enthusiast. Guitarist. Mechanical keyboards. Overengineered home labs. Student of several human languages.
Kubernetes admission controller that enforces observability standards at deploy time. Rejects pods missing probes, resource limits, or required logging labels. Configured via CRD with no restarts needed.
Terminal dashboard for SRE on-call metrics. Pulls from PagerDuty and Datadog to show error budget burn rate, MTTR, and incident frequency over rolling windows. Fits in a tmux pane.
An alternate version of this portfolio built as an interactive terminal. Navigable filesystem, working vim overlay, tab autocomplete, easter eggs. Type 'help' to get started.
In progress.
Led migration of a core application serving all Comcast personal and business customers from legacy data centers to a distributed cloud platform, including CI/CD pipelines, infrastructure provisioning with Ansible and Terraform, and network micro-segmentation. Owned the team's full observability stack, driving a Zabbix-to-Prometheus transition and building out Grafana dashboards in collaboration with engineering and business partners.
Migrated 60+ production environments from custom deployment scripts to a Helm-based model, enabling cost optimization tooling that reduced AWS spend by 40%, projecting over $1M in annual savings. Managed Kubernetes cluster health across a regulated environment and drove FedRAMP audit remediation efforts.