Expert in DevOps & Site Reliability Engineering

Seasoned leader with 15+ years of experience blending DevOps principles with SRE practices to build, automate, and secure highly available environments across multi-cloud and bare-metal infrastructure.

About Me

My Philosophy: Reliability, Efficiency, & Security

My philosophy translates directly into tangible business value. I have a proven track record of increasing platform uptime while simultaneously driving significant cloud cost reductions. My approach goes beyond just "keeping the lights on"—I focus on building systems that are automated, observable, and secure by default. This foundation of reliability and efficiency has been instrumental in leading complex governance efforts to achieve SOC-2 and FedRamp compliance.

Bespin Cloud Consulting

Expertise in all things infrastructure

Leverage my expertise in architecture, design, implementation, automation, and monitoring to elevate your infrastructure.

Security & Compliance

Develop and implement security and compliance governance best practices. Achieve certifications like SOC-1, SOC-2, and FedRamp with proven strategies.

Multi-Cloud Management

Manage and optimize complex Kubernetes environments across AKS and GKE. Implement cost reduction initiatives that deliver measurable savings, like reducing monthly cloud spend by over 30%.

Reliability & Monitoring

Build robust observability stacks (Grafana, Prometheus, Loki) and establish the policies, procedures, runbooks, and SLA reporting required for 24/7/365 operations.

Technical Proficiencies

DevOps / SRE

Kubernetes Terraform Grafana Prometheus Docker Chef Puppet CloudFormation ELK Stack Icinga2 Nagios Cacti

Cloud & Security

AWS GCP Azure SOC-1 / SOC-2 FedRamp IAM Security Best Practices

Scripting & Methodologies

Python Bash Perl Agile Incident Response Postmortem Analysis CMDB

Professional Experience

Lead Global Site Reliability Engineer - Forcepoint

Apr 2024 - Present

  • Enhanced reliability and scalability, increasing uptime from 94% to 99.5% through proactive monitoring.
  • Expanded observability into our cloud platform utilizing Grafana, Prometheus, Loki, and Zabbix.
  • Lead initiative to migrate from Grafana Agent to Alloy from initial design to complete production rollout.
  • Conducted incident response, postmortems, and RCAs, implementing remediation strategies.

Senior Site Reliability Engineer - Sight Machine

Sep 2022 - Mar 2024

  • Served as a principal member of the Security Committee, driving efforts toward SOC-2 compliance.
  • Managed numerous Kubernetes clusters (AKS and GKE) utilizing Terraform, Atlantis, FluxCD, and Helm.
  • Spearheaded Azure infrastructure cost reduction initiatives (over 30% reduction in monthly cloud spend).

Lead DevSecOps Engineer - OutSystems

Jan 2021 - Sep 2022

  • Developed Security and Compliance governance best practices through policies, education, and monitoring.
  • Implemented CMDB, compliance/cost management/usage analytics engine across AWS, GCP, and Azure.
  • Analyzed IAM policies across 1800+ AWS accounts and proposed least privilege policy changes.

DevOps Guy - Authentic8

Dec 2017 - Dec 2020

  • Served as the initial lead engineer within the newly created Network Security Operations Center (NSOC).
  • Designed a self-aware Icinga2 monitoring backend across bare metal, MSP, AWS, and GCP.
  • Instrumental in advancing the organization toward SOC-1, SOC-2, and FedRamp compliance certifications.

Senior Systems Engineer - Move, Inc.

Oct 2015 - Dec 2017

  • Designed strategies and pipelines to migrate services from bare metal to AWS.
  • Introduced Docker as a manageable, scalable, and efficient alternative to traditional hardware implementations.
  • Integrated with various development teams on architecture and design for a smooth transition to AWS.

Lets Connect!

Have a project in mind or just want to discuss the future of infrastructure? I'd love to hear from you.

chandler@bespincloudconsulting.com