Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Logan Kuhn

Champaign,IL

Summary

Senior Site Reliability Engineer specializing in storage and configuration management, focused on building and maintaining highly available and scalable infrastructure. Experienced in automation, infrastructure-as-code, and optimizing Ceph storage clusters for performance and reliability.

Overview

10
10
years of professional experience
1
1
Certification

Work History

Site Reliability Engineer.Senior

Akamai
12.2022 - Current
  • Migrate 4 clusters running 6+ year old hardware and an EOL version of Ceph to new hardware and the current version of Ceph in 6 months. While coordinating with several stakeholders and minimizing customer impact
  • Manage salt pillar, states and formulas to configure and deploy Ceph and tooling that supports the storage team across 100+ block and object storage clusters.
  • Write Python script that monitors block and object storage performance across 100+ clusters and 20 DCs which allows for automated monitoring for performance problems before customers are aware.
  • Authored a script that automates the upgrade of over 4k Ceph hosts from Ubuntu 20.04 to 22.04 with zero downtime to the Ceph cluster and minimal customer impact.
  • Utilize Terraform for IaaS deployment of nearly 3k virtual object storage infrastructure, significantly improving deployment reliability and reducing deployment time.
  • Trained the React team on Ceph troubleshooting and runbook creation, reducing incident resolution time and empowering them to independently address common issues.
  • Leveraged Git and GitLab to streamline the development and deployment of Salt-based automation tooling, improving infrastructure consistency and reducing configuration errors
  • Troubleshoot client impacting incidents and write post mortems describing what happened and the solution as well as any new action items to prevent future occurrences

Senior Storage Engineer

Datto Inc
04.2021 - 12.2022
  • Work with a team of 10 on numerous projects including upgrading over 3000 servers from Ubuntu 18 to Ubuntu 20 with minimal downtime
  • Utilize Gitlab CI/CD to automate some code checks to ensure higher quality code prior to team evaluation
  • Deploy Ceph via ceph-ansible
  • Automate day 2 tasks to manage Ceph
  • Create new ansible playbooks for various management tasks, including adding ceph daemons, deploying openstack vms and other management tasks
  • Manage over 3000 ZFS based servers
  • Replace zpool drives, create new pools (mirrored and vdevs)
  • Take/prune snapshots, other misc zpool related tasks
  • Write Python code to automated various pieces of the job
  • Primarily around DNS and server onboarding
  • Utilize Jira for managing my task list
  • Participate in weekly code review

Systems Administrator II

Wolfram Research
05.2017 - 04.2021
  • Deploy RHEL based infrastructure via custom kickstarts
  • Manage RHEL, Debian, OSX infrastructure via Puppet 3
  • Write custom puppet modules for Wolfram specific needs
  • Write documentation for as much as possible
  • Create a prototype graylog infrastructure for log consolidation and analysis
  • Assist co-workers with learning and advance their knowledge
  • Upgrade middleware platforms (Jira, Bitbucket) and ensure stakeholders are able to test prior to release
  • Test and deploy the oVirt virtualization platform and via Python write a program to automate the migration of our virtualization infrastructure from the old platform to ovirt
  • This saved a significant amount of time per VM (300+ VMs) and removed the human from almost all of the migration process which lead to less potential for mistakes
  • Assist with testing and deployment of Promox virtualization as a comparison to the ovirt infrastructure
  • Deploy Ceph as our storage infrastructure so we have a distributed, unified storage platform for all future virtualization and filestore needs
  • Use Python anytime I can to automate the tedium away
  • Prototype and deploy future infrastructure (terraform, satellite, puppet5)
  • Deploy postgresql master/slave

Systems Administrator

Wolfram Research
11.2015 - 04.2021

Jr. Systems Administrator

Wolfram Research
04.2015 - 04.2021

Education

Associate - Network Administration

Lake Land College
Mattoon, IL

Bachelor's - Cloud and Systems Administration

Western Governors University
Salt Lake City, UT

Skills

  • Operating Systems: Linux (Administration, Performance Tuning)
  • Storage Technologies: ZFS, XFS, Ceph
  • Configuration Management: Puppet, Satellite, Salt
  • Cloud Technologies: Terraform, Linode/Akamai Cloud
  • Automation & Scripting: Python, Bash
  • Databases: MariaDB, PostgreSQL
  • Monitoring & Visualization: Grafana
  • Other: HAProxy, CI/CD, Clustered Storage

Certification

  • A+
  • Network+
  • LPE
  • LPIC 1
  • AWS SysOps
  • Cloud+

Timeline

Site Reliability Engineer.Senior

Akamai
12.2022 - Current

Senior Storage Engineer

Datto Inc
04.2021 - 12.2022

Systems Administrator II

Wolfram Research
05.2017 - 04.2021

Systems Administrator

Wolfram Research
11.2015 - 04.2021

Jr. Systems Administrator

Wolfram Research
04.2015 - 04.2021

Associate - Network Administration

Lake Land College

Bachelor's - Cloud and Systems Administration

Western Governors University
Logan Kuhn