Logan Kuhn

Champaign, IL

Summary

Senior Site Reliability Engineer specializing in storage and configuration management, focused on building and maintaining highly available and scalable infrastructure. Experienced in automation, infrastructure-as-code, and optimizing Ceph storage clusters for performance and reliability.

Work History

Senior Site Reliability Engineer

Akamai

12.2022 - Current

Migrate 4 clusters running 6+ year old hardware and an EOL version of Ceph to new hardware and the current version of Ceph in 6 months, while coordinating with several stakeholders and minimizing customer impact.
Manage Salt pillar, states, and formulas to configure and deploy Ceph and the tooling that supports the storage team across 100+ block and object storage clusters.
Write a Python script that monitors block and object storage performance across 100+ clusters and 20 DCs, enabling automated detection of performance problems before customers are aware of them.
Authored a script that automates the upgrade of over 4k Ceph hosts from Ubuntu 20.04 to 22.04 with zero downtime to the Ceph cluster and minimal customer impact.
Utilize Terraform for IaaS deployment of nearly 3k virtual object storage infrastructure, significantly improving deployment reliability and reducing deployment time.
Trained the React team on Ceph troubleshooting and runbook creation, reducing incident resolution time and empowering them to independently address common issues.
Leveraged Git and GitLab to streamline the development and deployment of Salt-based automation tooling, improving infrastructure consistency and reducing configuration errors.
Troubleshoot client-impacting incidents and write postmortems describing what happened, the solution, and any new action items to prevent future occurrences.

Senior Storage Engineer

Datto Inc

04.2021 - 12.2022

Work with a team of 10 on numerous projects, including upgrading over 3000 servers from Ubuntu 18 to Ubuntu 20 with minimal downtime
Utilize GitLab CI/CD to automate code checks, ensuring higher quality code prior to team evaluation
Deploy Ceph via ceph-ansible
Automate day-2 tasks to manage Ceph
Create new Ansible playbooks for management tasks, including adding Ceph daemons and deploying OpenStack VMs
Manage over 3000 ZFS-based servers: replace zpool drives, create new pools (mirrored and vdevs), take and prune snapshots, and handle other zpool-related tasks
Write Python code to automate various parts of the job, primarily around DNS and server onboarding
Utilize Jira for managing my task list
Participate in weekly code review

Systems Administrator II

Wolfram Research

05.2017 - 04.2021

Deploy RHEL based infrastructure via custom kickstarts
Manage RHEL, Debian, and OS X infrastructure via Puppet 3
Write custom Puppet modules for Wolfram-specific needs
Write documentation for as much as possible
Create a prototype Graylog infrastructure for log consolidation and analysis
Help co-workers learn and advance their knowledge
Upgrade middleware platforms (Jira, Bitbucket) and ensure stakeholders are able to test prior to release
Test and deploy the oVirt virtualization platform, and write a Python program to automate the migration of our virtualization infrastructure from the old platform to oVirt
This saved a significant amount of time per VM (300+ VMs) and removed the human from almost all of the migration process, which led to less potential for mistakes
Assist with testing and deployment of Proxmox virtualization as a comparison to the oVirt infrastructure
Deploy Ceph as our storage infrastructure so we have a distributed, unified storage platform for all future virtualization and filestore needs
Use Python anytime I can to automate the tedium away
Prototype and deploy future infrastructure (Terraform, Satellite, Puppet 5)
Deploy PostgreSQL in a master/slave configuration

Systems Administrator

Wolfram Research

11.2015 - 04.2021

Jr. Systems Administrator

Wolfram Research

04.2015 - 04.2021

Education

Associate - Network Administration

Lake Land College

Mattoon, IL

Bachelor's - Cloud and Systems Administration

Western Governors University

Salt Lake City, UT

Skills

Operating Systems: Linux (Administration, Performance Tuning)
Storage Technologies: ZFS, XFS, Ceph
Configuration Management: Puppet, Satellite, Salt
Cloud Technologies: Terraform, Linode/Akamai Cloud
Automation & Scripting: Python, Bash
Databases: MariaDB, PostgreSQL
Monitoring & Visualization: Grafana
Other: HAProxy, CI/CD, Clustered Storage

Certification

A+
Network+
LPE
LPIC 1
AWS SysOps
Cloud+
