KBS-343

Kubernetes Troubleshooting

Detailed Course Outline

Course Length

3 days

Course Overview

As Kubernetes becomes the most widely used platform for deploying applications, knowing how to address the problems that can occur in Kubernetes systems is essential.

This course teaches participants a wide range of IT troubleshooting concepts and techniques, from generic troubleshooting methodologies to domain-specific guidance covering the various aspects of Kubernetes clusters and the applications deployed in them.

Course Objectives

By the end of this course, learners will be able to apply a systematic troubleshooting methodology to diagnose and resolve Kubernetes issues across the control plane, nodes, workloads, networking, and storage. They will gain hands-on skills with Kubernetes, container, and system-level tools to identify root causes and restore cluster reliability.

Structure

40% lecture, 60% hands-on lab exercises

Target Audience

Professionals who manage or operate Kubernetes clusters and the applications running in them.

Prerequisites

Working experience with Kubernetes and a general understanding of the Linux kernel, containerization, and networking concepts.

Course Modules

Module 1: Troubleshooting methodology and tools

  • Fault analysis methodology
  • Diagnosis methodology
  • Diagnosis tools
      • System
      • Container
      • Kubernetes

Module 2: Kubernetes architecture

  • Control plane components, configuration, logging
  • Worker node components, configuration, logging
  • Request processing
  • RBAC
  • Troubleshooting node issues

Module 3: Handling workload errors

  • Troubleshooting pod errors
  • Troubleshooting Deployments
  • Troubleshooting StatefulSets

Module 4: Troubleshooting networking

  • Network architecture
  • CNI
  • Troubleshooting services
  • Troubleshooting network policies

Module 5: Storage issues

  • Storage in Kubernetes
  • CSI
  • Troubleshooting storage issues