Andrew Pruski
Adjusting Pod Eviction Timings in Kubernetes
#1 about 3 minutes
A DBA's journey to running SQL Server on Kubernetes
The speaker shares his background as a SQL Server DBA and the efficiency gains that led him to explore running stateful workloads in Kubernetes.
#2 about 2 minutes
Why the default five-minute failover is unacceptable
When a node fails, Kubernetes waits five minutes by default before evicting its pods and rescheduling them elsewhere, which is too long for stateful applications like SQL Server.
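For context, the five-minute wait comes from tolerations that Kubernetes adds to any pod that does not define its own. A sketch of what those defaults look like inside a pod spec, assuming an unmodified cluster (300 seconds is the upstream default; a given cluster may be configured differently):

```yaml
# Default tolerations added to a pod spec when none are specified.
# The pod stays bound to a failed node for tolerationSeconds before eviction.
tolerations:
  - key: "node.kubernetes.io/not-ready"
    operator: "Exists"
    effect: "NoExecute"
    tolerationSeconds: 300   # 5 minutes
  - key: "node.kubernetes.io/unreachable"
    operator: "Exists"
    effect: "NoExecute"
    tolerationSeconds: 300   # 5 minutes
```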
#3 about 5 minutes
Demonstrating the default pod eviction delay in action
A live demo shows an nginx pod taking five minutes to be rescheduled to a healthy node after its original node is shut down in AKS.
#4 about 2 minutes
How to configure faster pod eviction with tolerations
Pod eviction time can be reduced from five minutes to seconds by setting tolerations for not-ready and unreachable nodes in the deployment YAML.
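A minimal sketch of such a deployment, assuming a single-replica nginx workload like the one used in the demos. The deployment name, labels, and image tag are illustrative rather than taken from the talk; the ten-second value matches the failover time quoted for the demo:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: nginx              # illustrative name
spec:
  replicas: 1
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      # Override the 300-second defaults so the pod is evicted
      # 10 seconds after its node is marked not-ready or unreachable.
      tolerations:
        - key: "node.kubernetes.io/not-ready"
          operator: "Exists"
          effect: "NoExecute"
          tolerationSeconds: 10
        - key: "node.kubernetes.io/unreachable"
          operator: "Exists"
          effect: "NoExecute"
          tolerationSeconds: 10
      containers:
        - name: nginx
          image: nginx     # illustrative image
```

The tolerations sit in the pod template, so they apply to every pod the deployment creates. The eviction timer only starts once the node has been marked not-ready or unreachable, so the total failover time also includes the node failure detection period.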
#5 about 3 minutes
Demo of a ten-second failover using tolerations
By adding tolerations to the nginx deployment, a new pod is spun up on a healthy node just ten seconds after the original node fails.
#6 about 7 minutes
Why fast pod eviction fails for stateful apps
A demo with SQL Server shows that even with tolerations, the new pod gets stuck in a "ContainerCreating" state due to a multi-attach error on the persistent volume.
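The multi-attach error typically arises with volumes that can be attached to only one node at a time. A sketch of a claim of that kind, assuming the SQL Server pod uses a `ReadWriteOnce` volume from the cluster's default storage class (the claim name and size are hypothetical, not taken from the demo):

```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: mssql-data         # hypothetical claim name
spec:
  accessModes:
    - ReadWriteOnce        # volume can be attached to a single node at a time
  resources:
    requests:
      storage: 10Gi        # hypothetical size
```

While the failed node still holds the attachment, the replacement pod on a healthy node cannot attach the same volume, so it waits in "ContainerCreating" even though eviction itself was fast.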
#7 about 7 minutes
Achieving high availability with Portworx storage
A third-party tool called Portworx provides a storage class that correctly detaches and reattaches storage, enabling a SQL Server pod to fail over successfully in seconds.
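As a rough illustration only, a Portworx-backed storage class might look like the sketch below. It assumes Portworx is installed with its CSI driver; the class name and the replication factor are hypothetical and are not the values shown in the talk:

```yaml
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: px-mssql-sc            # hypothetical class name
provisioner: pxd.portworx.com  # assumes the Portworx CSI driver is installed
parameters:
  repl: "3"                    # assumed: keep three replicas of each volume
allowVolumeExpansion: true
```

A persistent volume claim for the SQL Server pod would then reference this class through `storageClassName`.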
#8 about 1 minute
Key considerations for stateful app high availability
Achieving high availability for stateful apps in Kubernetes requires adjusting pod tolerations and using a storage solution that can handle volume reattachment across nodes.