November 13–15, 2018 - Shanghai, China
Click Here For Information & Registration

To view the Chinese version of this schedule please go here.

Simultaneous translation will be provided for all keynote and breakout sessions.
View analytic
Thursday, November 15 • 12:15 - 12:50
Large-Scale K8s Cluster Operation and Management - Lv Jiangzhao, JD

Sign up or log in to save this to your schedule and see who's attending!

Feedback form is now closed.
JDOS(JD Datacenter Operation System) is the very large-scale container cluster system that running in JD's datacenters across the world. It was designed and developed based on Kubernetes. Today, almost all the JD's business has been deployed and running on JDOS. At present, the number of containers in JD's production environment has been millions. How to manage such large-scale clusters is a challenging issue for JDOS developers and operators. However, JD have only 2 full-time SREs to manage the clusters. This presentation will share some of the following experiences:
1.Node Component's detection and management;
2.Master Component's fault detection and failure recovery, especially for the etcd nodes;
3.How to significantly reduce apiserver requests, in order to build a much larger k8s cluster.


Thursday November 15, 2018 12:15 - 12:50
2F Room 3
  • Skill Level Any