Kubernetes
7 posts
Multi‑Node Private Deployment of DeepSeek-r1:671B Full Version on K8s + SGLang
2025-02-19
This post walks through deploying the full DeepSeek-r1-671B model on Kubernetes with SGLang for production-grade, multi-node GPU inference. It explains how to orchestrate elastic multi-GPU workloads using LeaderWorkerSet and Volcano, optimize performance via RadixAttention and KV cache reuse, integrate Prometheus/Grafana for SLA-grade monitoring, and contrasts this K8s+SGLang stack with Ollama. A step-by-step environment and YAML guide is included.
1575 字
|
8 分钟
Using Thanos to Manage Multiple Prometheus Data Sources for High Availability
2024-08-12
This post walks through using Thanos to aggregate metrics from multiple Prometheus instances for high availability, centralized querying, and long‑term retention in object storage. It explains core components like Sidecar, Store, Query, and Ruler, and shows key Kubernetes deployment details, including image mirror replacement, exposing NodePort Services, and handling NetworkPolicy pitfalls.
676 字
|
3 分钟
Set Up a K8s Cluster with kubeadm
2023-12-03
Step-by-step guide to building a Kubernetes cluster with kubeadm on three Ubuntu 20.04 servers. Covers environment preparation, disabling SWAP and firewall, kernel and containerd configuration, installing kubeadm/kubelet/kubectl, initializing the control plane, deploying Flannel CNI, enabling shell completion, and joining worker nodes, with ready-to-use scripts and version notes.
880 字
|
4 分钟
Common Kubectl Commands
2023-10-07
A structured guide to essential kubectl commands, covering creation, querying, deployment, scaling, and troubleshooting. Explains key subcommands like apply, rollout, logs, and exec, and includes cluster management utilities plus a quick reference to common Kubernetes resource names and abbreviations to boost your daily Kubernetes efficiency.
1857 字
|
9 分钟
Deploying Applications to a Kubernetes Cluster
2023-09-14
Step-by-step guide to deploying applications on Kubernetes, from using `kubectl run` to managing Pod and Deployment YAML. Learn how labels connect Deployments and Pods, how to scale replicas, forward ports, and roll back versions. Includes a concise cheatsheet of common `kubectl` commands for querying resources, inspecting logs, and troubleshooting in real clusters.
436 字
|
2 分钟
Deploying a Kubernetes Cluster with Minikube on Debian 11
2023-09-07
This tutorial walks you through deploying a Kubernetes cluster with Minikube on Debian 11, using either a Hetzner dedicated server or a typical VPS. It covers environment and driver requirements, installing and verifying kubectl and Minikube, starting Minikube with the Docker driver under root (and handling the permission warning), checking cluster state, and enabling remote access to the Kubernetes Dashboard.
1023 字
|
5 分钟
Kubernetes Notes
2023-09-06
This post traces the evolution from traditional deployment to containers, then introduces Kubernetes as a container orchestration platform. It explains K8s origins, core features, and Master/Node components, and walks through deploying Nginx to illustrate scheduling, self‑healing, autoscaling, and service discovery, helping you get started quickly.
1128 字
|
6 分钟