集群规格:
apiVersion: eksctl.io/v1alpha5
kind: ClusterConfig
metadata:
name: mixedCluster
region: ap-southeast-1
nodeGroups:
- name: scale-spot
desiredCapacity: 1
maxSize: 10
instancesDistribution:
instanceTypes: ["t2.small", "t3.small"]
onDemandBaseCapacity: 0
onDemandPercentageAboveBaseCapacity: 0
availabilityZones: ["ap-southeast-1a", "ap-southeast-1b"]
iam:
withAddonPolicies:
autoScaler: true
labels:
nodegroup-type: stateless-workload
instance-type: spot
ssh:
publicKeyName: newkeypairbro
availabilityZones: ["ap-southeast-1a", "ap-southeast-1b"]
问题:
当我扩展我的应用程序(业务 pod)时,将为每个节点自动创建 CloudWatch pod。但是当我决定将我的业务 pod 缩减为零时,我的集群自动扩缩器不会耗尽或终止某些节点内的 cloudWatch 内容(pod)。因此,这将在我的集群内创建一个虚拟节点。
根据上图,最后一个节点是内部带有 cloudWatch pod 的虚拟节点:
预期结果:
如何在业务 pod 终止后优雅地(自动)耗尽 Amazon CloudWatch 节点?所以它不会创建一个虚拟节点?
这是我的自动缩放器配置:
Name: cluster-autoscaler
Namespace: kube-system
CreationTimestamp: Sun, 11 Apr 2021 20:44:28 +0700
Labels: app=cluster-autoscaler
Annotations: cluster-autoscaler.kubernetes.io/safe-to-evict: false
deployment.kubernetes.io/revision: 2
Selector: app=cluster-autoscaler
Replicas: 1 desired | 1 updated | 1 total | 1 available | 0 unavailable
StrategyType: RollingUpdate
MinReadySeconds: 0
RollingUpdateStrategy: 25% max unavailable, 25% max surge
Pod Template:
Labels: app=cluster-autoscaler
Annotations: prometheus.io/port: 8085
prometheus.io/scrape: true
Service Account: cluster-autoscaler
Containers:
cluster-autoscaler:
Image: k8s.gcr.io/autoscaling/cluster-autoscaler:v1.18.3
Port: <none>
Host Port: <none>
Command:
./cluster-autoscaler
--v=4
--stderrthreshold=info
--cloud-provider=aws
--skip-nodes-with-local-storage=false
--expander=least-waste
--node-group-auto-discovery=asg:tag=k8s.io/cluster-autoscaler/enabled,k8s.io/cluster-autoscaler/mixedCluster
Limits:
cpu: 100m
memory: 300Mi
Requests:
cpu: 100m
memory: 300Mi
Environment: <none>
Mounts:
/etc/ssl/certs/ca-certificates.crt from ssl-certs (ro)
Volumes:
ssl-certs:
Type: HostPath (bare host directory volume)
Path: /etc/ssl/certs/ca-bundle.crt
HostPathType:
Conditions:
Type Status Reason
---- ------ ------
Available True MinimumReplicasAvailable
Progressing True NewReplicaSetAvailable
OldReplicaSets: <none>
NewReplicaSet: cluster-autoscaler-54ccd944f6 (1/1 replicas created)
Events: <none>
我的尝试:
我试图用这个命令手动缩小它:
eksctl scale nodegroup --cluster=mixedCluster --nodes=1 --name=scale-spot
它不起作用,并返回:
[ℹ] scaling nodegroup stack "eksctl-mixedCluster-nodegroup-scale-spot" in cluster eksctl-mixedCluster-cluster
[ℹ] no change for nodegroup "scale-spot" in cluster "eksctl-mixedCluster-cluster": nodes-min 1, desired 1, nodes-max 10