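I have set up https://github.com/actions-runner-controller/actions-runner-controller in my EKS cluster. For each runner, it creates a docker container and a runner container. To share Docker images between runners instead of pulling them every time, I created a volume that holds /var/lib/docker and mounted it into all of the docker containers. The problem is that sometimes (it does not fail with only 1 or 2 pods) the docker container fails to initialize with the following error: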
level=warning msg="grpc: addrConn.createTransport failed to connect to {unix:///var/run/docker/containerd/containerd.sock <nil> 0 <nil>}. Err :connection error: desc = \"transport: error while dialing: dial unix:///var/run/docker/containerd/containerd.sock: timeout\". Reconnecting..." module=grpc
I opened an issue in the repo, but I don't think the error is related to the controller. I suspect it is something on my side.
https://github.com/actions-runner-controller/actions-runner-controller/issues/847
Here are the YAML manifests:
apiVersion: actions.summerwind.dev/v1alpha1
kind: RunnerDeployment
metadata:
  name: github-runner-deploy
  namespace: actions-runner-system
spec:
  template:
    spec:
      organization: org-name
      volumes:
        - name: docker-data-ci
          persistentVolumeClaim:
            claimName: docker-data-ci
      dockerVolumeMounts:
        - mountPath: /var/lib/docker
          name: docker-data-ci
---
kind: PersistentVolumeClaim
apiVersion: v1
metadata:
  name: "docker-data-ci"
spec:
  storageClassName: gp2
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 100Gi