2

我正在尝试在全新的 k8s 集群中使用操作员创建普罗米修斯我使用以下文件,

  1. 我正在创建一个命名空间监控
  2. 应用这个文件,它工作正常

apiVersion: apps/v1beta2
kind: Deployment
metadata:
  labels:
    k8s-app: prometheus-operator
  name: prometheus-operator
  namespace: monitoring
spec:
  replicas: 2
  selector:
    matchLabels:
      k8s-app: prometheus-operator
  template:
    metadata:
      labels:
        k8s-app: prometheus-operator
    spec:
      priorityClassName: "operator-critical"
      tolerations:
      - key: "WorkGroup"
        operator: "Equal"
        value: "operator"
        effect: "NoSchedule"
      - key: "WorkGroup"
        operator: "Equal"
        value: "operator"
        effect: "NoExecute"
      containers:
      - args:
        - --kubelet-service=kube-system/kubelet
        - --logtostderr=true
        - --config-reloader-image=quay.io/coreos/configmap-reload:v0.0.1
        - --prometheus-config-reloader=quay.io/coreos/prometheus-config-reloader:v0.29.0
        image: quay.io/coreos/prometheus-operator:v0.29.0
        name: prometheus-operator
        ports:
        - containerPort: 8080
          name: http
        securityContext:
          allowPrivilegeEscalation: false
          readOnlyRootFilesystem: true
      nodeSelector:
      serviceAccountName: prometheus-operator

现在我想应用这个文件(CRD)

apiVersion: monitoring.coreos.com/v1
kind: Prometheus
metadata:
  name: prometheus
  namespace: monitoring
  labels: 
    prometheus: prometheus
spec:
  replica: 1
  priorityClassName: "operator-critical"
  serviceAccountName: prometheus
  nodeSelector:
        worker.garden.sapcloud.io/group: operator
  serviceMonitorNamespaceSelector: {}
  serviceMonitorSelector:
    matchLabels:
      role: observeable
  tolerations:
  - key: "WorkGroup"
    operator: "Equal"
    value: "operator"
    effect: "NoSchedule"
  - key: "WorkGroup"
    operator: "Equal"
    value: "operator"
    effect: "NoExecute"

在我创建那些 CRD 之前

https://github.com/coreos/prometheus-operator/tree/master/example/prometheus-operator-crd

Pod 无法启动的问题(0/2),见下图。可能是什么问题呢?请指教

在此处输入图像描述

更新

当我参加舞会运营商的活动时,我看到以下错误creating: pods "prometheus-operator-6944778645-" is forbidden: no PriorityClass with name operator-critical was found replicaset-controller,知道吗?

4

2 回答 2

2

您正在尝试引用operator-critical 优先级。优先级类确定 pod 的优先级及其资源分配。

要解决此问题,您可以删除priorityClassName: "operator-critical"两个文件中的显式优先级 class( ) 或创建operator-critical该类:

apiVersion: scheduling.k8s.io/v1beta1
kind: PriorityClass
metadata:
  name: operator-critical
value: 1000000
globalDefault: false
description: "Critical operator workloads"
于 2019-03-15T07:28:51.587 回答
-1

Prometheus 和警报管理器 pod 需要持久卷来存储数据。确保这些 pv 存在并绑定到相应的 pod。或者,您可以使这些 pod 短暂。它应该工作

于 2019-03-14T19:52:52.140 回答