About Milvus
Get Started
Concepts
User Guide
Data Import
Administration Guide
Tools
Integrations
Tutorials
FAQs
API Reference

Home
Docs
Administration Guide
Monitoring, Alerts & Logs
Monitoring
Deploy Monitoring Services

Deploying Monitoring Services on Kubernetes

This topic describes how to use Prometheus to deploy monitoring services for a Milvus cluster on Kubernetes.

Monitor metrics with Prometheus

Metrics are indicators providing information about the running status of your system. For example, with metrics, you can understand how much memory or CPU resources are consumed by a data node in Milvus. Being aware of the performance and status of the components in your Milvus cluster makes you well-informed and hence making better decisions and adjusting resource allocation in a more timely manner.

Generally, metrics are stored in a time series database (TSDB), like Prometheus, and the metrics are recorded with a timestamp. In the case of monitoring Milvus services, you can use Prometheus to pull data from endpoints set by exporters. Prometheus then exports metrics of each Milvus component at http://<component-host>:9091/metrics.

However, you might have several replicas for one component, which makes manual configuration of Prometheus too complicated. Therefore, you can use Prometheus Operator, an extension to Kubernetes, for automated and effective management of Prometheus monitoring instances. Using Prometheus Operator saves you the trouble of manually adding metric targets and service providers.

The ServiceMonitor Custom Resource Definition (CRD) enables you to declaratively define how a dynamic set of services are monitored. It also allows selecting which services to monitor with the desired configuration using label selections. With Prometheus Operator, you can introduce conventions specifying how metrics are exposed. New services can be automatically discovered following the convention you set without the need for manual reconfiguration.

The following image illustrates Prometheus workflow.

Prometheus_architecture

Prerequisites

This tutorial uses kube-prometheus to save you the trouble of installing and manually configuring each monitoring and alerting component.

Kube-prometheus collects Kubernetes manifests, Grafana dashboards, and Prometheus rules combined with documentation and scripts.

Before deploying monitoring services, you need to create a monitoring stack by using the configuration in the kube-prometheus manifests directory.

$ git clone https://github.com/prometheus-operator/kube-prometheus.git
$ cd kube-prometheus
$ kubectl apply --server-side -f manifests/setup
$ kubectl wait \
        --for condition=Established \
        --all CustomResourceDefinition \
        --namespace=monitoring
$ kubectl apply -f manifests/

The default prometheus-k8s clusterrole can not capture milvus' metrics, need to patch:

kubectl patch clusterrole prometheus-k8s --type=json -p='[{"op": "add", "path": "/rules/-", "value": {"apiGroups": [""], "resources": ["pods", "services", "endpoints"], "verbs": ["get", "watch", "list"]}}]'

To delete a stack, run kubectl delete --ignore-not-found=true -f manifests/ -f manifests/setup.

Deploy monitoring services on Kubernetes

1. Access the dashboards

Forward the Prometheus service to port 9090, and Grafana service to port 3000.

$ kubectl --namespace monitoring --address 0.0.0.0 port-forward svc/prometheus-k8s 9090
$ kubectl --namespace monitoring --address 0.0.0.0 port-forward svc/grafana 3000

2. Enable ServiceMonitor

The ServiceMonitor is not enabled for Milvus Helm by default. After installing the Prometheus Operator in the Kubernetes cluster, you can enable it by adding the parameter metrics.serviceMonitor.enabled=true.

With Helm

You can enable the ServiceMonitor by setting the parameter metrics.serviceMonitor.enabled=true as follows if you have installed Milvus Helm chart.

```
$ helm upgrade my-release milvus/milvus --set metrics.serviceMonitor.enabled=true --reuse-values
```

When the installation completes, use kubectl to check the ServiceMonitor resource.

With Milvus Operator

You can enable the ServiceMonitor as follows if you have installed Milvus using the Milvus Operator.

Run the following command to edit the MIlvus custom resource. The following command assumes that the custom resource is named my-release.
```
$ kubectl edit milvus my-release
```

Edit the spec.components.disableMetric field to false.

...
spec:
  components:
    disableMetric: false # set to true to disable metrics
...

Save and exit the editor.
Wait for the operator to reconcile the changes. You can check the status of the Milvus custom resource by running the following command.
```
$ kubectl get milvus my-release -o yaml
```

The status.components.metrics.serviceMonitor.enabled field should be true.

3. Check the metrics

After enabling the ServiceMonitor, you can access the Prometheus dashboard at http://localhost:9090/.

Click on the Status tab and then Targets. You should see the targets of the Milvus components.

Prometheus_targets

Click on the Graph tab and enter the expression up{job="default/my-release"} in the expression input box. You should see the metrics of the Milvus components.

Prometheus_graph

4. Check the ServiceMonitor

$ kubectl get servicemonitor

NAME                           AGE
my-release-milvus              54s

What’s next

If you have deployed monitoring services for the Milvus cluster, you might also want to learn to:
- Visualize Milvus metrics in Grafana
- Create an Alert for Milvus Services
- Adjust your resource allocation
If you are looking for information about how to scale a Milvus cluster:
- Learn scale a Milvus cluster
If you are interested in upgrading the Milvus version,
- Read the guide for upgrading Milvus cluster and that for upgrade Milvus standalone.

Deploying Monitoring Services on Kubernetes
Monitor metrics with Prometheus
Prerequisites
Deploy monitoring services on Kubernetes
1. Access the dashboards
2. Enable ServiceMonitor
3. Check the metrics
4. Check the ServiceMonitor
What's next

Try Managed Milvus for Free

Zilliz Cloud is hassle-free, powered by Milvus and 10x faster.

Get Started

Feedback

Was this page helpful?