GitHub - andreistefanciprian/k8s-service-endpoint-prom-exporter: Prometheus exporter built in Python that publishes kubernetes service availability metrics

Endpoint Metrics Collector for Kubernetes Services

This tool monitors a Kubernetes service and collects valuable endpoint metrics, enabling you to gain insights into the health and readiness of your service's pods. The collected metrics are particularly useful for understanding the availability and responsiveness of your service's underlying infrastructure.

Collected Metrics

The tool captures the following metrics for a Kubernetes service:

srv_ready_pods: Displays the current number of service endpoints (pods) that are successfully passing Kubernetes startup, readiness, and liveness probes.
srv_not_ready_pods: Shows the current number of service pods that are not ready to serve traffic. This metric encompasses various scenarios:
- Pods that are failing Kubernetes startup, readiness, or liveness probes.
- Pods that encounter issues while pulling container images, among other startup challenges.
- Please note that this metric doesn't account for pods that are unscheduled due to different constraints.

Export Options

The collected metrics can be easily exported to Prometheus in the Prometheus format, allowing for seamless integration with Prometheus monitoring systems and dashboards.

Local Script Execution

To run the script locally, from outside the cluster, follow these steps:

Create a Python 3 virtual environment:
```
python3 -m venv venv
```
Activate the virtual environment:
```
source venv/bin/activate
```
Install required pip packages:
```
pip install -r requirements.txt
```

Run the script:

python main.py --service-name foo --namespace-name default --polling-interval 2

Or Kubernetes Deployment

To run the script as a Kubernetes deployment, follow these steps:

Build and push the container image:

docker build -f Dockerfile -t andreistefanciprian/endpoints-prom-exporter:latest .
docker image push andreistefanciprian/endpoints-prom-exporter

Apply Kubernetes resources for deployment, RBAC, and ServiceMonitor:

kubectl apply -f k8s/foo_deployment.yaml
kubectl apply -f k8s/rbac.yaml
kubectl apply -f k8s/servicemonitor.yaml
kubectl apply -f k8s/deployment.yaml

Testing

For testing purposes, use the following commands:

Check application logs while testing:

kubectl logs -l app=endpoints-prom-exporter -f

Access the metrics page at http://localhost:9153/

Execute various test scenarios to observe how the metrics change and adapt:

# Simulate failures
kubectl set image pod foo-f88c97f79-5dvph foo=nginx:fail
kubectl set image deployment foo foo=nginx:1.12.0
kubectl scale deployment foo -n default --replicas 0
kubectl scale deployment foo -n default --replicas 10
kubectl set image deployment foo foo=nginx:fail

# Fail liveness probe on one of the pods
kubectl exec -ti foo-f88c97f79-5dvph rm /usr/share/nginx/html/index.html

# Fail liveness probe on all pods
for pod in $(kubectl get pods --no-headers | grep foo | grep Running | awk '{print $1}'); do kubectl exec -ti $pod rm /usr/share/nginx/html/index.html; done

# Fail startup/readiness probe by redeploying with incorrect port number for these probes

Note: The srv_not_ready_pods counter doesn't capture pods that can't be scheduled for any reason. To simulate this, follow these steps:

# Cordon all nodes
for node in $(kubectl get nodes --no-headers | awk '{print $1}'); do kubectl cordon $node; done

# Scale the deployment
kubectl scale deployment foo -n default --replicas 5

# Uncordon all nodes
for node in $(kubectl get nodes --no-headers | awk '{print $1}'); do kubectl uncordon $node; done

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
k8s		k8s
.gitignore		.gitignore
Dockerfile		Dockerfile
main.py		main.py
prometheus_metrics_screenshot.png		prometheus_metrics_screenshot.png
readme.md		readme.md
requirements.txt		requirements.txt
srv_not_ready_pods.png		srv_not_ready_pods.png
srv_ready_pods.png		srv_ready_pods.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Endpoint Metrics Collector for Kubernetes Services

Collected Metrics

Export Options

Local Script Execution

Or Kubernetes Deployment

Testing

About

Releases

Packages

Languages

andreistefanciprian/k8s-service-endpoint-prom-exporter

Folders and files

Latest commit

History

Repository files navigation

Endpoint Metrics Collector for Kubernetes Services

Collected Metrics

Export Options

Local Script Execution

Or Kubernetes Deployment

Testing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages