Kubernetes Monitoring

Complete observability for your Kubernetes clusters. Monitor nodes, pods, containers, and services with automatic discovery, Prometheus compatibility, and intelligent alerting.

Bleemeo Kubernetes Dashboard - Complete K8s cluster monitoring with node status, pod metrics, CPU requests, and memory usage

Full-Stack Kubernetes Observability

From cluster health to individual container metrics, get complete visibility into your Kubernetes environment.

Cluster Level

Control plane health, API server latency, etcd performance, scheduler metrics

Node Level

CPU, memory, disk, network, kubelet status, node conditions

Pod Level

Pod lifecycle, restart counts, resource requests vs limits, readiness

Container Level

CPU throttling, memory usage, OOM events, container states

What We Monitor

Control Plane

Monitor the heart of your Kubernetes cluster for reliability and performance.

  • API Server request latency
  • etcd health and latency
  • Scheduler queue depth
  • Controller manager metrics
  • Certificate expiration

Nodes & Kubelet

Track node health and kubelet performance across your cluster.

  • Node CPU, memory, disk
  • Kubelet health status
  • Node conditions (Ready, DiskPressure, etc.)
  • Pod capacity and allocation
  • Container runtime metrics

Pods & Containers

Deep visibility into workload performance and resource consumption.

  • CPU usage and throttling
  • Memory usage and OOM kills
  • Restart counts and crash loops
  • Resource requests vs limits
  • Container states and events

Services & Networking

Monitor service endpoints and network connectivity.

  • Service endpoint health
  • Ingress traffic and latency
  • Network policies effectiveness
  • DNS resolution times
  • Service mesh metrics (Istio, Linkerd)

Workload Resources

Track Deployments, StatefulSets, DaemonSets, and Jobs.

  • Deployment replica status
  • Rolling update progress
  • StatefulSet ordering
  • DaemonSet coverage
  • Job and CronJob completion

Persistent Storage

Monitor PersistentVolumes and storage performance.

  • PV/PVC binding status
  • Storage capacity usage
  • I/O throughput and latency
  • StorageClass provisioning
  • Volume mount errors

Kubernetes-Native Features

๐Ÿ” Auto-Discovery

Automatically discover and monitor pods, services, and endpoints. No manual configuration needed as workloads scale.

๐Ÿ“Š Prometheus Compatible

Native PromQL support. Scrape existing Prometheus endpoints. Use your existing recording rules and alerts.

๐Ÿท๏ธ Label-Aware

Filter and aggregate by Kubernetes labels and annotations. Group metrics by namespace, deployment, or custom labels.

๐Ÿ“ˆ Resource Optimization

Right-size resource requests and limits based on actual usage. Identify over-provisioned and under-provisioned workloads.

๐Ÿ”” Smart Alerting

Pre-configured alerts for common K8s issues: CrashLoopBackOff, pending pods, node NotReady, certificate expiry.

๐ŸŒ Multi-Cluster

Monitor multiple Kubernetes clusters from a single dashboard. Compare performance across environments.

๐Ÿ“ฆ Helm Deployment

Deploy Bleemeo agent with a single Helm chart. GitOps-ready with full customization options.

๐Ÿ”— OpenTelemetry

Ingest traces and metrics via OpenTelemetry. Correlate infrastructure metrics with application traces.

Quick Setup with Helm

1

Add Bleemeo Helm Repository

Add the official Bleemeo Helm chart repository to your Helm installation.

helm repo add bleemeo-agent https://packages.bleemeo.com/bleemeo-agent/helm-charts
helm repo update
2

Install the Agent

Deploy Glouton agent as a DaemonSet with your account credentials.

helm upgrade --install glouton bleemeo-agent/glouton \
    --set account_id="your_account_id" \
    --set registration_key="your_registration_key" \
    --set config.kubernetes.clustername="my_k8s_cluster_name" \
    --set namespace="default"
3

View Your Cluster

Nodes, pods, and services appear automatically in your Bleemeo dashboard within seconds.

Pre-Built Kubernetes Alerts

Get notified about common Kubernetes issues before they impact your users.

Pod Issues

  • CrashLoopBackOff detected
  • Pod stuck in Pending
  • High restart count
  • OOMKilled containers

Node Issues

  • Node NotReady
  • High CPU/memory pressure
  • Disk space low
  • Too many pods scheduled

Cluster Issues

  • API server errors
  • etcd latency high
  • Certificate expiring
  • PVC pending

Workload Issues

  • Deployment replicas unavailable
  • StatefulSet not ready
  • Job failed
  • HPA at max replicas

Works With Your Stack

Managed K8s EKS, GKE, AKS, DigitalOcean
Distributions OpenShift, Rancher, k3s, k0s
Service Mesh Istio, Linkerd, Consul Connect
Ingress NGINX, Traefik, HAProxy, Kong
Storage Ceph, Longhorn, OpenEBS, CSI
Observability Prometheus, Grafana, OpenTelemetry
CI/CD ArgoCD, Flux, Jenkins, GitLab
Databases PostgreSQL, MySQL, MongoDB, Redis

Why Bleemeo for Kubernetes?

Real-Time Visibility

See pod creation, scaling events, and failures as they happen. No delay in metrics collection.

Cost Optimization

Identify resource waste and right-size your workloads. Reduce cloud spending without impacting performance.

Lightweight Agent

Glouton uses minimal resources. Less than 100MB memory per node. Won't compete with your workloads.

13 Months Retention

Keep historical data for capacity planning and trend analysis. Compare performance over time.

Start Monitoring Your Kubernetes Clusters

Deploy in minutes. Get full visibility into your K8s infrastructure.