Kubernetes Monitoring
Complete observability for your Kubernetes clusters. Monitor nodes, pods, containers, and services with automatic discovery, Prometheus compatibility, and intelligent alerting.
Full-Stack Kubernetes Observability
From cluster health to individual container metrics, get complete visibility into your Kubernetes environment.
Cluster Level
Control plane health, API server latency, etcd performance, scheduler metrics
Node Level
CPU, memory, disk, network, kubelet status, node conditions
Pod Level
Pod lifecycle, restart counts, resource requests vs limits, readiness
Container Level
CPU throttling, memory usage, OOM events, container states
What We Monitor
Control Plane
Monitor the heart of your Kubernetes cluster for reliability and performance.
- API Server request latency
- etcd health and latency
- Scheduler queue depth
- Controller manager metrics
- Certificate expiration
Nodes & Kubelet
Track node health and kubelet performance across your cluster.
- Node CPU, memory, disk
- Kubelet health status
- Node conditions (Ready, DiskPressure, etc.)
- Pod capacity and allocation
- Container runtime metrics
Pods & Containers
Deep visibility into workload performance and resource consumption.
- CPU usage and throttling
- Memory usage and OOM kills
- Restart counts and crash loops
- Resource requests vs limits
- Container states and events
Services & Networking
Monitor service endpoints and network connectivity.
- Service endpoint health
- Ingress traffic and latency
- Network policies effectiveness
- DNS resolution times
- Service mesh metrics (Istio, Linkerd)
Workload Resources
Track Deployments, StatefulSets, DaemonSets, and Jobs.
- Deployment replica status
- Rolling update progress
- StatefulSet ordering
- DaemonSet coverage
- Job and CronJob completion
Persistent Storage
Monitor PersistentVolumes and storage performance.
- PV/PVC binding status
- Storage capacity usage
- I/O throughput and latency
- StorageClass provisioning
- Volume mount errors
Kubernetes-Native Features
๐ Auto-Discovery
Automatically discover and monitor pods, services, and endpoints. No manual configuration needed as workloads scale.
๐ Prometheus Compatible
Native PromQL support. Scrape existing Prometheus endpoints. Use your existing recording rules and alerts.
๐ท๏ธ Label-Aware
Filter and aggregate by Kubernetes labels and annotations. Group metrics by namespace, deployment, or custom labels.
๐ Resource Optimization
Right-size resource requests and limits based on actual usage. Identify over-provisioned and under-provisioned workloads.
๐ Smart Alerting
Pre-configured alerts for common K8s issues: CrashLoopBackOff, pending pods, node NotReady, certificate expiry.
๐ Multi-Cluster
Monitor multiple Kubernetes clusters from a single dashboard. Compare performance across environments.
๐ฆ Helm Deployment
Deploy Bleemeo agent with a single Helm chart. GitOps-ready with full customization options.
๐ OpenTelemetry
Ingest traces and metrics via OpenTelemetry. Correlate infrastructure metrics with application traces.
Quick Setup with Helm
Add Bleemeo Helm Repository
Add the official Bleemeo Helm chart repository to your Helm installation.
helm repo add bleemeo-agent https://packages.bleemeo.com/bleemeo-agent/helm-charts
helm repo update Install the Agent
Deploy Glouton agent as a DaemonSet with your account credentials.
helm upgrade --install glouton bleemeo-agent/glouton \
--set account_id="your_account_id" \
--set registration_key="your_registration_key" \
--set config.kubernetes.clustername="my_k8s_cluster_name" \
--set namespace="default" View Your Cluster
Nodes, pods, and services appear automatically in your Bleemeo dashboard within seconds.
Pre-Built Kubernetes Alerts
Get notified about common Kubernetes issues before they impact your users.
Pod Issues
- CrashLoopBackOff detected
- Pod stuck in Pending
- High restart count
- OOMKilled containers
Node Issues
- Node NotReady
- High CPU/memory pressure
- Disk space low
- Too many pods scheduled
Cluster Issues
- API server errors
- etcd latency high
- Certificate expiring
- PVC pending
Workload Issues
- Deployment replicas unavailable
- StatefulSet not ready
- Job failed
- HPA at max replicas
Works With Your Stack
Why Bleemeo for Kubernetes?
Real-Time Visibility
See pod creation, scaling events, and failures as they happen. No delay in metrics collection.
Cost Optimization
Identify resource waste and right-size your workloads. Reduce cloud spending without impacting performance.
Lightweight Agent
Glouton uses minimal resources. Less than 100MB memory per node. Won't compete with your workloads.
13 Months Retention
Keep historical data for capacity planning and trend analysis. Compare performance over time.
Start Monitoring Your Kubernetes Clusters
Deploy in minutes. Get full visibility into your K8s infrastructure.