Metrics Reference
Complete catalog of metrics collected and exported by Telegen.
Self-Telemetry Metrics
Metrics about Telegen’s own operation.
Collection Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
signal_type |
Total spans collected |
|
Counter |
signal_type, endpoint |
Spans exported successfully |
|
Counter |
reason |
Spans dropped |
|
Counter |
- |
Metrics collected |
|
Counter |
endpoint |
Metrics exported |
|
Counter |
- |
Logs collected |
|
Counter |
endpoint |
Logs exported |
|
Counter |
- |
Profiles collected |
eBPF Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Gauge |
program_type |
Number of eBPF programs |
|
Gauge |
map_type |
Number of eBPF maps |
|
Gauge |
map_name |
Entries in each map |
|
Counter |
- |
Ring buffer events received |
|
Counter |
- |
Ring buffer events lost |
|
Counter |
cpu |
Perf buffer events |
|
Counter |
cpu |
Perf buffer events lost |
Export Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
endpoint, status |
Export requests |
|
Counter |
endpoint, error_type |
Export errors |
|
Histogram |
endpoint |
Export latency |
|
Histogram |
signal_type |
Batch sizes |
|
Gauge |
signal_type |
Current queue depth |
Process Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
- |
CPU time used |
|
Gauge |
- |
Memory usage |
|
Gauge |
- |
Virtual memory |
|
Gauge |
- |
Open file descriptors |
|
Gauge |
- |
Max file descriptors |
|
Gauge |
- |
Process start time |
|
Gauge |
- |
Number of goroutines |
|
Summary |
- |
GC pause duration |
Node Metrics (node_exporter Compatible)
When Node Exporter Fusion is enabled, Telegen exports Prometheus node_exporter compatible metrics.
CPU Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
cpu, mode |
CPU time per mode |
|
Counter |
cpu, mode |
Guest CPU time |
|
Gauge |
cpu |
CPU frequency |
|
Gauge |
cpu |
Max CPU frequency |
|
Gauge |
cpu |
Min CPU frequency |
Memory Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Gauge |
- |
Total memory |
|
Gauge |
- |
Free memory |
|
Gauge |
- |
Available memory |
|
Gauge |
- |
Buffer memory |
|
Gauge |
- |
Cached memory |
|
Gauge |
- |
Total swap |
|
Gauge |
- |
Free swap |
|
Gauge |
- |
Cached swap |
|
Gauge |
- |
Active memory |
|
Gauge |
- |
Inactive memory |
|
Gauge |
- |
Slab memory |
Disk Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
device |
Read operations |
|
Counter |
device |
Write operations |
|
Counter |
device |
Bytes read |
|
Counter |
device |
Bytes written |
|
Counter |
device |
Read time |
|
Counter |
device |
Write time |
|
Counter |
device |
Total I/O time |
|
Gauge |
device |
I/Os in progress |
|
Counter |
device |
Discard operations |
Filesystem Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Gauge |
device, fstype, mountpoint |
Total size |
|
Gauge |
device, fstype, mountpoint |
Free space |
|
Gauge |
device, fstype, mountpoint |
Available space |
|
Gauge |
device, fstype, mountpoint |
Total inodes |
|
Gauge |
device, fstype, mountpoint |
Free inodes |
|
Gauge |
device, fstype, mountpoint |
Read-only flag |
Network Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
device |
Bytes received |
|
Counter |
device |
Bytes transmitted |
|
Counter |
device |
Packets received |
|
Counter |
device |
Packets transmitted |
|
Counter |
device |
Receive errors |
|
Counter |
device |
Transmit errors |
|
Counter |
device |
Receive drops |
|
Counter |
device |
Transmit drops |
|
Gauge |
device |
Interface up status |
|
Gauge |
device |
Link speed |
|
Gauge |
device |
MTU |
Load Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Gauge |
- |
1-minute load average |
|
Gauge |
- |
5-minute load average |
|
Gauge |
- |
15-minute load average |
System Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Gauge |
- |
Boot time |
|
Counter |
- |
Context switches |
|
Counter |
- |
Forks |
|
Counter |
- |
Interrupts |
|
Gauge |
- |
Running processes |
|
Gauge |
- |
Blocked processes |
|
Gauge |
sysname, release, version, machine, nodename, domainname |
System info |
GPU Metrics
NVIDIA GPU metrics when AI/ML observability is enabled.
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Gauge |
gpu, uuid |
GPU utilization (0-1) |
|
Gauge |
gpu, uuid |
Memory used |
|
Gauge |
gpu, uuid |
Total memory |
|
Gauge |
gpu, uuid |
Memory utilization (0-1) |
|
Gauge |
gpu, uuid |
GPU temperature |
|
Gauge |
gpu, uuid |
Power usage |
|
Gauge |
gpu, uuid |
Power limit |
|
Gauge |
gpu, uuid |
Graphics clock |
|
Gauge |
gpu, uuid |
SM clock |
|
Gauge |
gpu, uuid |
Memory clock |
|
Counter |
gpu, uuid |
PCIe TX bytes |
|
Counter |
gpu, uuid |
PCIe RX bytes |
|
Counter |
gpu, uuid, type |
ECC errors |
|
Counter |
gpu, uuid, link |
NVLink TX |
|
Counter |
gpu, uuid, link |
NVLink RX |
|
Gauge |
gpu, uuid |
Compute processes |
|
Gauge |
gpu, uuid |
Graphics processes |
LLM Inference Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
model, endpoint |
Inference requests |
|
Counter |
model |
Input tokens |
|
Counter |
model |
Output tokens |
|
Histogram |
model |
TTFT latency |
|
Gauge |
model |
Token generation rate |
|
Histogram |
model |
Batch sizes |
|
Gauge |
model |
KV cache hit ratio |
|
Gauge |
model |
Request queue |
Network Flow Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
src, dst, protocol, direction |
Flow bytes |
|
Counter |
src, dst, protocol, direction |
Flow packets |
|
Counter |
protocol |
Connection count |
|
Gauge |
protocol |
Active connections |
|
Histogram |
src, dst |
Round-trip time |
|
Counter |
src, dst |
Retransmissions |
Database Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
db_type, operation |
Query count |
|
Histogram |
db_type, operation |
Query latency |
|
Gauge |
db_type, host |
Active connections |
|
Counter |
db_type, error_type |
Database errors |
|
Counter |
db_type, operation |
Rows affected |
Connection Statistics Metrics
Emitted when a TCP connection closes, providing per-connection byte throughput data.
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
src_ip, dst_ip, dst_port, protocol |
Bytes sent over the connection lifetime |
|
Counter |
src_ip, dst_ip, dst_port, protocol |
Bytes received over the connection lifetime |
These metrics complement per-request traces, giving aggregate throughput even for protocols that are not fully parsed.
Kafka Consumer Group Metrics
Kafka spans with consumer group context include the following additional span attribute:
Attribute |
Description |
|---|---|
|
Consumer group identifier, extracted from JoinGroup and SyncGroup Kafka protocol events |
This attribute appears on spans emitted for Fetch requests and group management operations (JoinGroup, SyncGroup). Use it to filter group-specific traces and correlate consumer lag:
# Count Fetch spans by consumer group
count(telegen_spans_collected_total{
messaging_system="kafka",
messaging_kafka_consumer_group_id=~".+"
}) by (messaging_kafka_consumer_group_id)
SNMP Metrics
SNMP metrics use the MIB object names with snmp_ prefix.
Interface Metrics (IF-MIB)
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
ifIndex, ifDescr |
Input octets |
|
Counter |
ifIndex, ifDescr |
Output octets |
|
Counter |
ifIndex, ifDescr |
Input packets |
|
Counter |
ifIndex, ifDescr |
Output packets |
|
Gauge |
ifIndex, ifDescr |
Operational status |
|
Gauge |
ifIndex, ifDescr |
Interface speed |
|
Counter |
ifIndex, ifDescr |
Input errors |
|
Counter |
ifIndex, ifDescr |
Output errors |
System Metrics (SNMPv2-MIB)
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Gauge |
- |
System uptime |
|
Info |
sysName |
System name |
|
Info |
sysDescr |
System description |
Storage Array Metrics
Common Storage Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Gauge |
array, pool |
Total capacity |
|
Gauge |
array, pool |
Used capacity |
|
Gauge |
array, pool |
Free capacity |
|
Counter |
array, volume |
Read IOPS |
|
Counter |
array, volume |
Write IOPS |
|
Counter |
array, volume |
Read throughput |
|
Counter |
array, volume |
Write throughput |
|
Histogram |
array, volume |
Read latency |
|
Histogram |
array, volume |
Write latency |
|
Gauge |
array, controller |
Controller health |
|
Gauge |
array, disk |
Disk health |
Security Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Counter |
event_type, severity |
Security events |
|
Counter |
syscall, comm |
Syscall counts |
|
Counter |
path, operation |
File access |
|
Counter |
binary |
Process executions |
|
Counter |
process, direction |
Network connections |
|
Counter |
type |
Privilege escalations |
Kubernetes Metrics
Metric |
Type |
Labels |
Description |
|---|---|---|---|
|
Gauge |
namespace |
Pods discovered |
|
Gauge |
namespace |
Services discovered |
|
Gauge |
namespace |
Deployments discovered |
Metric Labels
Common Labels
Applied to most metrics:
Label |
Description |
Example |
|---|---|---|
|
Hostname |
|
|
Service name |
|
|
Namespace |
|
|
Pod name |
|
|
K8s namespace |
|
|
K8s node |
|
|
Deployment |
|
|
Container ID |
|
Metric Naming Conventions
Telegen follows these conventions:
Prefix:
telegen_for Telegen-specific metricsnode_exporter:
node_prefix for compatibilitySNMP:
snmp_prefix with MIB object namesUnits: Suffix with unit (
_bytes,_seconds,_total)Type: Counter ends with
_total
Next Steps
Semantic Conventions - OpenTelemetry naming
Monitoring Telegen - Using these metrics
Node Exporter Fusion - node_exporter compatibility