Skip to content

ingero-io/ingero

v0.8.1 Feature

This release adds 3 notable features for engineering teams evaluating rollout.

Published 2mo MCP Data & Storage
✓ No known CVEs patched
Read the diff → Tool health → What is this tool? →

✓ No known CVEs patched in this version

Topics

causal-tracing cuda cuda-graphs ebpf gpu gpu-monitoring
+11 more
gpu-observability incident-response kubernetes machine-learning mcp model-context-protocol nvidia observability pytorch sre distributed-tracing

Summary

AI summary

Aggregate flush starvation fixed for high‑throughput periods (400K+ events/min).

Full changelog

What's New

Seven fixes from RTX 4090 GPT-2 stress test analysis (5-phase, 237K+ events/min).

Highlights

  • DB compaction at shutdown — WAL checkpoint + VACUUM when >20% of pages are free. Integration test DB shrank from 57 MB to 2.7 MB (95% reduction)
  • Throughput-drop causal chains — new detection for when CUDA op rate drops >40% from peak but per-call latency stays flat. Catches GPU starvation that tail-ratio chains miss
  • Aggregate flush starvation fix — high-throughput periods (400K+ events/min) no longer starve the stats flusher. Event-count-based inline flush every 10K events
  • Process name persistence — dynamically discovered PIDs now have names in SQLite. explain --per-process shows process names instead of raw PIDs
  • Phase 5 block I/O visibility — checkpoint saves trigger fsync for block device tracepoint capture

Test Results

  • 224 unit tests
  • RTX 4090 GPU integration: 73 PASS, 0 FAIL, 1 SKIP, 6 WARN
  • 28/28 ML Eng AI-assisted investigations PASS
  • 944 causal chains including new throughput-drop chains

Quick Start

git clone https://github.com/ingero-io/ingero.git
cd ingero && make build
./bin/ingero demo incident    # No GPU needed

For real GPU tracing:

sudo ./bin/ingero trace --duration 30s
./bin/ingero explain --since 30s

Full changelog: see Release Notes

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Track ingero-io/ingero

Get notified when new releases ship.

Sign up free

About ingero-io/ingero

eBPF-based GPU causal observability agent with MCP server. Traces CUDA Runtime/Driver APIs and host kernel events to build causal chains explaining GPU latency.

All releases →

Related context

Earlier breaking changes

  • v0.17.0 Dropped 'annotate --socket' option from CLI.

Beta — feedback welcome: [email protected]