Expand description
zernel autopilot — Autonomous training optimizer
Monitors training in real-time and automatically fixes problems:
- Detects GPU underutilization → suggests increasing DataLoader workers
- Detects memory pressure → suggests gradient checkpointing
- Detects NaN gradients → stops early and reports the layer
- Detects data bottleneck → suggests prefetching
- Tracks loss curve and detects divergence