๐Ÿ“‰TensorBoard

TensorBoardarrow-up-right provides real-time visualization of training metrics. SLM Lab logs data for TensorBoard in dev mode only (not train mode) to minimize overhead during full training runs.

Quick Start

TensorBoard event files are generated only in dev mode:

# Run in dev mode to enable TensorBoard logging
slm-lab run slm_lab/spec/benchmark/ppo/ppo_cartpole.json ppo_cartpole dev

# Start TensorBoard
uv run tensorboard --log_dir=data

# Open in browser
# http://localhost:6006
circle-info

Train mode: In train mode, SLM Lab generates CSV files and graphs instead of TensorBoard events to reduce overhead. Use the generated graphs for analysis.

What's Logged

Category
Metrics
Use Case

Scalars

Rewards, loss, learning rate, FPS

Track training progress

Graphs

Neural network architecture

Verify model structure

Histograms

Action distributions, weight distributions

Debug policy behavior

Viewing Training Progress

Scalars Tab

Shows training metrics over time:

  • total_reward: Episode returns

  • total_reward_ma: Moving average (100 checkpoints)

  • loss: Training loss components

  • lr: Learning rate schedule

  • fps: Training throughput

Histograms Tab

Reveals distributions that change over training:

TensorBoard histograms

Action distributions: For continuous control (e.g., BipedalWalker with 4 actions), you'll see 4 histogram groups showing how action values evolve. As the agent learns, these distributions should shift and narrow.

Weight distributions: Model parameters grouped by layer. Healthy training shows gradual, stable changes. Sudden shifts may indicate instability.

Tips

Speed Up Loading

TensorBoard can be slow with many experiments. Specify a single run:

Compare Multiple Runs

Point to the parent directory to overlay runs:

Use the "Runs" selector in the UI to toggle visibility.

Remote Access

When training on a remote server:

TensorBoard vs SLM Lab Graphs

Feature
TensorBoard
SLM Lab Graphs

Real-time

Yes

No (generated at checkpoints)

Interactivity

Full zoom/pan

Basic (Plotly HTML)

Aggregation

Manual comparison

Automatic trial averaging

Publication-ready

Requires export

PNG ready to use

Use TensorBoard for debugging during training. Use SLM Lab's generated graphs for final results and publications.

Last updated

Was this helpful?