Note
This project is a fork of DFin/Neural-Network-Visualisation.
Interactive web visualisation for a compact multi-layer perceptron trained on the MNIST handwritten digit dataset. Draw a digit, watch activations propagate through the network in 3D, and inspect real-time prediction probabilities.
This is still in a rough state and under active development. If you want something usable for a museum or similar setting, check back later. I have a couple of features in mind (such as connecting a tablet to draw digits) to make this a good educational visualisation.
- `index.html` / `assets/` – Static Three.js visualiser and UI assets.
- `exports/mlp_weights.json` – Default weights with timeline snapshots (generated from the latest training run).
- `training/mlp_train.py` – PyTorch helper to train the MLP (with Apple Metal acceleration when available) and export weights for the front-end.
- Install Python dependencies (PyTorch + torchvision):

  ```
  python3 -m pip install torch torchvision
  ```
- Generate Model Weights (Required): Since model weights are not stored in the repository, you must train the model locally to generate the `exports/` directory.

  ```
  python3 training/mlp_train.py
  ```

  This will download MNIST, train the model, and save the weights to `exports/mlp_weights.json`.

- (Optional) Generate Test Assets: The visualization uses pre-processed MNIST test images. These are usually included in `assets/data/`, but you can regenerate them if needed:

  ```
  python3 tools/mnist_assets/prepare_mnist_test_assets.py
  ```
- Launch a static file server from the repository root:

  ```
  python3 -m http.server 8000
  ```
- Open `http://localhost:8000` in your browser. Draw on the 28×28 grid (left-click to draw, right-click to erase) and explore the 3D network with the mouse or trackpad.
The model is trained on the MNIST dataset, a classic collection of handwritten digits.
- Input: 28×28 pixel grayscale images.
- Preprocessing: Images are normalized using the global MNIST mean (0.1307) and standard deviation (0.3081) before being fed into the network (see the sketch after this list).
- Frontend Data: The web visualizer uses a custom binary format for test images to ensure fast loading. This is generated by `tools/mnist_assets/prepare_mnist_test_assets.py`.
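For reference, this normalisation is what the standard torchvision pipeline produces; a minimal sketch is below (the exact transform chain in `training/mlp_train.py` may differ slightly):

```python
from torchvision import datasets, transforms

# Convert each 28x28 image to a [0, 1] tensor, then standardise with the global MNIST statistics.
transform = transforms.Compose([
    transforms.ToTensor(),                        # uint8 pixels -> float tensor in [0, 1]
    transforms.Normalize((0.1307,), (0.3081,)),   # (x - mean) / std per pixel
])

train_set = datasets.MNIST(root="data", train=True, download=True, transform=transform)
```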
The default model is a Small MLP (Multi-Layer Perceptron) designed for real-time 3D rendering; a code sketch follows the list below.
- Structure: Fully connected layers.
- Input: 784 neurons (28×28 flattened).
- Hidden: Configurable (default: 128, 64). Uses ReLU activation.
- Output: 10 neurons (one for each digit 0-9).
- Goal: To be lightweight enough for smooth browser visualization while still achieving reasonable accuracy (>95%).
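In PyTorch terms, the architecture above corresponds roughly to this sketch (hidden sizes are configurable via `--hidden-dims`; the actual module definition lives in `training/mlp_train.py`):

```python
import torch
from torch import nn

def build_mlp(hidden_dims=(128, 64), num_classes=10, input_dim=28 * 28) -> nn.Sequential:
    """Fully connected MLP: 784 inputs -> hidden layers with ReLU -> 10 output logits."""
    layers, prev = [nn.Flatten()], input_dim
    for width in hidden_dims:
        layers += [nn.Linear(prev, width), nn.ReLU()]
        prev = width
    layers.append(nn.Linear(prev, num_classes))
    return nn.Sequential(*layers)

model = build_mlp()                           # 784 -> 128 -> 64 -> 10
logits = model(torch.randn(1, 1, 28, 28))     # shape: (1, 10)
```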
The training pipeline (`training/mlp_train.py`) does more than fit the model: it also records the evolution of the network.
- Optimizer: Adam (learning rate 1e-3).
- Loss Function: Cross Entropy.
- Timeline Snapshots: Instead of just saving the final model, the script saves "snapshots" of weights at specific milestones (e.g., 50 images seen, 100 images, 1 epoch, etc.). This allows the frontend to "replay" the training process, showing how the network learns over time (see the sketch after this list).
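Conceptually, the snapshot mechanism works something like the function below; the milestone handling here is a simplified sketch, and the real milestone list and export format are defined in `training/mlp_train.py`:

```python
import copy

from torch import nn

def train_with_snapshots(model: nn.Module, loader, optimiser, milestones: list[int]) -> list[dict]:
    """Train the model and copy its weights the first time each milestone (in images seen) is crossed."""
    loss_fn = nn.CrossEntropyLoss()
    pending = sorted(milestones)
    snapshots, images_seen = [], 0
    while pending:                                   # keep cycling the data until every milestone is hit
        for images, labels in loader:
            optimiser.zero_grad()
            loss_fn(model(images), labels).backward()
            optimiser.step()
            images_seen += len(images)
            while pending and images_seen >= pending[0]:
                snapshots.append({
                    "images_seen": pending.pop(0),
                    "state": copy.deepcopy(model.state_dict()),
                })
            if not pending:
                break
    return snapshots
```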
- `training/mlp_train.py`: The core training script. Downloads MNIST, trains the PyTorch model, and exports the JSON weights + timeline.
- `tools/mnist_assets/prepare_mnist_test_assets.py`: Converts standard MNIST IDX files into compact binary blobs (`.bin`) and a manifest (`.json`) for the web frontend.
- `deploy.sh`: A helper script to package the current commit and deploy it to the `releases/` directory for hosting.
training/mlp_train.py trains a small MLP on MNIST and writes a JSON export the front-end consumes. Metal (MPS) is used automatically when available on Apple Silicon; otherwise the script falls back to CUDA or CPU.
Typical usage:
```
python3 training/mlp_train.py \
    --epochs 5 \
    --hidden-dims 128 64 \
    --batch-size 256 \
    --export-path exports/mlp_weights.json
```

Key options:
- `--hidden-dims`: Hidden layer sizes (default `128 64`). Keep the network modest so the visualisation stays responsive.
- `--epochs`: Minimum training epochs (default `5`). The script will automatically extend the run so the timeline hits the 50× dataset milestone.
- `--batch-size`: Mini-batch size (default `128`).
- `--device`: Force `mps`, `cuda`, or `cpu`. By default the script picks the best available backend (see the sketch after this list).
- `--skip-train`: Export the randomly initialised weights without running training (useful for debugging the pipeline).
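The automatic backend choice typically follows the standard PyTorch fallback pattern, roughly as sketched below (the actual logic in `training/mlp_train.py` may differ):

```python
from typing import Optional

import torch

def pick_device(requested: Optional[str] = None) -> torch.device:
    """Honour an explicit --device value, otherwise prefer MPS, then CUDA, then CPU."""
    if requested:
        return torch.device(requested)
    if torch.backends.mps.is_available():    # Apple Metal on Apple Silicon
        return torch.device("mps")
    if torch.cuda.is_available():             # NVIDIA GPU
        return torch.device("cuda")
    return torch.device("cpu")                # portable fallback
```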
After training, update VISUALIZER_CONFIG.weightUrl in assets/main.js if you export to a different location/name. Refresh the browser to load the new weights.
Every exported JSON now includes a `timeline` array spanning 35 checkpoints: densely spaced early snapshots (≈50, 120, 250, 500, 1k, 2k, 3.5k, 5.8k, 8.7k, 13k, 19.5k, 28.5k, 40k images), followed by dataset-multiple milestones from 1× through 50×. The JSON manifest stays small; each snapshot's weights are stored separately as float16-encoded files under `exports/<stem>/NNN_<id>.json`, and the front-end streams them on demand so you can scrub the timeline without downloading the entire 50× run up front. Re-export the weights with the updated script to generate fresh timeline data for your own runs.
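As a rough illustration of why the per-snapshot files stay small, the sketch below rounds each tensor to float16 before writing JSON; this encoding is an assumption for illustration only, so check the export code in `training/mlp_train.py` for the exact format and file naming:

```python
import json

import numpy as np
import torch

def export_snapshot(state_dict: dict[str, torch.Tensor], path: str) -> None:
    """Write one snapshot's tensors as float16-rounded nested lists (illustrative encoding only)."""
    payload = {
        name: np.asarray(tensor.detach().cpu(), dtype=np.float16).tolist()
        for name, tensor in state_dict.items()
    }
    with open(path, "w") as fh:
        json.dump(payload, fh)
```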
- The visualiser highlights the top-N (configurable) strongest incoming connections per neuron to keep the scene legible (see the sketch after this list).
- Colors encode activation sign and magnitude (cool tones for negative/low, warm tones for strong positive contributions).
- The default export (`exports/mlp_weights.json`) already includes timeline milestones from a multi-epoch training run. Retrain (and re-export) if you want to showcase a different progression.
- If you adjust the architecture, ensure the JSON export reflects the new layer sizes; the front-end builds the scene dynamically from that metadata.
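For reference, picking the strongest incoming connections per neuron amounts to a per-row top-k on the absolute weight matrix; a NumPy sketch is below (the front-end presumably does the equivalent in JavaScript):

```python
import numpy as np

def top_n_incoming(weights: np.ndarray, n: int = 8) -> np.ndarray:
    """Return, for each neuron (row), the column indices of its n strongest incoming weights.

    `weights` has shape (out_features, in_features), matching a fully connected layer's matrix.
    """
    n = min(n, weights.shape[1])
    # Sort each row by |w| in descending order and keep the first n column indices.
    return np.argsort(-np.abs(weights), axis=1)[:, :n]
```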
You can easily experiment with different network architectures by modifying the training command:
To train a deeper or wider network:
```
# A network with 3 hidden layers of size 64, 32, and 16
python3 training/mlp_train.py --hidden-dims 64 32 16
```

Note: Very large networks might reduce the frame rate of the 3D visualization.
To see how the network behaves with more training:
```
# Train for 10 epochs
python3 training/mlp_train.py --epochs 10
```

The server keeps live assets separate from active development under `releases/`:
- `releases/current/` – files served by your static HTTP server.
- `releases/backups/<timestamp>/` – point-in-time snapshots for quick rollback.
- `releases/.deploy_tmp/` – staging area used during deployment.
To publish the code you currently have checked out, run the deploy script from the repository root:
```
./deploy.sh
```

You can target a different commit or branch explicitly:

```
./deploy.sh <commit-ish>
```

The script exports the requested commit into the staging area, syncs it into `releases/current/`, and saves the same tree under `releases/backups/<timestamp>/` with the commit hash recorded in `.commit`.
