Self-supervised learning with measurement splitting
We demonstrate self-supervised learning with measurement splitting, to train a denoiser network on the MNIST dataset. The physics here is noisy computed tomography, as is the case in Noise2Inverse. Note this example can also be easily applied to undersampled multicoil MRI as is the case in SSDU.
Measurement splitting constructs a ground-truth-free loss \(\frac{m}{m_2}\| y_2 - A_2\, \hat{f}(y_1, A_1)\|^2\) by splitting the measurement and the forward operator into two parts \((y_1, A_1)\) and \((y_2, A_2)\) using a randomly generated mask, where \(\hat{f}\) denotes the reconstruction network, \(m\) is the total number of measurements and \(m_2\) is the number of measurements in the second (held-out) split.
See deepinv.loss.SplittingLoss for full details.
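As a rough illustration of the idea, the loss can be sketched in plain PyTorch. This is only a sketch, not the deepinv implementation: the function name, the Bernoulli mask and the normalisation are assumptions, and the actual loss also passes the masked operator \(A_1\) to the network.

import torch

def splitting_loss_sketch(y, physics, model, split_ratio=0.6):
    # Randomly split the measurement y into an input part y1 and a held-out part y2.
    mask = (torch.rand_like(y) < split_ratio).float()
    y1, y2 = mask * y, (1 - mask) * y
    # Reconstruct from the input split only (the full implementation also masks
    # the forward operator that is passed to the network).
    x_hat = model(y1, physics)
    # Penalise the residual on the held-out measurements y2 = (1 - mask) * y.
    residual = (1 - mask) * (physics.A(x_hat) - y)
    m2 = (1 - mask).sum().clamp(min=1)
    return residual.pow(2).sum() / m2  # mean squared error over held-out entries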
import deepinv as dinv
from torch.utils.data import DataLoader
import torch
from torchvision import transforms, datasets
from deepinv.models.utils import get_weights_url
torch.manual_seed(0)
device = dinv.utils.get_freer_gpu() if torch.cuda.is_available() else "cpu"
Define loss
Our implementation has several optional parameters that control how the splitting is performed. For example, you can:

- Use split_ratio to set the ratio of pixels used in the forward pass vs the loss;
- Define custom masking methods using a mask_generator, such as deepinv.physics.generator.BernoulliSplittingMaskGenerator or deepinv.physics.generator.GaussianSplittingMaskGenerator (a sketch follows the loss definition below);
- Use eval_n_samples to set how many realisations of the random mask are used at evaluation time;
- Optionally disable measurement splitting at evaluation time using eval_split_input (as is the case in SSDU);
- Average over both input and output masks at evaluation time using eval_split_output.

See deepinv.loss.SplittingLoss for details.
Note that after the model has been defined, the loss must also “adapt” the model.
loss = dinv.loss.SplittingLoss(split_ratio=0.6, eval_split_input=True, eval_n_samples=5)
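Alternatively, the splitting mask could be supplied explicitly via a mask generator. A sketch under some assumptions: the variable name loss_with_generator is illustrative, and tensor_size must match the shape of your measurements (here assumed to be a 1×28×28 sinogram for the tomography physics defined below).

mask_generator = dinv.physics.generator.BernoulliSplittingMaskGenerator(
    tensor_size=(1, 28, 28), split_ratio=0.6, device=device
)
loss_with_generator = dinv.loss.SplittingLoss(
    mask_generator=mask_generator, eval_split_input=True, eval_n_samples=5
)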
Prepare data
We use the torchvision MNIST dataset, and use noisy tomography physics (with the number of angles equal to the image size) for the forward operator.
Note
We use a subset of the whole training set to reduce the computational load of the example.
We recommend using the whole set by setting train_datapoints=test_datapoints=None to get the best results.
transform = transforms.Compose([transforms.ToTensor()])
train_dataset = datasets.MNIST(root=".", train=True, transform=transform, download=True)
test_dataset = datasets.MNIST(root=".", train=False, transform=transform, download=True)
physics = dinv.physics.Tomography(
angles=28,
img_width=28,
noise_model=dinv.physics.noise.GaussianNoise(0.1),
device=device,
)
deepinv_datasets_path = dinv.datasets.generate_dataset(
train_dataset=train_dataset,
test_dataset=test_dataset,
physics=physics,
device=device,
save_dir="MNIST",
train_datapoints=100,
test_datapoints=10,
)
train_dataset = dinv.datasets.HDF5Dataset(path=deepinv_datasets_path, train=True)
test_dataset = dinv.datasets.HDF5Dataset(path=deepinv_datasets_path, train=False)
train_dataloader = DataLoader(train_dataset, shuffle=True)
test_dataloader = DataLoader(test_dataset, shuffle=False)
Downloading https://ossci-datasets.s3.amazonaws.com/mnist/train-images-idx3-ubyte.gz to ./MNIST/raw/train-images-idx3-ubyte.gz
Extracting ./MNIST/raw/train-images-idx3-ubyte.gz to ./MNIST/raw
Downloading https://ossci-datasets.s3.amazonaws.com/mnist/train-labels-idx1-ubyte.gz to ./MNIST/raw/train-labels-idx1-ubyte.gz
Extracting ./MNIST/raw/train-labels-idx1-ubyte.gz to ./MNIST/raw
Downloading https://ossci-datasets.s3.amazonaws.com/mnist/t10k-images-idx3-ubyte.gz to ./MNIST/raw/t10k-images-idx3-ubyte.gz
Extracting ./MNIST/raw/t10k-images-idx3-ubyte.gz to ./MNIST/raw
Downloading https://ossci-datasets.s3.amazonaws.com/mnist/t10k-labels-idx1-ubyte.gz to ./MNIST/raw/t10k-labels-idx1-ubyte.gz
Extracting ./MNIST/raw/t10k-labels-idx1-ubyte.gz to ./MNIST/raw
Dataset has been saved at MNIST/dinv_dataset0.h5
Define model
We use a simple U-Net architecture with 2 scales as the denoiser network.
To reduce training time, we use a pretrained model. Here we demonstrate training with 100 images for 1 epoch, after loading a pretrained model that was trained with 1000 images for 20 epochs.
Note
When using the splitting loss, the model must be “adapted” by the loss, as its forward pass takes only a subset of the pixels, not the full image.
model = dinv.models.ArtifactRemoval(
dinv.models.UNet(in_channels=1, out_channels=1, scales=2).to(device), pinv=True
)
model = loss.adapt_model(model)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-8)
# Load pretrained model
file_name = "demo_measplit_mnist_tomography.pth"
url = get_weights_url(model_name="measplit", file_name=file_name)
ckpt = torch.hub.load_state_dict_from_url(
url, map_location=lambda storage, loc: storage, file_name=file_name
)
model.load_state_dict(ckpt["state_dict"])
optimizer.load_state_dict(ckpt["optimizer"])
Downloading: "https://huggingface.co/deepinv/measplit/resolve/main/demo_measplit_mnist_tomography.pth?download=true" to /home/runner/.cache/torch/hub/checkpoints/demo_measplit_mnist_tomography.pth
Train and test network
trainer = dinv.Trainer(
model=model,
physics=physics,
epochs=1,
losses=loss,
optimizer=optimizer,
device=device,
train_dataloader=train_dataloader,
plot_images=False,
save_path=None,
verbose=True,
show_progress_bar=False,
no_learning_method="A_dagger", # use pseudo-inverse as no-learning baseline
)
model = trainer.train()
The model has 444737 trainable parameters
Train epoch 0: TotalLoss=0.032, PSNR=29.007
Test and visualise the model outputs using a small test set. The output is averaged over 5 realisations of the random splitting mask (eval_n_samples=5). The trained model improves on the no-learning reconstruction by ~7 dB.
trainer.plot_images = True
trainer.test(test_dataloader)
Eval epoch 0: PSNR=31.238, PSNR no learning=24.549
Test results:
PSNR no learning: 24.549 +- 1.052
PSNR: 31.238 +- 2.738
{'PSNR no learning': np.float64(24.548789978027344), 'PSNR no learning_std': np.float64(1.0523070216572739), 'PSNR': np.float64(31.23841247558594), 'PSNR_std': np.float64(2.73807144244024)}
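For completeness, a single test image could also be reconstructed directly with the adapted model and compared with the pseudo-inverse baseline. A sketch (variable names are illustrative):

x, y = next(iter(test_dataloader))      # ground-truth image and measurement pair
x, y = x.to(device), y.to(device)
with torch.no_grad():
    x_hat = model(y, physics)           # averaged over eval_n_samples mask realisations
dinv.utils.plot(
    [x, physics.A_dagger(y), x_hat],
    titles=["Ground truth", "Pseudo-inverse", "Reconstruction"],
)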
Demonstrate the effect of not averaging over multiple realisations of the splitting mask at evaluation time by setting eval_n_samples=1. This gives worse performance:
model.eval_n_samples = 1
trainer.test(test_dataloader)
Eval epoch 0: PSNR=29.202, PSNR no learning=24.549
Test results:
PSNR no learning: 24.549 +- 1.052
PSNR: 29.202 +- 2.439
{'PSNR no learning': np.float64(24.548789978027344), 'PSNR no learning_std': np.float64(1.0523070216572739), 'PSNR': np.float64(29.20185546875), 'PSNR_std': np.float64(2.4385367335731565)}
Furthermore, we can disable measurement splitting at evaluation time altogether by setting eval_split_input to False (as is done in SSDU). This is generally worse than averaging over multiple mask realisations:
model.eval_split_input = False
trainer.test(test_dataloader)
Eval epoch 0: PSNR=31.056, PSNR no learning=24.549
Test results:
PSNR no learning: 24.549 +- 1.052
PSNR: 31.056 +- 2.507
{'PSNR no learning': np.float64(24.548789978027344), 'PSNR no learning_std': np.float64(1.0523070216572739), 'PSNR': np.float64(31.055923461914062), 'PSNR_std': np.float64(2.5073385957994816)}
Total running time of the script: (0 minutes 9.979 seconds)