GENERATED FROM PYTHON SOURCE LINES 10-22

.. code-block:: Python

    from pathlib import Path

    import torch
    from torchvision import datasets
    from torchvision import transforms

    import deepinv as dinv
    from torch.utils.data import DataLoader
    from deepinv.optim.data_fidelity import L2
    from deepinv.unfolded import unfolded_builder
    from deepinv.utils.demo import get_data_home

.. GENERATED FROM PYTHON SOURCE LINES 23-26

Set up paths for data loading and results.
------------------------------------------

.. GENERATED FROM PYTHON SOURCE LINES 26-38

.. code-block:: Python

    BASE_DIR = Path(".")
    DATA_DIR = BASE_DIR / "measurements"
    RESULTS_DIR = BASE_DIR / "results"
    CKPT_DIR = BASE_DIR / "ckpts"
    ORIGINAL_DATA_DIR = get_data_home()

    # Set the global random seed from pytorch to ensure reproducibility of the example.
    torch.manual_seed(0)

    device = dinv.utils.get_freer_gpu() if torch.cuda.is_available() else "cpu"

.. GENERATED FROM PYTHON SOURCE LINES 39-42

Load base image datasets and degradation operators.
---------------------------------------------------

In this example, we use MNIST as the base dataset.

.. GENERATED FROM PYTHON SOURCE LINES 42-58

.. code-block:: Python

    img_size = 28
    n_channels = 1
    operation = "compressed-sensing"
    train_dataset_name = "MNIST_train"

    # Generate training and evaluation datasets in HDF5 folders and load them.
    train_test_transform = transforms.Compose([transforms.ToTensor()])
    train_base_dataset = datasets.MNIST(
        root=ORIGINAL_DATA_DIR, train=True, transform=train_test_transform, download=True
    )
    test_base_dataset = datasets.MNIST(
        root=ORIGINAL_DATA_DIR, train=False, transform=train_test_transform, download=True
    )

The backbone algorithm we unfold is the proximal gradient algorithm, which
minimizes the following objective function

.. math::

    \begin{equation}
    \tag{1}
    \min_x \frac{1}{2} \|y - Ax\|_2^2 + \lambda \|Wx\|_1
    \end{equation}

where :math:`y` denotes the measurements, :math:`A` the forward operator,
:math:`W` a wavelet transform, and :math:`\lambda` the regularization parameter.
The proximal gradient iteration (see also
:class:`deepinv.optim.optim_iterators.PGDIteration`) is defined as

.. math::

    x_{k+1} = \text{prox}_{\gamma \lambda g}(x_k - \gamma A^T (Ax_k - y))

where :math:`\gamma` is the stepsize and :math:`\text{prox}_{g}` is the
proximity operator of :math:`g(x) = \|Wx\|_1`, which corresponds to
soft-thresholding in a wavelet basis (see :class:`deepinv.optim.WaveletPrior`).

We use :func:`deepinv.unfolded.unfolded_builder` to define the unfolded
algorithm and set both the stepsizes :math:`\gamma` (``stepsize``) and the
soft-thresholding parameters :math:`\lambda` (``lambda``) of the LISTA algorithm
as learnable parameters. These parameters are initialized with a list of length
``max_iter``, yielding a distinct ``stepsize`` and ``lambda`` value for each
iteration of the algorithm.

.. GENERATED FROM PYTHON SOURCE LINES 122-139

.. code-block:: Python

    # Select the data fidelity term
    data_fidelity = L2()

    max_iter = 30 if torch.cuda.is_available() else 10  # Number of unrolled iterations
    stepsize = [torch.ones(1, device=device)] * max_iter  # initialization of the stepsizes.
    # A distinct stepsize is trained for each iteration.

    # Set up the trainable denoising prior; here, the soft-threshold in a wavelet basis.
    # If the prior is initialized with a list of length max_iter,
    # then a distinct weight is trained for each PGD iteration.
    # To share one trained prior across all iterations, initialize with a single model.
    level = 3
    prior = [
        dinv.optim.WaveletPrior(wv="db8", level=level, device=device)
        for _ in range(max_iter)
    ]

.. GENERATED FROM PYTHON SOURCE LINES 140-152

In practice, it is common to apply a different thresholding parameter to each
wavelet sub-band. The thresholding parameter is then a tensor of shape
``(n_levels, n_wavelet_subbands)`` and problem (1) is reformulated as

.. math::

    \begin{equation}
    \min_x \frac{1}{2} \|y - Ax\|_2^2 + \sum_{i, j} \lambda_{i, j} \|\left(Wx\right)_{i, j}\|_1
    \end{equation}

where :math:`\lambda_{i, j}` is the thresholding parameter for the wavelet
sub-band :math:`j` at level :math:`i`. Note that in this case the prior is a
list of elements containing the terms
:math:`\|\left(Wx\right)_{i, j}\|_1=g_{i, j}(x)`, and the dimensions of the
thresholding parameter must match those of :math:`g_{i, j}`.

.. GENERATED FROM PYTHON SOURCE LINES 153-182

.. code-block:: Python

    # Unrolled optimization algorithm parameters
    # Initialization of the regularization parameter:
    # one thresholding parameter per wavelet level and sub-band.
    lamb = [
        torch.ones(level, 3, device=device) * 0.01
    ] * max_iter  # A distinct lamb is trained for each iteration.

    params_algo = {  # wrap all the restoration parameters in a 'params_algo' dictionary
        "stepsize": stepsize,
        "lambda": lamb,
    }

    trainable_params = [
        "stepsize",
        "lambda",
    ]  # define which parameters from 'params_algo' are trainable

    # Define the unfolded trainable model.
    model = unfolded_builder(
        iteration="PGD",
        params_algo=params_algo.copy(),
        trainable_params=trainable_params,
        data_fidelity=data_fidelity,
        max_iter=max_iter,
        prior=prior,
    ).to(device)

.. GENERATED FROM PYTHON SOURCE LINES 183-189

Define the training parameters.
-------------------------------

We now define the training-related parameters: the number of epochs, the
optimizer (Adam) and its hyperparameters, and the train and test batch sizes.

.. GENERATED FROM PYTHON SOURCE LINES 189-215

.. code-block:: Python

    # Training parameters
    epochs = 5 if torch.cuda.is_available() else 3
    learning_rate = 0.01

    # Choose optimizer
    optimizer = torch.optim.Adam(model.parameters(), lr=learning_rate)

    # Choose supervised training loss
    losses = [dinv.loss.SupLoss(metric=dinv.metric.MSE())]

    # Logging parameters
    verbose = True
    wandb_vis = False  # plot curves and images in Weights & Biases

    # Batch sizes and data loaders
    train_batch_size = 64 if torch.cuda.is_available() else 1
    test_batch_size = 64 if torch.cuda.is_available() else 8

    train_dataloader = DataLoader(
        train_dataset, batch_size=train_batch_size, num_workers=num_workers, shuffle=True
    )
    test_dataloader = DataLoader(
        test_dataset, batch_size=test_batch_size, num_workers=num_workers, shuffle=False
    )

.. GENERATED FROM PYTHON SOURCE LINES 216-221

Train the network.
------------------

We train the network using the :class:`deepinv.Trainer` class.

.. GENERATED FROM PYTHON SOURCE LINES 221-239

.. code-block:: Python

    trainer = dinv.Trainer(
        model,
        physics=physics,
        train_dataloader=train_dataloader,
        eval_dataloader=test_dataloader,
        epochs=epochs,
        losses=losses,
        optimizer=optimizer,
        device=device,
        save_path=str(CKPT_DIR / operation),
        verbose=verbose,
        show_progress_bar=False,  # disable progress bar for better vis in sphinx gallery.
        wandb_vis=wandb_vis,  # training visualization can be done in Weights & Biases
    )

    model = trainer.train()

.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    The model has 100 trainable parameters
    Train epoch 0: TotalLoss=0.06, PSNR=12.371
    Eval epoch 0: PSNR=12.678
    Train epoch 1: TotalLoss=0.062, PSNR=12.219
    Eval epoch 1: PSNR=12.679
    Train epoch 2: TotalLoss=0.062, PSNR=12.22
    Eval epoch 2: PSNR=12.679

.. GENERATED FROM PYTHON SOURCE LINES 240-247

Test the network.
-----------------

We now test the learned unrolled network on the test dataset.
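The PSNR values reported in the training logs follow directly from the mean squared error. As a minimal sketch, assuming images scaled to [0, 1] and a per-image PSNR averaged over the batch, the computation looks roughly like this in NumPy. The helper name ``psnr`` and its ``max_pixel`` argument are illustrative and not part of deepinv, whose metric may use a different normalization or reduction.

```python
import numpy as np


def psnr(x_hat, x, max_pixel=1.0):
    """Peak signal-to-noise ratio in dB, averaged over the batch.

    Hypothetical helper for illustration only; assumes arrays of shape
    (batch, channels, height, width) with values in [0, max_pixel].
    """
    x_hat = np.asarray(x_hat, dtype=np.float64)
    x = np.asarray(x, dtype=np.float64)
    # Mean squared error per image (reduce over all non-batch axes).
    mse = np.mean((x_hat - x) ** 2, axis=tuple(range(1, x.ndim)))
    # PSNR = 10 * log10(max_pixel^2 / MSE), then average over the batch.
    return float(np.mean(10.0 * np.log10(max_pixel**2 / mse)))


# Toy check: a constant error of 0.1 gives MSE = 0.01, i.e. about 20 dB.
x = np.zeros((2, 1, 28, 28))
x_hat = x + 0.1
print(psnr(x_hat, x))
```

Under this convention, an MSE near 0.06, as in the training log, corresponds to roughly 12 dB.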
In the plotted results, the `Linear` column shows the measurements
back-projected into the image domain, the `Reconstruction` column shows the
output of our LISTA network, and `Ground truth` shows the original image.

.. GENERATED FROM PYTHON SOURCE LINES 247-268

.. code-block:: Python

    trainer.test(test_dataloader)

    test_sample, _ = next(iter(test_dataloader))
    model.eval()
    test_sample = test_sample.to(device)

    # Get the measurements and the ground truth
    y = physics(test_sample)
    with torch.no_grad():  # it is important to disable gradient computation during testing.
        rec = model(y, physics=physics)

    backprojected = physics.A_adjoint(y)

    dinv.utils.plot(
        [backprojected, rec, test_sample],
        titles=["Linear", "Reconstruction", "Ground truth"],
        suptitle="Reconstruction results",
    )

.. image-sg:: /auto_examples/unfolded/images/sphx_glr_demo_LISTA_001.png
   :alt: Reconstruction results, Linear, Reconstruction, Ground truth
   :srcset: /auto_examples/unfolded/images/sphx_glr_demo_LISTA_001.png
   :class: sphx-glr-single-img

.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Eval epoch 0: PSNR=12.679, PSNR no learning=11.288
    Test results:
    PSNR no learning: 11.288 +- 1.795
    PSNR: 12.679 +- 1.296

.. GENERATED FROM PYTHON SOURCE LINES 269-271

Plotting the learned parameters.
--------------------------------

.. 