February 12, 2019 | Nabil Ibtehaz and M. Sohel Rahman
The paper "MultiResUNet: Rethinking the U-Net Architecture for Multimodal Biomedical Image Segmentation" by Nabil Ibtehaz and M. Sohel Rahman addresses the limitations of the classical U-Net architecture in medical image segmentation, particularly in handling multimodal images with varying scales and complexities. The authors propose a novel architecture, MultiResUNet, which incorporates modifications to enhance the performance of U-Net. These modifications include the introduction of MultiRes blocks, which use different convolutional filters to capture features at multiple scales, and Res paths, which add non-linear transformations to reduce the semantic gap between encoder and decoder features. The proposed architecture is evaluated on five diverse datasets, including fluorescence microscopy, electron microscopy, dermoscopy, endoscopy, and MRI images. The results show that MultiResUNet consistently outperforms U-Net, achieving relative improvements of 10.15%, 5.07%, 2.63%, 1.41%, and 0.62% across the datasets. MultiResUNet also demonstrates better performance in challenging scenarios, such as images with vague boundaries, perturbations, and outliers, and is more reliable in segmenting the majority class. The authors conclude that MultiResUNet can be a potential successor to the classical U-Net architecture, offering significant advancements in multimodal biomedical image segmentation.The paper "MultiResUNet: Rethinking the U-Net Architecture for Multimodal Biomedical Image Segmentation" by Nabil Ibtehaz and M. Sohel Rahman addresses the limitations of the classical U-Net architecture in medical image segmentation, particularly in handling multimodal images with varying scales and complexities. The authors propose a novel architecture, MultiResUNet, which incorporates modifications to enhance the performance of U-Net. These modifications include the introduction of MultiRes blocks, which use different convolutional filters to capture features at multiple scales, and Res paths, which add non-linear transformations to reduce the semantic gap between encoder and decoder features. The proposed architecture is evaluated on five diverse datasets, including fluorescence microscopy, electron microscopy, dermoscopy, endoscopy, and MRI images. The results show that MultiResUNet consistently outperforms U-Net, achieving relative improvements of 10.15%, 5.07%, 2.63%, 1.41%, and 0.62% across the datasets. MultiResUNet also demonstrates better performance in challenging scenarios, such as images with vague boundaries, perturbations, and outliers, and is more reliable in segmenting the majority class. The authors conclude that MultiResUNet can be a potential successor to the classical U-Net architecture, offering significant advancements in multimodal biomedical image segmentation.