Multimodal Continual Learning with Sonographer Eye-Tracking in Fetal Ultrasound

Abstract

Deep networks have been shown to achieve impressive accuracy on some medical image analysis tasks where large datasets and annotations are available. However, learning over new sets of classes that arrive over an extended period is a different and difficult challenge, because performance on old classes tends to degrade while the network adapts to new ones. Controlling this ‘forgetting’ is vital if deployed algorithms are to evolve incrementally as new data arrive. Incremental learning approaches usually rely on expert knowledge in the form of manual annotations or active feedback. In this paper, we explore the role that other forms of expert knowledge might play in making deep networks for medical image analysis immune to forgetting over extended time. We introduce a novel framework for mitigating this forgetting effect in deep networks, considering the case of combining ultrasound video with the point-of-gaze tracked from expert sonographers during model training. This is used together with a novel weighted distillation strategy that reduces the propagation of effects due to class imbalance.

Publication
International Workshop on Advances in Simplifying Medical Ultrasound - ASMUS 2021, Medical Image Computing and Computer Assisted Intervention – MICCAI 2021

BibTeX

@InProceedings{patra2021multimodal,
author="Patra, Arijit and Cai, Yifan and Chatelain, Pierre and Sharma, Harshita and Drukker, Lior and Papageorghiou, Aris T. and Noble, J. Alison",
editor="Noble, J. Alison and Aylward, Stephen and Grimwood, Alexander and Min, Zhe and Lee, Su-Lin and Hu, Yipeng",
title="Multimodal Continual Learning with Sonographer Eye-Tracking in Fetal Ultrasound",
booktitle="Simplifying Medical Ultrasound",
year="2021",
publisher="Springer International Publishing",
address="Cham",
pages="14--24",
abstract="Deep networks have been shown to achieve impressive accuracy for some medical image analysis tasks where large datasets and annotations are available. However, tasks involving learning over new sets of classes arriving over extended time is a different and difficult challenge due to the tendency of reduction in performance over old classes while adapting to new ones. Controlling such a `forgetting' is vital for deployed algorithms to evolve with new arrivals of data incrementally. Usually, incremental learning approaches rely on expert knowledge in the form of manual annotations or active feedback. In this paper, we explore the role that other forms of expert knowledge might play in making deep networks in medical image analysis immune to forgetting over extended time. We introduce a novel framework for mitigation of this forgetting effect in deep networks considering the case of combining ultrasound video with point-of-gaze tracked for expert sonographers during model training. This is used along with a novel weighted distillation strategy to reduce the propagation of effects due to class imbalance.",
isbn="978-3-030-87583-1"
}
