Skip to content

s-sd/system-II-vision

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Automating fast-and-slow thinking enhances machine vision

Shaheer U. Saeed [1,2∗], Yipei Wang [1,2], Veeru Kasivisvanathan [3], Brian R. Davidson [3], Matthew J. Clarkson [1,2], Yipeng Hu [1,2], Daniel C. Alexander [1,4]

[1] UCL Hawkes Institute, University College London, UK
[2] Department of Medical Physics and Biomedical Engineering, University College London, UK
[3] Division of Surgery and Interventional Sciences, University College London, UK
[4] Department of Computer Science, University College London, UK
[∗] Correspondence e-mail: shaheer.saeed.17@ucl.ac.uk

Citation

To-be-added

Description

This repository contains code to implement a dual-process method for machine cognition, which consists of two modules:

  1. The System I module: for fast decisions in familiar scenarios, trained adversarially across multiple tasks
  2. The System II module: for slow and effortful reasoning to refine solutions in novel tasks, using reinforcement learning self-play to propose, consider and implement decision strateiges, akin to human reasoning

See a description of the implementation here


How-to

Dependencies

python3.10 -m pip install tensorflow==2.13 shimmy>=2.0 gym==0.26 matplotlib numpy gym tqdm stable-baselines3

Code

cd system-II-vision
python3 training_example.py

Datasets used in our experiments

In addition to using the MNIST digit dataset and the ImageNet-LUSS dataset for our experiments, we also use medical imaging datasets as outlined below:

Dataset Link Organ Modality Role
Medical Segmentation Decathalon Spleen CT Training
Medical Segmentation Decathalon Liver Vessels CT Training
Multi-Atlas Labeling Beyond the Cranial Vault Gallbladder CT Training
Multi-Atlas Labeling Beyond the Cranial Vault Adrenal Gland CT Training
Multi-Atlas Labeling Beyond the Cranial Vault Major Vessels CT Training
Multi-Atlas Labeling Beyond the Cranial Vault Stomach CT Training
CT-ORG Kidneys CT Training
CT-ORG Bladder CT Training
CHAOS Liver MR Training
CHAOS Kidneys MR Training
CHAOS Spleen MR Training
AMOS Bladder MR Training
AMOS Gallbladder MR Training
AMOS Prostate MR Training
AMOS Major Vessels MR Training
Medical Segmentation Decathalon Prostate MR Training
PROMIS Prostate Tumour MR Evaluation
Medical Segmentation Decathalon Liver Tumour CT Evaluation
Medical Segmentation Decathalon Pancreas Tumour CT Evaluation
Medical Segmentation Decathalon Colon Tumour CT Evaluation
KITS Kidney Tumour CT Evaluation

Computer-vision experiments

Please refer to the example training script in training_example.py for the task of MNIST digit segmentation from noisy images, implemented using our proposed approach. The System I module is trained across digits 0-4 (see ln 88 in training_example.py) to predict initial solutions, as outline din the first pane of the figure below. This System I module is then adapted using 4 samples from each of the target digits 6-9 (see ln 222 in training_example.py). The System II module is then used to predict segmentations for the the target digits (see ln 258 in training_example.py), as outlined in the second pane in the figure below.

Screenshot from 2025-02-10 11-22-11


Cancer segmentation on medical image experiments

Our method outperforms other common methods, setting a new state-of-the-art in the tested cancer localisation tasks:

Method Pre-Train Labels Prostate Liver Pancreas Colon Kidney
System II Yes 8 62.4±5.1 85.3±8.6 60.9±9.7 65.1±10.7 75.6±6.3
System II Yes 12 63.0±4.8 83.9±10.2 63.6±8.9 63.3±10.2 74.9±5.9
System II Yes 16 62.8±5.8 84.3±9.1 66.2±9.4 64.7±11.0 74.1±6.2
nnUNet [Isensee 2021; Yan 2024] No >100 42.3±5.6 - - - -
CLIP [Liu 2023] Yes >100 - 79.4±8.1 62.3±9.8 63.1±10.6 -
AutoSeg [Myronenko 2023] No >100 - - - - 76.4±5.5

It also demonstrates the salient features of System II congition in humans, when evaluated for the challenging task of cancer segmentation on medical images, which can enable non-invasive cancer diagnoses but often requires extensive expertise and is plagued by limited data availability:

Screenshot from 2025-02-10 13-05-30

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages