
Added input channel pre-processing to enable vision models to function on MNIST #64

Closed
wants to merge 6 commits

Conversation

@mau-mar (Contributor) commented Mar 17, 2024

I'm Mauro Marino from Group 0 (Mauro Marino, William Powell), working on Project 1 (TensorRT integration in MASE).

An issue was opened reporting errors when processing the MNIST dataset with vision models. This is traceable to MNIST being a grayscale dataset, which provides only one input channel, while MASE's convolutional models expect 3 channels.

The proposed fix applies a series of checks to ascertain whether the model-dataset combination requires intervention (some feedforward neural network models can run on MNIST without further action). If needed, it overrides the model architecture when using MNIST by prepending a single Conv2d, mapping the single input channel to 3 output channels, before the first convolutional layer.
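As a rough sketch of the idea (the helper name and structure below are hypothetical illustrations, not the PR's actual code), the fix amounts to prepending a 1-to-3-channel Conv2d in front of the model:

```python
import torch
import torch.nn as nn

def adapt_grayscale_input(model: nn.Module, expected_channels: int = 3) -> nn.Module:
    """Hypothetical helper: wrap a vision model that expects 3-channel input
    so it accepts 1-channel (grayscale) inputs such as MNIST batches."""
    # A 1x1 convolution learns a per-pixel mapping from the single grayscale
    # channel to the number of channels the model's first conv layer expects.
    adapter = nn.Conv2d(in_channels=1, out_channels=expected_channels, kernel_size=1)
    return nn.Sequential(adapter, model)

# Usage sketch: an MNIST batch has shape (N, 1, 28, 28); the wrapped model accepts it.
# wrapped = adapt_grayscale_input(some_cnn_expecting_rgb)
# logits = wrapped(torch.randn(8, 1, 28, 28))
```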

bakhtiarZ pushed a commit to bakhtiarZ/mase that referenced this pull request Mar 18, 2024
* print args in cli

* print args, remove redundant args

* rename quantize *_bits to *_width, *_fraction_bits to *_frac

* add MaseTracer and mark_as_leaf

* replace "bits" with "width" when specifying bit width

* add integer matmul

* create MaseTracer

* only sync_dist on epoch end

* on_epoch=False for wrapper training step

* add new modifier and tracer. support custom func/module as leaf node

* remove old modifier

* supported save_name specified by users

* remove redundant comments

* get_dummy_inputs for bert-base-uncased and roberta-base

* save modified as pickle

* fixed bugs in MaseTracer

* OPTAttention mode 1 and 3 work

* traceable OPTDecoderLayer

* add func get_patched_nlp_model, which works in a way similar to get_nlp_model

* more facebook/opt models supported, not tested yet

* remove "mase_output" dir and use args.save instead

* fixed bugs in modifier, new get_dummy_inputs

* support training quantized facebook/opt

* use a unified cache dir

now we have
software/
  |-- cache
       |-- model_cache_dir
       |-- dataset_cache_dir
       |-- tokenizer_cache_dir

* update README.md and gitignore

* use user's output dir

* agreed --project_dir and --project for saving generated files

KelseyJing pushed a commit to KelseyJing/mase that referenced this pull request Mar 18, 2024
@jianyicheng (Collaborator)

@mau-mar

Hi, we found a bug in the YAML files that stops CI from running on forked repos. It has now been fixed and merged into the upstream.
Could you merge the upstream's main branch into this PR? That should trigger the CI properly.

Thanks,
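For reference, one standard way to do this (the remote and branch names below are assumptions, not taken from the PR):

```sh
# Add the upstream repository once, then merge its main branch into the PR branch.
git remote add upstream https://github.com/DeepWok/mase.git   # assumed upstream URL
git fetch upstream
git checkout <pr-branch>        # the branch this PR is opened from
git merge upstream/main
git push origin <pr-branch>     # pushing should re-trigger CI on the PR
```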

removed docker credentials (from upstream/main)
@mau-mar (Contributor, Author) commented Mar 26, 2024

@jianyicheng

Hi, I just merged upstream changes into the PR branch. Does it work now?

@jianyicheng deleted the branch DeepWok:x April 3, 2024 15:40
@jianyicheng closed this Apr 3, 2024