Skip to content
High dimensional mean estimation and outlier detection in nearly-linear time.
Branch: master
Clone or download
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
data update README Jun 27, 2019
README.md
ads.py initial commit Jun 20, 2019
baselines.py update README with arxiv link Jun 28, 2019
cifar_corruptor.py initial commit Jun 20, 2019
data.py initial commit Jun 20, 2019
mean.py reorganization and add more to README Jun 27, 2019
part_utils.py
pixel.py
utils.py update README Jun 27, 2019
words.py initial commit Jun 20, 2019

README.md

Fast mean estimation and outlier detection

This repo contains code for our paper Quantum Entropy Scoring for Fast Robust Mean Estimation and Improved Outlier Detection.

Description of select scripts:

  • mean.py contains the backbone of the experimental setup and evaluation.
  • utils.py contains various utilities methods, such as fast JL computation.
  • Auxiliary scripts specific to certain experiments: pixel.py used for running the hot pixels experiments on CIFAR data, words.py used when running word embeddings experiments.

The data directory contains select data for running the experiments. GloVe embeddings can be downloaded here. CIFAR images can be downloaded from here.

The main script mean.py with appropriate arguments. For instance, to run experiments on synthetic data with respect to varying alpha:

python mean.py --experiment_type syn_lamb

And the same on word embeddings data:

python mean.py --experiment_type text_lamb

To run experiments on CIFAR images:

python pixel.py --experiment_type image_lamb

For more available runtime options see:

python mean.py -h

You can’t perform that action at this time.