PPReCOGG is a machine learning model for the Per-Pixel Recognition of Cancers using Oriented Gabor filters on the GPU.
More helpfully, PPReCOGG is software that uses texture-based features to classify and differentiate early breast cancer lesions in microscopy images of fluorescently labelled breast tissue, such as tissue obtained from a breast biopsy.
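To give a feel for what "oriented Gabor" texture features are, here is a minimal, illustrative sketch (not PPReCOGG's own code) that builds a per-pixel filter-bank response with scikit-image; the frequencies and orientations are placeholder values:

import numpy as np
from skimage import color, io
from skimage.filters import gabor

# load the image and convert to grayscale (assumes an RGB input image)
img = color.rgb2gray(io.imread("/path/to/unknown/image"))

responses = []
for frequency in (0.1, 0.2, 0.4):                 # illustrative spatial frequencies
    for theta in np.arange(0, np.pi, np.pi / 4):  # 4 orientations: 0, 45, 90, 135 degrees
        real, imag = gabor(img, frequency=frequency, theta=theta)
        responses.append(np.sqrt(real ** 2 + imag ** 2))  # per-pixel magnitude response

# one texture feature vector per pixel: shape (height, width, n_filters)
feature_stack = np.dstack(responses)

Each pixel's feature vector can then be handed to a classifier; PPReCOGG automates this kind of per-pixel feature extraction and classification, moving the heavy filtering onto the GPU.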
This library is almost, but not quite, complete, much like the master's thesis for which it was written.
Documentation is available here.
N.B.: Ease of installation and portability need to be improved. Environments other than Anaconda Python 3 on Windows & Linux are not supported at this time.
PPReCOGG Dependencies:
- h5py (v2.7.0)
- matplotlib (v2.0.2)
- numpy (v1.13.1)
- pillow (v4.2.1)
- pytables (v3.2.2)
- scikit-image (v0.13.0)
- theano (v0.9.0)
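In an Anaconda environment (see the note above), something along these lines should pull in the pinned versions; exact channel availability may vary:

conda install h5py=2.7.0 matplotlib=2.0.2 numpy=1.13.1 pillow=4.2.1 pytables=3.2.2 scikit-image=0.13.0 theano=0.9.0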
PPReCOGG can be used as a Python library or interactively through its CLI. When used as a library:
from pprecogg import gaborExtract, classifyFeatures

# path to the image you wish to classify
unknown_img_path = "/path/to/unknown/image"

# paths to the images whose class you know
adh_img_path = "/path/to/adh/image"
dcis_img_path = "/path/to/dcis/image"

# features are extracted into HDF5 files, and extract_gabor_features
# returns the path to that file
unknown_features_path = gaborExtract.extract_gabor_features(unknown_img_path)
adh_features_path = gaborExtract.extract_gabor_features(adh_img_path)
dcis_features_path = gaborExtract.extract_gabor_features(dcis_img_path)
known_features_paths = [adh_features_path, dcis_features_path]

# classify features from the unknown image.
# returns an array of class names and an array of classified coordinates,
# indexed by class (i.e. aligned with that array of class names)
class_names, classified_coords = classifyFeatures.classify_features(unknown_features_path,
                                                                    known_features_paths)

# we can convert this into a dictionary where the key is the class name
# and the value is the coordinates that belong to it
classified_coords_dict = {class_names[class_num]: class_coords
                          for class_num, class_coords in enumerate(classified_coords)}

# small ergonomic function to plot classified pixels onto the
# unknown image
classifyFeatures.plot_coords(classified_coords_dict,
                             unknown_img_path)
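For a quick sanity check, the per-class pixel counts can be summarised with plain Python (nothing here beyond the dictionary built above):

for class_name, coords in classified_coords_dict.items():
    print("{}: {} pixels classified".format(class_name, len(coords)))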
The simplest way to use PPReCOGG in CLI mode is the full_auto mode.
Step One: Create a configuration file, config.json (the /* ... */ comments below are annotations only and should be omitted from the actual file):
{
    "unknown_image": "/path/to/unknown/image",

    /* optional, for rerunning */
    "unknown_features": "/path/to/unknown/features.h5",

    "known_images": [
        "/path/to/known/image",
        "/path/to/known/image"
    ],

    /* optional, for rerunning */
    "known_features": [
        "/path/to/known/features.h5",
        "/path/to/known/features.h5"
    ],

    /* the smaller, the faster the computation;
       the bigger, the higher the output resolution */
    "resize": 510
}
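One way to produce such a file (plain JSON does not allow comments) is to write it from Python with json.dump; this is a sketch where the keys mirror the example above, the paths are placeholders, and the optional *_features entries are omitted:

import json

config = {
    "unknown_image": "/path/to/unknown/image",
    "known_images": [
        "/path/to/known/image",
        "/path/to/known/image",
    ],
    "resize": 510,
}

# write a comment-free config.json in the current directory
with open("config.json", "w") as f:
    json.dump(config, f, indent=4)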
Step Two: Run PPReCOGG in full_auto mode:
python -m pprecogg full_auto --config_file config.json