Object Detector and Feature Extractor

emma-heriot-watt/perception

EMMA: Perception

Python 3.9 · PyTorch Lightning · Poetry · pre-commit · style: black · wemake-python-styleguide

Workflows: Continuous Integration · Tests · Build and push images

Important

If you have questions, find bugs, or need anything else, contact us in our organisation's Discussions.

About

This repository holds the object detector and feature extractor for EMMA: the model that takes an image and returns a set of features for that image. It can be used standalone to extract features before running the policy, or as an API to extract features during inference.

Writing code and running things

Run the server for the Alexa Arena

Running this command as-is will automatically download the fine-tuned checkpoint from our HF models repo and use the same settings as our Alexa Arena experiments.

python src/emma_perception/commands/run_server.py
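Once the server is running, you can query it over HTTP. The snippet below is a minimal client sketch; the endpoint path, port, and JSON payload shape are assumptions for illustration, not the repository's actual API — check `run_server.py` for the real host, port, and routes.

```python
# Minimal client sketch. The endpoint path, port, and payload format are
# ASSUMPTIONS for illustration -- check run_server.py for the real API.
import base64


def encode_image(image_bytes: bytes) -> str:
    """Base64-encode raw image bytes so they can travel in a JSON payload."""
    return base64.b64encode(image_bytes).decode("utf-8")


def build_request(
    image_bytes: bytes, endpoint: str = "http://localhost:8000/features"
) -> dict:
    """Build a JSON-serialisable request for the (hypothetical) feature endpoint."""
    return {"url": endpoint, "json": {"image": encode_image(image_bytes)}}
```

With `requests`, you would then send it as `requests.post(req["url"], json=req["json"])` and read the extracted features from the response body.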

Extracting features

For training, we first need to extract the features for each image.

Use the following command to extract features from a folder of images. Adjust the path to the image folder, the output directory, and any other options as needed.

python src/emma_perception/commands/extract_visual_features.py --images_dir <path_to_images> --output_dir <path to output dir>
The command accepts the following argparse arguments:

import argparse

from pytorch_lightning import Trainer


def parse_args() -> argparse.Namespace:
    parser = argparse.ArgumentParser()
    parser = Trainer.add_argparse_args(parser)  # type: ignore[assignment]
    parser.add_argument(
        "-i",
        "--images_dir",
        required=True,
        help="Path to a folder of images to extract features from",
    )
    parser.add_argument(
        "--is_arena",
        action="store_true",
        help="If we are extracting features from the Arena images, use the Arena checkpoint",
    )
    parser.add_argument("-b", "--batch_size", type=int, default=2)
    parser.add_argument("-w", "--num_workers", type=int, default=0)
    parser.add_argument(
        "-c", "--output_dir", default="storage/data/cache", help="Path to store visual features"
    )
    parser.add_argument(
        "--num_gpus",
        type=int,
        default=None,
        help="Number of GPUs to use for visual feature extraction",
    )
    parser.add_argument(
        "opts",
        default=None,
        nargs=argparse.REMAINDER,
        help="Modify config options using the command-line. Used for VinVL extraction",
    )
    return parser.parse_args()
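If you drive extraction from another script (for example, to shard a large dataset), it can help to build the invocation programmatically. The sketch below uses only the flags documented above; the helper function itself is ours for illustration and is not part of this repository.

```python
# Build the extract_visual_features.py invocation from the flags documented
# above. This helper is illustrative and not part of the repository.
from pathlib import Path


def build_extract_command(
    images_dir: Path,
    output_dir: Path,
    batch_size: int = 2,
    num_workers: int = 0,
    is_arena: bool = False,
) -> list[str]:
    """Return the argv list for the feature-extraction command."""
    command = [
        "python",
        "src/emma_perception/commands/extract_visual_features.py",
        "--images_dir",
        str(images_dir),
        "--output_dir",
        str(output_dir),
        "--batch_size",
        str(batch_size),
        "--num_workers",
        str(num_workers),
    ]
    if is_arena:
        command.append("--is_arena")
    return command
```

Pass the resulting list to `subprocess.run(command, check=True)` to launch extraction from Python.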

Extracting features for the Alexa Arena

If you want to extract features with the model we fine-tuned on the Alexa Arena, just add --is_arena to the above command. This will automatically download the fine-tuned checkpoint from our HF models repo and use the same settings as our Alexa Arena experiments.
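For reference, the full command with the flag appended (the placeholder paths stay the same as above):

```
python src/emma_perception/commands/extract_visual_features.py --images_dir <path_to_images> --output_dir <path to output dir> --is_arena
```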

Developer tooling