forked from jianminsun/fMoW-final-code
fMoW/second_place_solution
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
master
Could not load branches
Nothing to show
Could not load tags
Nothing to show
{{ refName }}
default
Name already in use
A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code
-
Clone
Use Git or checkout with SVN using the web URL.
Work fast with our official CLI. Learn more about the CLI.
- Open with GitHub Desktop
- Download ZIP
Sign In Required
Please sign in to use Codespaces.
Launching GitHub Desktop
If nothing happens, download GitHub Desktop and try again.
Launching GitHub Desktop
If nothing happens, download GitHub Desktop and try again.
Launching Xcode
If nothing happens, download Xcode and try again.
Launching Visual Studio Code
Your codespace will open once ready.
There was a problem preparing your codespace, please try again.
This branch is up to date with jianminsun/fMoW-final-code:master.
# fMoW: Functional Map of the World This code was developed by Jianmin Sun. ## Dependencies The following libraries were used for training/testing the deep learning models: Mxnet 0.11.0 Keras 2.0.8 Tensorflow 1.3.0 fMoW/baseline from(https://github.com/fMoW/baseline) ## Dataset The following directory structure was used for training/testing: ``` fmow_dataset/ train/ airport/ airport_0/ airport_0_0_rgb.jpg airport_0_0_rgb.json ... airport_0_5_rgb.jpg airport_0_5_rgb.json ... ... zoo/ zoo_0/ zoo_0_0_rgb.jpg zoo_0_0_rgb.json ... zoo_0_8_rgb.jpg zoo_0_8_rgb.json ... test/ 0000000/ 0000000_0_rgb.jpg 0000000_0_rgb.json ... 0000000_5_rgb.jpg 0000000_5_rgb.json ... ``` ## Results Format This code will output txt files in the format required by Topcoder, where each line contains comma-separated values of the bounding box ID and a string containing the category. ## Running the Code To first prepare the dataset for training and testing, prepare the RGB-only version of the dataset in ./data: ``` docker build -t fmow . nvidia-docker run -v ./data/:/data -it fmow ``` for training (whole processes will take over 100 hours on g3.16xlarge) baseline each epoch takes about 7 hours, total 6x7 hours needed three mxnet models with eight epoch and one with six epoch mxnet model each epoch takes about 2.2 hours, total 2.2x8x4 hours needed bash train.sh /data/train bash test.sh /data/train /data/test /work/out.txt model Our best performing model is the CNN with meta data approach, which sums predictions over each temporal view and then takes an argmax. ## License The license is Apache 2.0. See LICENSE.
About
No description, website, or topics provided.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- Python 98.3%
- Shell 1.7%