
Toy example for 'Adversarial Image-to-Frequency Transform'

Toy example for the 'Adversarially Learnt Image to Frequency Transform Network (AIFT)' from the paper "Unsupervised Pixel-level Road Defect Detection via Adversarial Image-to-Frequency Transform" (implemented with Python 3.6 and PyTorch).

  • The paper has been submitted to IV2020 and is under review.
  • For implementation efficiency, the generation model is implemented as two separate models: an image-to-frequency generator and a frequency-to-image generator (see the sketch after this list).
  • This source code is a toy example for AIFT and does not include the evaluation code for the experiments in the paper. Contact: [jm.andrew.yu@gmail.com]. Any questions or discussions are welcome!
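
A minimal sketch of what the two-generator setup described above could look like; the class and variable names below are illustrative and do not correspond to the actual definitions in model/aiftn.py:

# Illustrative sketch only: the actual classes live in model/aiftn.py and may differ.
import torch
import torch.nn as nn

class ToyGenerator(nn.Module):
    """Small conv encoder-decoder used here for both transform directions."""
    def __init__(self, in_ch=1, out_ch=1):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 32, 3, padding=1),    nn.ReLU(inplace=True),
            nn.Conv2d(32, out_ch, 3, padding=1),
        )

    def forward(self, x):
        return self.net(x)

# Two separate generators, as described above (hypothetical instantiation).
image_to_frequency = ToyGenerator()    # image domain -> frequency domain
frequency_to_image = ToyGenerator()    # frequency domain -> image domain

x = torch.randn(4, 1, 64, 64)          # dummy grayscale image batch
f_hat = image_to_frequency(x)          # predicted frequency representation
x_hat = frequency_to_image(f_hat)      # cycled back to the image domain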

If you use this source code, please cite the paper as follows.

@misc{yu2020unsupervised,
    title={Unsupervised Pixel-level Road Defect Detection via Adversarial Image-to-Frequency Transform},
    author={Jongmin Yu and Duyong Kim and Younkwon Lee and Moongu Jeon},
    year={2020},
    eprint={2001.11175},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Abstract

In the past few years, the performance of road defect detection has improved remarkably thanks to advances in computer vision and deep learning. Although large-scale and well-annotated datasets enhance the performance of detecting road pavement defects to some extent, it is still challenging to derive a model that performs reliably for various road conditions in practice, because it is intractable to construct a dataset covering diverse road conditions and defect patterns. To this end, we propose an unsupervised approach to detecting road defects, using an Adversarial Image-to-Frequency Transform (AIFT). AIFT adopts an unsupervised manner and adversarial learning in deriving the defect detection model, so AIFT does not need annotations for road pavement defects. We evaluate the efficiency of AIFT using the GAPs384, Cracktree200, CRACK500, and CFD datasets. The experimental results demonstrate that the proposed approach detects various road defects and outperforms existing state-of-the-art approaches.
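
For intuition only, the frequency-domain counterpart of an image can be produced with a 2-D Fourier transform. The sketch below (assuming a recent PyTorch that provides the torch.fft module, which is newer than the Python 3.6 setup used by this repo) computes a shifted log-magnitude spectrum; the transform actually learnt by AIFT is defined in the paper and may differ:

import torch

def to_frequency(image: torch.Tensor) -> torch.Tensor:
    """Return a shifted log-magnitude spectrum of a (B, C, H, W) image batch.

    Only an illustration of the image-to-frequency idea; the frequency
    representation used in the paper/repo may differ.
    """
    spectrum = torch.fft.fft2(image)                        # complex 2-D spectrum
    spectrum = torch.fft.fftshift(spectrum, dim=(-2, -1))   # move DC to the centre
    return torch.log1p(spectrum.abs())                      # compress dynamic range

images = torch.rand(2, 1, 64, 64)                           # dummy grayscale batch
freq = to_frequency(images)
print(freq.shape)                                           # torch.Size([2, 1, 64, 64])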

File configuration

.
├── frequency_discriminator.pkl
├── image_discriminator.pkl
├── inception_score_graph.txt
├── logs
│   └── events.out.tfevents.1578130324.neumann-System-Product-Name
├── main.py
├── model
│   ├── aiftn.py
│   └── __pycache__
│       └── aiftn.cpython-36.pyc
├── negative_generator.pkl
├── positive_generator.pkl
├── README.md
├── src
│   ├── config.py
│   ├── dataset.py
│   ├── __init__.py
│   ├── __pycache__
│   │   ├── config.cpython-36.pyc
│   │   ├── dataset.cpython-36.pyc
│   │   ├── __init__.cpython-36.pyc
│   │   ├── tensorboard_logger.cpython-36.pyc
│   │   └── utils.cpython-36.pyc
│   ├── tensorboard_logger.py
│   └── utils.py
├── tb_log.txt
├── test.png
├── _t_main.py
└── training_result_vis

How to train

python main.py
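
The pre-trained checkpoints listed in the file configuration (positive_generator.pkl, negative_generator.pkl, and the two discriminators) can in principle be reloaded for inference. The sketch below is only a guess at that workflow and assumes nothing about how main.py actually saves them:

import torch

# Illustrative only: how main.py serialises these checkpoints is repo-specific,
# so adjust the loading code to match (pickled module vs. state_dict).
checkpoint = torch.load("positive_generator.pkl", map_location="cpu")

if isinstance(checkpoint, torch.nn.Module):
    generator = checkpoint.eval()              # the whole module was pickled
    with torch.no_grad():
        dummy = torch.rand(1, 1, 64, 64)       # placeholder input; real size may differ
        output = generator(dummy)
else:
    # Likely a state_dict: first instantiate the generator class defined in
    # model/aiftn.py, then call generator.load_state_dict(checkpoint).
    print("state_dict checkpoint: build the generator from model/aiftn.py first")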

Curves of the cost functions of AIFT.

(Losses for the discriminator, the generator for the positive phase, and the generator for the negative phase)
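
As a rough guide to what these three curves track, the sketch below shows conventional GAN-style objectives for the discriminator and the two generators; the exact losses used by AIFT are defined in the paper and in model/aiftn.py and may differ:

import torch
import torch.nn.functional as F

def discriminator_loss(d_real, d_fake):
    """Real samples pushed towards 1, generated samples towards 0."""
    real = F.binary_cross_entropy_with_logits(d_real, torch.ones_like(d_real))
    fake = F.binary_cross_entropy_with_logits(d_fake, torch.zeros_like(d_fake))
    return real + fake

def generator_loss(d_fake):
    """Shared form for the positive-phase (image -> frequency) and
    negative-phase (frequency -> image) generators."""
    return F.binary_cross_entropy_with_logits(d_fake, torch.ones_like(d_fake))

d_real, d_fake = torch.randn(8, 1), torch.randn(8, 1)   # dummy discriminator logits
print(discriminator_loss(d_real, d_fake).item(), generator_loss(d_fake).item())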

Given and transformed samples (40,000 iterations)

Given image and frequency samples.

Transformed image and frequency samples.
