umdfaces2VOC2007

Mattzheng 注明

forked from luuuyi，其中test.py内容需要进行修正。

新增了图像另存到文件夹:JPEGImages的功能：

cv2.imwrite(IMGSTORE+jpg_file,img,[int(cv2.IMWRITE_JPEG_QUALITY),95])

同时，原作者本来base=3000000这里，如果第二个文件夹开跑就需要另外的起点base，需要注意。
新增了ImageSets.py专门为第三个文件夹准备

以上的修改，已经完全覆盖了caffe/SSD初期数据准备，详情可见我的博客：http://blog.csdn.net/sinat_26917383/article/details/68068113

实际转换过程中出现的问题：

umdfaces_batch1文件中，好像很多图像都是没有的。其他两个umdfaces_batch2、umdfaces_batch3也有较少的图像没有。大概最后我实践的图像集有22W+的图像。 80%训练集，20%测试集。

整个方法+数据集的缺陷：一个图只标注了一个物体;图像较多的是写真（非真实环境）如果图像中有多个人脸，那么只默认标注概率最高的一个

Mattzheng 2017-5-4

原作者的描述Luuuyi

将umdfaces数据库做成VOC2007格式，方便深度学习训练，umdfaces数据库的下载地址为：umdfaces

项目一共包含三个脚本文件：

tool_csv.py: 封装了csv模块读取文件函数
tool_lxml.py: 封装了lxml模块用于生成VOC2007格式的xml文档的函数
test.py: 主函数，用于读取文件路径做初始化

Dependencies

Python2.7或者更高版本
python lxml，numpy，cv2

Tips

在安装python依赖模块的时候，国内的小伙伴可以使用豆瓣的镜像通过pip下载(root用户)：

cd ~ && mkdir .pip && cd .pip
vim pip.conf

在其中加上：

[global]
index-url = http://pypi.douban.com/simple/
trusted-host = pypi.douban.com

之后就可以愉快的下载python模块了

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
JPEGImages		JPEGImages
Annotations_and_JPEGImages.py		Annotations_and_JPEGImages.py
ImageSets.py		ImageSets.py
README.md		README.md
tool_csv.py		tool_csv.py
tool_lxml.py		tool_lxml.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JPEGImages

JPEGImages

Annotations_and_JPEGImages.py

Annotations_and_JPEGImages.py

ImageSets.py

ImageSets.py

README.md

README.md

tool_csv.py

tool_csv.py

tool_lxml.py

tool_lxml.py

Repository files navigation

umdfaces2VOC2007

Mattzheng 注明

原作者的描述Luuuyi

Dependencies

Tips

About

Releases

Packages

Languages

mattzheng/umdfaces2VOC2007

Folders and files

Latest commit

History

Repository files navigation

umdfaces2VOC2007

Mattzheng 注明

原作者的描述Luuuyi

Dependencies

Tips

About

Resources

Stars

Watchers

Forks

Languages