Skip to content

Implementation of Vision Based Page Segmentation algorithm in Java

License

Notifications You must be signed in to change notification settings

tpopela/vips_java

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

vips_java

Implementation of Vision Based Page Segmentation algorithm in Java.

The implementation utilizes CSSBox (X)HTML/CSS rendering engine written by Radek Burget.

Description of VIPS and my implementation in my master's thesis (in Czech)

http://www.fit.vutbr.cz/study/DP/DP.php?id=14163&file=t

Original work by Microsoft

http://www.cad.zju.edu.cn/home/dengcai/VIPS/VIPS_July-2004.pdf

CSSBox

http://cssbox.sourceforge.net

Compilation

This project uses Apache Maven. Compile it by running mvn package and run java -cp target/vips-java-*jar-with-dependencies.jar org.fit.vips.VipsTester to start the VipsTester.

Usage

Just pass the URL of web page you want to analyze as argument to VipsTester class.

Preferences of implementation can be changed also there.

About

Implementation of Vision Based Page Segmentation algorithm in Java

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages