lutece-platform/lutece-tech-plugin-parser

lutece-tech-plugin-parser

This plugin exposes a service for parsing document streams of various types (PDF, Office, OpenDocument, OOXML, pkg...). It relies on the Apache Tika and Apache PDFBox projects.

Example:

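// Look up the parser factory bean from the Spring context.
IParserFactory factoryParser = SpringContextService.getBean( IParserFactory.BEAN_FILE_PARSER_FACTORY );
// Open the source file and close the stream once parsing is done.
try ( InputStream inputStream = new FileInputStream( new File( sourcePdfPath ) ) )
{
    // getParser returns an Optional: fail explicitly instead of calling get() unchecked.
    String result = factoryParser.getParser( "application/pdf" )
            .orElseThrow( ( ) -> new IllegalStateException( "No parser available for application/pdf" ) )
            .parse( inputStream );
}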
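
The lookup pattern used in the example (ask a factory for a parser by MIME type, get an Optional back) can be illustrated with a small standalone sketch. This is not the plugin's implementation: the names `ParserFactorySketch` and `TextParser` are hypothetical, the registered parser is a plain-text stand-in rather than a Tika or PDFBox parser, and the assumption that an unknown MIME type yields an empty Optional is an illustration choice, not something the plugin documents here.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical names throughout: this sketch does not reproduce the plugin's API.
public class ParserFactorySketch
{
    /** Hypothetical parser contract: turn a stream into text. */
    interface TextParser
    {
        String parse( InputStream in ) throws IOException;
    }

    // Registered parsers, keyed by MIME type.
    private final Map<String, TextParser> _parsers = new HashMap<>( );

    void register( String mimeType, TextParser parser )
    {
        _parsers.put( mimeType, parser );
    }

    /** Returns an empty Optional when no parser is registered for the MIME type (assumed behavior). */
    Optional<TextParser> getParser( String mimeType )
    {
        return Optional.ofNullable( _parsers.get( mimeType ) );
    }

    public static void main( String[] args ) throws IOException
    {
        ParserFactorySketch factory = new ParserFactorySketch( );
        // Stand-in parser for plain text; a real one would delegate to Tika or PDFBox.
        factory.register( "text/plain",
                stream -> new String( stream.readAllBytes( ), StandardCharsets.UTF_8 ) );

        try ( InputStream in = new ByteArrayInputStream( "hello".getBytes( StandardCharsets.UTF_8 ) ) )
        {
            String text = factory.getParser( "text/plain" )
                    .orElseThrow( ( ) -> new IllegalStateException( "no parser registered" ) )
                    .parse( in );
            System.out.println( text );
        }

        // An unregistered MIME type yields an empty Optional in this sketch.
        System.out.println( factory.getParser( "application/x-unknown" ).isPresent( ) );
    }
}
```

Returning an Optional from the factory makes the "no parser for this type" case explicit at the call site, which is why the caller should handle the empty case rather than call `get()` directly.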
