Create AEM Pages from a Wikipedia Dump File

Wiki2AEM

This project provides a simple way to create AEM Pages from a Wikipedia XML Dump.

It uses Handlebars templates, so the generated pages are easy to customize. By default, wcm.io Content Pages are created and the Wikipedia article's text is used as rich text.

Find more information on my blog.
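
To illustrate the first step such a tool has to perform, here is a minimal sketch of streaming articles out of a dump with the JDK's StAX parser. It is not the code used by Wiki2AEM. The class name WikiDumpSketch and its Article type are hypothetical, and the sketch assumes the usual MediaWiki export layout, where each page element contains a title and a revision with a text element.

```java
import java.io.Reader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical helper for illustration only, not part of Wiki2AEM.
class WikiDumpSketch {

    /** Title and raw wikitext of one article. */
    static final class Article {
        final String title;
        final String text;

        Article(String title, String text) {
            this.title = title;
            this.text = text;
        }
    }

    /** Streams the dump and collects every page that has both a title and a text. */
    static List<Article> readArticles(Reader in) {
        List<Article> articles = new ArrayList<>();
        try {
            XMLInputFactory factory = XMLInputFactory.newInstance();
            // Dumps do not need DTDs; disabling them avoids external entity lookups.
            factory.setProperty(XMLInputFactory.SUPPORT_DTD, Boolean.FALSE);
            XMLStreamReader xml = factory.createXMLStreamReader(in);
            String title = null;
            String text = null;
            while (xml.hasNext()) {
                int event = xml.next();
                if (event == XMLStreamConstants.START_ELEMENT) {
                    String name = xml.getLocalName();
                    if ("page".equals(name)) {
                        // A new page starts: forget the previous page's values.
                        title = null;
                        text = null;
                    } else if ("title".equals(name)) {
                        title = xml.getElementText();
                    } else if ("text".equals(name)) {
                        text = xml.getElementText();
                    }
                } else if (event == XMLStreamConstants.END_ELEMENT
                        && "page".equals(xml.getLocalName())) {
                    if (title != null && text != null) {
                        articles.add(new Article(title, text));
                    }
                }
            }
            xml.close();
        } catch (XMLStreamException e) {
            throw new IllegalArgumentException("Malformed dump file", e);
        }
        return articles;
    }
}
```

Because the parser streams the file, memory use for parsing stays small, but this sketch still collects all articles in a list. For a full dump it would be better to hand each article to the page writer as soon as its page element ends.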

Usage

Build the project using mvn clean install and run it with the following arguments:

  • input filename
  • output folder

Example

java -jar target\wiki2aem-1.0-SNAPSHOT-jar-with-dependencies.jar wiki_dump_small.xml output

Output

After a successful run, you'll find the created pages in separate folders inside the given output folder. A .content.xml for the root page is created as well. You can now copy this content into your content package and install it in AEM.

Warning

This tool can create a very large number of pages. Make sure your AEM instance can handle that amount of content.
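
To get a rough idea of how much content a run produced before installing it, you could count the folders in the output directory. The following is only a sketch. The class OutputSizeCheck is hypothetical and not part of this project, and it assumes that every directory below the output folder corresponds to one generated page.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.stream.Stream;

// Hypothetical helper for illustration only, not part of Wiki2AEM.
class OutputSizeCheck {

    /** Counts all directories below outputFolder; the folder itself is excluded. */
    static long countPageFolders(Path outputFolder) {
        try (Stream<Path> paths = Files.walk(outputFolder)) {
            return paths.filter(Files::isDirectory)
                        .filter(p -> !p.equals(outputFolder))
                        .count();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Defaults to the "output" folder used in the example above.
        Path output = Paths.get(args.length > 0 ? args[0] : "output");
        System.out.println(countPageFolders(output) + " page folders found in " + output);
    }
}
```

If the number is larger than your instance can comfortably take in one go, consider starting with wiki_dump_small.xml or splitting the content across several packages.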
