Files

CYTOSCAPE
GEPHI
OUTPUT
SIGMA
TMP
XML
XSLT
ArtexteInstructions.docx
README.md
SIGMA.zip
convert-data.py
datafu-pig-incubating-1.3.1.jar
eartexte-convert.pig
geocode-generate-fusion-table.py
geocode.py
geteartextedata.py
gml-header-footer.py
pig_util.py

README.md

EPrintsData2GML

Apache Pig Latin Script to Convert EPrints XML to Graph GML files and geocoded CSV files

eartexte-convert.pig is the main Pig Latin script. It converts EPrints XML data from e-artexte (http://e-artexte.ca) into GML graph files and CSV files.

About running Pig scripts:

https://pig.apache.org/docs/r0.7.0/setup.html

Convert data using Pig: Generate graph files (GML) and edge files (CSV):

pig -x local -param datafile="XML/data_humanist_photography.xml" eartexte-convert.pig
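To convert several XML exports in one go, the command above can be wrapped in a small Python helper. This is only a sketch: the names `build_pig_command` and `convert_all` are hypothetical and not part of this repository, and it assumes that `pig` is on the PATH and that each run can safely write into the same output folders. By default it only prints the commands it would run.

```python
import subprocess
from pathlib import Path


def build_pig_command(datafile, script="eartexte-convert.pig"):
    """Return the argument list for one local-mode Pig run (hypothetical helper)."""
    # Passed as a list, so no shell quoting is needed around the datafile value.
    return ["pig", "-x", "local", "-param", f"datafile={datafile}", script]


def convert_all(xml_dir="XML", dry_run=True):
    """Print (dry_run=True) or execute one Pig run per .xml file in xml_dir."""
    commands = [
        build_pig_command(path.as_posix())
        for path in sorted(Path(xml_dir).glob("*.xml"))
    ]
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=True stops at the first failing Pig run.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    convert_all(dry_run=True)
```

Call `convert_all(dry_run=False)` once the printed commands look right.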

Visualization layout with Gephi

Gephi: https://gephi.org/

To increase the memory available for Gephi, see: https://gephi.org/users/install/#memory

File -> Open -> Select GML file

Statistics -> Run (Network Diameter) -> Select Undirected, Normalize Centralities in [0,1]

Statistics -> Run (Modularity)

Layout -> Force Atlas 2 -> set Scaling (for example 12; adjust to the size of the network)

Appearance -> Nodes -> Size -> Ranking -> Betweenness Centrality (min size 5, max size 20), interpolated on a spline

Appearance -> Nodes -> Partition -> Modularity Class

Optional Filters:

•	Filters -> Topology -> Giant Component

•	Filters -> Topology -> Degree Range

•	Filters -> Attributes -> Range -> Betweenness Centrality

•	Filters -> Edges -> Edge Weight
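To make the two topology filters above concrete, here is a minimal sketch of what "Giant Component" and "Degree Range" select, written with the Python standard library only. It is an illustration, not Gephi's implementation: the function names are hypothetical, the graph is treated as undirected, and degree is counted as the number of distinct neighbours.

```python
from collections import defaultdict, deque


def build_adjacency(edges):
    """Build an undirected adjacency map from (source, target) pairs."""
    adj = defaultdict(set)
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    return adj


def giant_component(edges):
    """Return the node set of the largest connected component."""
    adj = build_adjacency(edges)
    seen, best = set(), set()
    for start in adj:
        if start in seen:
            continue
        # Breadth-first search collects everything reachable from start.
        component, queue = {start}, deque([start])
        while queue:
            node = queue.popleft()
            for neighbour in adj[node]:
                if neighbour not in component:
                    component.add(neighbour)
                    queue.append(neighbour)
        seen |= component
        if len(component) > len(best):
            best = component
    return best


def degree_range(edges, low, high):
    """Return the nodes whose number of distinct neighbours is in [low, high]."""
    adj = build_adjacency(edges)
    return {node for node, nbrs in adj.items() if low <= len(nbrs) <= high}


# Made-up example data: one triangle plus one isolated pair.
edges = [("A", "B"), ("B", "C"), ("C", "A"), ("D", "E")]
print(sorted(giant_component(edges)))     # ['A', 'B', 'C']
print(sorted(degree_range(edges, 1, 1)))  # ['D', 'E']
```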

Export -> Sigma.js Template -> fill in the template fields

Sigma Exporter plugin: https://github.com/oxfordinternetinstitute/gephi-plugins/tree/sigmaexporter-plugin
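The node-size ranking step in the Gephi workflow above maps each node's Betweenness Centrality onto a size between 5 and 20. A rough sketch of that mapping follows, using a plain linear interpolation; Gephi's spline option bends this curve, so the numbers it produces will differ. The function name `rank_sizes` and the sample scores are made up for illustration.

```python
def rank_sizes(scores, min_size=5.0, max_size=20.0):
    """Linearly rescale centrality scores to node sizes in [min_size, max_size]."""
    if not scores:
        return {}
    lowest, highest = min(scores.values()), max(scores.values())
    span = highest - lowest
    sizes = {}
    for node, score in scores.items():
        # If every node has the same score, all nodes get min_size.
        fraction = (score - lowest) / span if span else 0.0
        sizes[node] = min_size + (max_size - min_size) * fraction
    return sizes


# Hypothetical normalized betweenness values in [0, 1].
print(rank_sizes({"a": 0.0, "b": 0.5, "c": 1.0}))
# {'a': 5.0, 'b': 12.5, 'c': 20.0}
```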

Visualization using Cytoscape

Cytoscape: http://www.cytoscape.org/

Import -> Network file -> [choose a CSV file from /OUTPUT/EDGELISTS]
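Before importing an edge list into Cytoscape, it can help to check how many nodes and edges the file contains. The sketch below does that with Python's `csv` module. The column layout (source in the first column, target in the second, one header row) is an assumption, not something this README specifies, so adjust `source_col`, `target_col`, and `has_header` to match the actual files. The function name `summarize_edge_list` is hypothetical.

```python
import csv
import io


def summarize_edge_list(handle, source_col=0, target_col=1, has_header=True):
    """Count distinct nodes and edge rows in a CSV edge list."""
    reader = csv.reader(handle)
    if has_header:
        next(reader, None)  # skip the header row if there is one
    nodes, edge_count = set(), 0
    for row in reader:
        if len(row) <= max(source_col, target_col):
            continue  # ignore blank or truncated rows
        nodes.add(row[source_col])
        nodes.add(row[target_col])
        edge_count += 1
    return {"nodes": len(nodes), "edges": edge_count}


# In-memory sample with an assumed source,target,weight layout.
sample = io.StringIO("source,target,weight\nA,B,2\nB,C,1\n")
print(summarize_edge_list(sample))  # {'nodes': 3, 'edges': 2}
```

For a real file, pass `open(path, newline="", encoding="utf-8")` as `handle`.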

Demo visualizations:

http://photomedia.ca/visualizations/artexte/

About the author:

Tomasz Neugebauer (tomasz.neugebauer@concordia.ca), Digital Projects & Systems Development Librarian at Concordia University in Montreal

License:

MIT License
