Skip to content

photomedia/EPrintsData2GML

master
Switch branches/tags
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
TMP
 
 
XML
 
 
 
 
 
 
 
 
 
 
 
 

EPrintsData2GML

Apache Pig Latin Script to Convert EPrints XML to Graph GML files and geocoded CSV files

eartexte-convert.pig is the main Pig Latin script that converts EPrints XML data from e-artexte (http://e-artexte.ca)

About running Pig scripts:

https://pig.apache.org/docs/r0.7.0/setup.html

Convert data using Pig: Generate graph files (GML) and edge files (CSV):

pig -x local -param datafile="XML/data_humanist_photography.xml" eartexte-convert.pig

Visualization layout with Gephi

Gephi: https://gephi.org/

To increase the memory available for Gephi, see: https://gephi.org/users/install/#memory

File -> Open -> Select GML file

Statistics -> Run (Network Diameter) -> Select Undirected, Normalize Centralities in [0,1]

Statistics -> Run (Modularity)

Layout – Force Atlas 2 -> scaling (12, depending on size of network)

Appearance – Nodes -> Size -> Ranking -> Betweenness Centrality (5-20) on a spline

Appearance – Nodes -> Partition > Modularity Class

Optional Filters:

•	Filters - > Topology > Giant Component

•	Filters -> Topology -> Degree Range

•	Filters –> Attributes -> Range -> Betweenness Centrality

•	Filters -> Edges -> Edge Weight

Export -> Sigma.js Template -> fill in

Sigma Exporter https://github.com/oxfordinternetinstitute/gephi-plugins/tree/sigmaexporter-plugin

Visualization using Cytoscape

Cytoscape: http://www.cytoscape.org/

Import -> Network file > [choose CSV file from the /OUTPUT/EDGELISTS]

Demo visualizations:

http://photomedia.ca/visualizations/artexte/

About the author:

Tomasz Neugebauer (tomasz.neugebauer@concordia.ca) Digital Projects & Systems Development Librarian at Concordia University in Montreal

License:

MIT License

About

Apache Pig Latin Script to Convert EPrints XML to Graph GML files and geocoded CSV files

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published