eComparatio: text diff and support for digital edition
JavaScript CSS HTML
dumpsNEW
exampledumps
manual
.gitignore
CHANGELOG.md
GentiumPlus-R.ttf
LICENSE
README.md
ed.css
ed.js
index.html
sameED9.js
sameED9worker.js
screen1.png
screen2.png
screen3.png
screen4.png
screen5.png
screen6.png

README.md

eComparatio

The main objective of the eComparatio project was to develop software that aligns the results of a text diff analysis very accurately in a synoptic representation. It turned out that most text diff algorithms do not produce results strong enough to serve as a starting point for a digital edition. We wanted to implement software that lets the user focus on the text and be aware of differences and their clustering in comparison to the base text. It should also offer a traditional representation similar to the one found in scientific editions, i.e. an apparatus of variants (but with the space and interactive benefits of the screen), and a synoptic representation that directly juxtaposes the compared lines.

Once we had built a comparison program (text diff, differential text analysis, etc.) able to supply the needed results, it became clear that comparing is a much harder task than one would expect. We started adding classes of differences that the program indicates. These classes are: “ganzer Unterschied” (total difference), “Unterschied d. Gr.- und Kleinschreibung” (capitalization), “Unterschied d. diakritischen Zeichen” (diacritics), “Ligatur-Unterschied” (ligatures), “Umbruch-Unterschied” (line break), “Unterschied d. Interpunktion” (punctuation), “Mehr als im anderen Text” (more than in the compared text), “Weniger als im anderen Text” (less than in the compared text), “Klammerung unterschiedlich” (brackets), “lateinisches U und V” (Latin letters u and v), “Vertauschung” (permutation), “Verdrehung von Passagen” (reordering of words within a given passage), “Mehr im Referenztext” (more words in the reference text), “einzelner Buchstabe” (single letter), “wenige Buchstaben” (a few letters), “Trennung” (word wrap).
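A few of the difference classes listed above can be illustrated with a small sketch. It is not taken from ed.js or sameED9.js: the function names (`classifyDifference`, `stripDiacritics`, `stripPunctuation`, `unifyUV`) and the order of the checks are assumptions made for illustration, and only four of the classes are covered. The idea is to normalize both words in one respect at a time and to report the first normalization under which they become equal.

```javascript
// Illustrative sketch only; all names here are hypothetical and
// do not come from the eComparatio sources.

// Decompose to NFD, then drop combining marks (Unicode category M).
function stripDiacritics(s) {
  return s.normalize("NFD").replace(/\p{M}/gu, "");
}

// Drop punctuation characters (Unicode category P, which also
// includes brackets).
function stripPunctuation(s) {
  return s.replace(/\p{P}/gu, "");
}

// Treat the Latin letters u and v as the same letter.
function unifyUV(s) {
  return s.replace(/v/g, "u").replace(/V/g, "U");
}

// Each entry pairs a class label with the normalization that
// makes a difference of that class disappear.
const CHECKS = [
  ["capitalization", (s) => s.toLowerCase()],
  ["diacritics", stripDiacritics],
  ["punctuation", stripPunctuation],
  ["Latin u and v", unifyUV],
];

// Return the first class whose normalization makes a and b equal,
// or "total difference" if none does.
function classifyDifference(a, b) {
  if (a === b) return "identical";
  for (const [label, normalize] of CHECKS) {
    if (normalize(a) === normalize(b)) return label;
  }
  return "total difference";
}
```

With these assumptions, `classifyDifference("uirum", "virum")` reports a u/v difference and `classifyDifference("arma", "virum")` falls through to a total difference. Words that differ in more than one respect at once (for example capitalization and diacritics together) are reported as a total difference by this sketch, which hints at why a real classifier needs more work than a single pass.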
To evaluate all these classes and to generate all the needed results, we had to give up most of the speed that the efficient algorithms offer. We also had to evaluate possible errors in order to align the most probable text positions, which means we added an optimization-like part to the software. To integrate well with other projects, we decided to provide a primary URN output, so that eComparatio results can be referenced, and a CTS URN input mechanism, to fetch text from CTS resources (and to reference them).
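For readers unfamiliar with CTS URNs, the following sketch shows how such an identifier breaks down into its components. It is a hedged illustration and not eComparatio's input mechanism: `parseCtsUrn` is a hypothetical helper, the sample URN is only an example, and subreferences (the `@` syntax) are not handled. It assumes the general shape `urn:cts:namespace:work:passage`, where the work component is dot-separated (text group, work, version, exemplar) and the passage may be a range joined by a hyphen.

```javascript
// Illustrative sketch only; parseCtsUrn is a hypothetical helper,
// not part of the eComparatio sources.
function parseCtsUrn(urn) {
  const parts = urn.split(":");
  const wellFormed =
    (parts.length === 4 || parts.length === 5) &&
    parts[0] === "urn" &&
    parts[1] === "cts";
  if (!wellFormed) {
    throw new Error("Not a CTS URN: " + urn);
  }

  // The passage component is optional.
  const [, , namespace, workComponent, passage = ""] = parts;

  // Work component: textgroup[.work[.version[.exemplar]]]
  const [textgroup, work = null, version = null, exemplar = null] =
    workComponent.split(".");

  // Passage component: a single reference or a start-end range.
  const [start, end = null] = passage === "" ? [null] : passage.split("-");

  return {
    namespace,
    textgroup,
    work,
    version,
    exemplar,
    passage: passage === "" ? null : passage,
    start,
    end,
  };
}

// Example with a sample URN (an assumption, chosen for illustration):
const sample = parseCtsUrn(
  "urn:cts:greekLit:tlg0012.tlg001.perseus-grc1:1.1-1.10"
);
console.log(sample.textgroup, sample.work, sample.start, sample.end);
```

Splitting on `:` first and on `.` only inside the work component keeps hyphens in version names (such as `perseus-grc1`) from being mistaken for passage ranges.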

How to

Use the manual (manual/README.md) or the video tutorials on YouTube.

Use cases and manuals in German: Handbuch

Change Log / Software Version

Consult the change log for the pre-GitHub development. This GitHub repository starts with version 1.0.

Try it out

Try the open example installation (quick test cases included):

http://www.ecomparatio.net/

Tested on

Firefox

Firefox Quantum 57.0 - Win 10 64-Bit (24.11.2017)

Firefox Quantum 66.0.3 - Linux 64-Bit (15.05.2019)

Opera

Opera 46.0 - Linux 64-Bit, Win 10 64-Bit (24.11.2017)

Opera 49 - Win 10 64-Bit (28.11.2017)

Opera 60.0.3255.70 - Linux 64-Bit (15.05.2019)

Chrome / Chromium

Chromium Version 73.0.3683.86 (Official Build) (15.05.2019)

Chrome 62 - Win 10 (27.11.2017)

Chrome Version 74.0.3729.131 (Official Build) (Linux 64-Bit) (15.05.2019)

Edge / IE

Microsoft Edge 40 - Win 10 64-Bit (28.11.2017)

Safari

Safari 10.2 (29.11.2017)

Screenshots

Six screenshots (screen1.png to screen6.png).
