Start developing Debate XML import/export #1076

tienne-B · 2019-04-24T20:43:49Z

This commit starts creating an interface for the import/export and implementation of the Debate XML specification. Here, a side-wide view for importing an XML file is created, with a consumer link for asynchronous processing.

More to come

tabbycat/importer/templates/archive_importer.html

This commit starts creating an interface for the import/export and implementation of the Debate XML specification. Here, a side-wide view for importing an XML file is created, with a consumer link for async processing.

This commit adds a tournament url for tournament export with a stub page.

tienne-B · 2019-05-11T02:20:03Z

I'm a bit concerned I'm putting the cart before the horse here, as the schema has not been reviewed or discussed: TabbycatDebate/DebateXML#4. @czlee, @philipbelesky: Mind looking it over (even just cursorily)? I should get back to this in a week or so.

philipbelesky · 2019-05-13T05:12:42Z

Happy to look over it, although might need to wait a few days.

philipbelesky · 2019-05-16T09:53:58Z

I've mostly commented over in the DTA repo, but just was curious about the desire to write an importer and whether you were planning to do that before/after/simultaneous with the exporter? It's certainly an interesting idea. Initially I pictured the point of DTA as enabling much more of an archival-storage/ data-vis purpose, but I guess having a Tabbycat importer means that even if the spec doesn't take off more broadly we would have a robust way of mitigating against data loss and/or combining tournaments into a single instance.

tienne-B · 2019-05-16T11:04:44Z

An importer would be useful for me as a way to collate various tournaments previously on individual sites to a central site, of which is needed by SUCDI for end-of-year prizes. Having the importer would also be useful as TC already has some data visualisations and analyses (and result views) that would make the archive format more accessible.

I am creating the importer simultaneously with the exporter as I think having the importer gives a visible meaning for the exporter (although that isn't the main point); aiding with adoption.

This commit adds an exporter for single tournaments into an XML format as specified in dta-spec.

tienne-B · 2019-05-22T22:45:43Z

So exporting seems to be working well but with N+1 SQL inefficiencies that I'm finding hard to resolve in the .add_rounds() method. What prefetches on round_set should I do so that DebateResult does not fetch from the database for each debate?

In order to expose the debug toolbar to view the SQL queries, I add a typo in another method.

philipbelesky · 2019-05-26T04:41:02Z

Hmm, I think to avoid DebateResult set being fetched the debates need to be gathered without any reference to their current status (and obviously the winners involved). Are the N+1 issues severe, or just inelegant? Given the scope of the exporter I wonder if it might be easier to accept some of them.

tienne-B · 2019-05-26T04:50:11Z

Sorry, I don't understand on the points of "fetching without any reference of current status", nor what would be the threshold between inelegance and being severe. On the first, is it an inability to prefetch a table in various ways (e.g. from speakerscore: debateteam__team__speaker and speaker)?

A new page is added detailing the XML format and how to import/export it to/from Tabbycat. A distinction has been made between it and database backups.

Also separate methods for adding debates, add gender to people, and removing team short name.

This commit starts an importer for Debate XML (The "Agora Project") with support for all that can be exported by Tabbycat. Some considerations are marked in the documentation page about this format.

tienne-B · 2019-08-31T03:34:14Z

@czlee: From #1199, should .bulk_create() be avoided here in favour of one-by-one?

czlee · 2019-09-02T14:49:11Z

Umm… yeah. Unless it's painfully slow, like, more than a minute or so. But otherwise I think I'd rather display a progress bar and have it more robust to future changes in .save() functions. Otherwise we'd have no choice but to violate DRY (or promise never to use .save(), pre_save and post_save).

I'm open to discussion on this, though.

tienne-B · 2019-09-02T19:44:43Z

Understood. I'll prepare a patch for performance comparison (after #1180 as it would improve result importation). Having a progress bar would be quite difficult as import does not use consumers.

czlee · 2019-09-03T08:33:46Z

Hmm, should the import use consumers? Normally this is best practice for anything that takes a nontrivial amount of time?

(It's fine if you don't want to do this for a first implementation! Just thinking out loud.)

Some methods have also been added to DebateResult classes for the change.

tienne-B · 2019-10-05T03:45:58Z

This PR depends on #1180 for the use of DebateResult for XML. The refactor should be merged before this PR, as merging this will indirectly merge the other.

To be able to use new DebateResult classes in importing/exporting.

This commit replaces the uses of `.bulk_create()`, instead saving each DB object one-by-one. This is to trigger the `.save()` of the models and the triggers. The results importer has not been touched, and will be modified to make use of DebateResult in another commit.

This commit replaces all the logic to determine the result of a debate, such as the margin/splits, etc. with completing a DebateResult object and saving that. This commit refers to methods created as part of TabbycatDebate#1180.

This is necessary as get_winner() has the same effect.

This commit fixes the DebateResult generation as well as some typos and omissions caused from previous changes.

Also rename uses of 'test score'.

As booleans must be either 'true' or 'false' in XML (note the use of lowercase), rather than the capitalized keywords used in Python.

# Conflicts: # config/requirements_core.txt # tabbycat/importer/urls.py # tabbycat/importer/views.py # tabbycat/options/views.py # tabbycat/results/dbutils.py # tabbycat/results/result.py # tabbycat/results/utils.py # tabbycat/urls.py

Even though email addresses and conflicts would not be typically added to an exported XML, the importer should still use them if they exist.

tienne-B commented Apr 26, 2019

View reviewed changes

tabbycat/importer/templates/archive_importer.html Show resolved Hide resolved

Start developing Debate XML import

42bde0b

This commit starts creating an interface for the import/export and implementation of the Debate XML specification. Here, a side-wide view for importing an XML file is created, with a consumer link for async processing.

tienne-B force-pushed the debate-xml branch from e04d944 to 42bde0b Compare April 27, 2019 02:28

tienne-B changed the title ~~Start developing Debate XML import~~ Start developing Debate XML import/export Apr 30, 2019

Start interface for tournament export

e7bfc8f

This commit adds a tournament url for tournament export with a stub page.

tienne-B force-pushed the debate-xml branch 8 times, most recently from 26887db to b632ee9 Compare May 22, 2019 21:35

Create XML exporter for tournaments

469b140

This commit adds an exporter for single tournaments into an XML format as specified in dta-spec.

tienne-B force-pushed the debate-xml branch from b632ee9 to 469b140 Compare May 22, 2019 22:02

tienne-B force-pushed the debate-xml branch from 51ac236 to 2dd5a32 Compare June 2, 2019 00:31

Add documentation about the tournament XML

ace7332

A new page is added detailing the XML format and how to import/export it to/from Tabbycat. A distinction has been made between it and database backups.

tienne-B force-pushed the debate-xml branch 4 times, most recently from 88c173f to 051c100 Compare June 3, 2019 22:21

tienne-B added 2 commits June 3, 2019 20:01

Add speaker/break categories to XML export

3a88f24

Also separate methods for adding debates, add gender to people, and removing team short name.

Start XML import functionality

5302030

This commit starts an importer for Debate XML (The "Agora Project") with support for all that can be exported by Tabbycat. Some considerations are marked in the documentation page about this format.

tienne-B mentioned this pull request Aug 30, 2019

Use bulk creation with importation methods #1199

Open

Use DebateResult with "Latest Results" pane

eaa7dc8

Some methods have also been added to DebateResult classes for the change.

tienne-B added 4 commits October 4, 2019 23:49

Merge branch 'feature/1003-b' into debate-xml

703d6b6

To be able to use new DebateResult classes in importing/exporting.

XML Import: Use DebateResult for results

079d116

This commit replaces all the logic to determine the result of a debate, such as the margin/splits, etc. with completing a DebateResult object and saving that. This commit refers to methods created as part of TabbycatDebate#1180.

Switch isinstance() with object attributes

cdccaf7

tienne-B force-pushed the debate-xml branch from ec446a5 to cdccaf7 Compare October 5, 2019 03:50

tienne-B added 4 commits October 14, 2019 15:45

Remove uses of result.advancing_sides()

2c55544

This is necessary as get_winner() has the same effect.

Various little fixes to Importer

0392825

This commit fixes the DebateResult generation as well as some typos and omissions caused from previous changes.

Merge branch 'develop' into debate-xml

8fd43d7

Also rename uses of 'test score'.

Use booleans in lowercase

314dab5

As booleans must be either 'true' or 'false' in XML (note the use of lowercase), rather than the capitalized keywords used in Python.

philipbelesky force-pushed the develop branch from 2000c80 to 9fa23c9 Compare May 31, 2020 01:34

czlee modified the milestones: Manx, N-Release Jun 13, 2020

tienne-B modified the milestones: Nebelung, O-Release Dec 1, 2020

tienne-B marked this pull request as ready for review December 25, 2020 18:16

tienne-B mentioned this pull request Dec 25, 2020

Deprecate anorak/boots importers in favour of API #1699

Open

tienne-B added 5 commits December 25, 2020 16:11

Merge branch 'develop' into debate-xml

23917e5

# Conflicts: # config/requirements_core.txt # tabbycat/importer/urls.py # tabbycat/importer/views.py # tabbycat/options/views.py # tabbycat/results/dbutils.py # tabbycat/results/result.py # tabbycat/results/utils.py # tabbycat/urls.py

Fix capitalization of booleans

fce4cc0

Import emails/conflicts where provided

4c9eb2e

Even though email addresses and conflicts would not be typically added to an exported XML, the importer should still use them if they exist.

Get M2M creation to ignore empty keys

f391407

Don't mark rounds as completed without results

4215998

tienne-B merged commit 4215998 into TabbycatDebate:develop Jan 13, 2021

tienne-B deleted the debate-xml branch January 13, 2021 21:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Start developing Debate XML import/export #1076

Start developing Debate XML import/export #1076

tienne-B commented Apr 24, 2019

tienne-B commented May 11, 2019

philipbelesky commented May 13, 2019

philipbelesky commented May 16, 2019

tienne-B commented May 16, 2019

tienne-B commented May 22, 2019

philipbelesky commented May 26, 2019

tienne-B commented May 26, 2019

tienne-B commented Aug 31, 2019

czlee commented Sep 2, 2019

tienne-B commented Sep 2, 2019

czlee commented Sep 3, 2019

tienne-B commented Oct 5, 2019

Start developing Debate XML import/export #1076

Start developing Debate XML import/export #1076

Conversation

tienne-B commented Apr 24, 2019

tienne-B commented May 11, 2019

philipbelesky commented May 13, 2019

philipbelesky commented May 16, 2019

tienne-B commented May 16, 2019

tienne-B commented May 22, 2019

philipbelesky commented May 26, 2019

tienne-B commented May 26, 2019

tienne-B commented Aug 31, 2019

czlee commented Sep 2, 2019

tienne-B commented Sep 2, 2019

czlee commented Sep 3, 2019

tienne-B commented Oct 5, 2019