The Anthology is not properly indexed by major search engines, particularly Google. We have some information and suggestions from our contacts there, which we should implement. This project is also bound up with the static site rewrite which will help address some of these issues.
This project looks both inward and forward: inward, to find and correct existing Anthology errors via manual and automated methods, and forward, to prevent future errors from making their way into the anthology. Errors can be factual mistakes but also formatting issues (such as incorrect month formats or mistakes in title casing in BibTeX). The focus is on the authoritative data (the XML files in the imports/ directory), but extends to derived formats, particularly the BibTeX export.
The Anthology is current built as a ruby-on-rails dynamic site with caching used to help with performance. We would like to rebuild the site as completely statically generated as possible, including all conference, paper, and author pages.