- Added clues from puzzles published since previous release date.
- Consolidated
html
,json
andpuz
tables into a singleraw
table (with acontent_type
column). This is in preparation for PDF ingestion.
- Added clues scraped from blog posts published from previous release date.
- Migrated domain from
cryptics.eigenfoo.xyz
tocryptics.georgeho.org
. All URLs should automatically redirect.
- Updated documentation to reflect new domain and manual update schedule (from
v2022.01.16
).
- Removed daily "update-deploy" job (from
v2021.11.07
). The dataset will once again be updated only manually by me.
- Updated contact email.
- Added
robots.txt
at https://cryptics.eigenfoo.xyz/robots.txt - Added
sitemap.xml
at https://cryptics.eigenfoo.xyz/sitemap.xml
- Added a daily "update-deploy" job to scrape new blog posts, parse their clues and publish the latest dataset.
- Added one-off CSV dump of
cru_cryptics
three PDF puzzles, courtesy of Michael F. Gill. - Added clues scraped from blog posts published from previous release date.
- Fixed
raw_definitions
in several parsing functions; re-ran on unparsed blog posts.
- Changed source of Cru Cryptics from The New York Times' archive page to William I. Johnston's archive. Thanks to Michael F. Gill for pointing this out!
- Remove erroneous
puzzle_date
s from clues parsed from.puz
files. - Added clues scraped from blog posts published from previous release date.
- Fixed some minor bugs with
review.py
- Added
indicators
,indicators_by_clue
,charades
andcharades_by_clue
tables. All indicators and charades are identified via regexes.
- Renamed database from
clues
todata
. Added appropriate URL redirects.
- Manually corrected several hundred rows.
- Added Clicky analytics to Datasette webpages.
- Improved function docstrings.
- Refactored
main.py
to both populate database with new blog posts and also parse new blog posts.
- Added Michael F. Gill's digital archive of The New York Times cryptic crosswords.
- Added National Post Cryptic Crossword Archive for Cox and Rathvon (a.k.a. Hex) cryptic crosswords.
- Allowed caching of raw
.puz
files. - Added
source
column, droppedpuzzle_url
columns. - Added facet over
source
column.
- Added The Hindu Crossword Corner as a scraped blog, which adds approximately 67,000 clues.
- Deleted duplicate rows in the
clues
table. - Added clues scraped from blog posts published from previous release date.
- Licensed database under the Open Database License (ODbL) and contents under the Database Contents License (DCL).
- Added
metadata
table withlicense
andlast_built
datetime. - Added
source_url
column to theclues
table.
- Added Clicky analytics to documentation webpages.
- Acknowledged Blue Oak Model
License from
pandoc-markdown-css-theme
, and license remaining code under the MIT License.
Initial release on https://cryptics.eigenfoo.xyz/.