Releases: arquivo/pwa-technologies
Godhelpus release
Main changes:
- Update functional tests
- Deploy of new CitationSaver service (beta version)
- Upgrade to GAnalytics 4
Issues:
#1319 Update sobre.arquivo.pt footer to be equal to www.arquivo.pt
#1310 Memorial landing page title is wrong for archived subpages
#1305 Change arquivo404 arquivo.pt archiveApiUrl
#1304 Migrate to Google Analytics 4 property
#1300 Site recorded with SavePageNow giving error 429 - Too Many Requests
#1295 Segregate internal and external API
#1294 Implement monthly uptime reports for each service
#1293 Review content removal technical process
#1291 Create aliases for the Arquivo.pt Memorial websites
#1290 Replace existing FCT logo on sobre.arquivo.pt
#1289 Replace FCT logo on footer of www.arquivo.pt
#1284 SavePageNow can't record websites whose CA certificate is slightly misconfigured
#1268 Move arquivo.pt/collections short link to sobre.arquivo.pt and replace content with static HTML page
#1267 Review script qa.py on arquivo-operation
#1265 esquerda.net not archiving
#1253 Update PHP sobre.arquivo.pt
#1245 bportugal.pt can not be properly crawl
#1238 Update Arquivo.pt Alarms process to test availability of all collections
#1237 Update Jenkins
#1231 Replace links on home page to short links
#1226 "Complete page" should present URL, date and Back button
#1224 SavePageNow/Complete page does not archive embeded images
#1220 Verify fulltext indexing of the latest AWP collections AWP36, etc.
#1176 Mockups CitationSaver to HTML/CSS
#1175 Create regression tests for Advanced Page Search page
#1174 Create regression tests for Images Search page
#1173 Create regression tests for Advanced Image Search page
#1172 Create regression tests for URL Search page
#1171 Create regression tests for Replay page
#1170 Create regression tests for Pages Search page
Fortuna release
Main changes:
- Deployed new CDXJ indexes after cleaning warc/revisits created by Brozzler that were crashing pyWB due to their high volume
- Fix minor UI issues
- Deployed new broker servers
Closed issues:
#1303 Create template SavePageNow URL no found.
#1301 "Advanced search" label get cropped on the button only on English version
#1292 On replay, the "Table" button sometimes does not work
#1288 "Visit" button is broken
#1285 Some files can only be accessed via noFrame
#1282 URL search Table results should center on available links
#1281 Broker high CPU usage on certain requests
#1280 Create new card on Arquivo.pt home page: Notícias
#1272 Review 429 errors and API thresholds
#1271 Review CryptoCurrency news and publish
#1266 Setup letsencrypt SSL certificates for contamehistorias.pt
#1264 SavePageNow check URL
#1258 No version is being presented on CDX
#1256 Install patiki-client on all servers
#1255 Configure new servers - 105 -> 108
#1254 Block external connections to Wordpress administration console
#1252 Review robots.txt
#1251 Analyze results of security audit performed by RCTS CERT
#1249 Segregate the SavePageNow service from Broker
#1248 Memorial template needs improvement
#1247 Labels in English when we have set the Portuguese language (Footer, Mouseover)
#1246 SavePageNow, problems with the replay of the URL "rnca.fccn.pt"
#1244 Improve Arquivo404 documentation
#1240 Configure the new brokers
#1236 Image Advanced Search should support multiple format search
#1235 Advanced search parameters disappear between Image Advanced Search UI and SERP
#1234 Click on image should show image in original size
#1233 Side-swipe does not move Table of Versions
#1231 Replace links on home page to short links
#1229 Harmonize label for confirmation message of "Copy link"
#1228 Add new links to arquivo404 and Open datasets on www.arquivo.pt and sobre.arquivo.pt footers
#1227 Save Page Now UI should present URL being archived
#1223 Address not found Save Page Now
#1221 Deploy 2020 collections
#1218 Use letsencrypt in Memorial
#1217 Banner and Logo from SavePageNow are not positioned correctly
#1216 Integrate new cards in home page
#1212 Put limits on Qos and set rules on robots.txt (SavePageNow)
#1209 Some URLs not recognized as valid URL on SavePageNow
#1206 Review 404 error page on Apache httpd
#1201 www.11cnef.pt was not crawled or cannot be replayed
#1195 Encoding URL on replay
#1190 Review advanced search labels
#1189 All blocked content should be monitored by Icinga
#1188 Page search Index small collections
#1187 Assess the impact of Google takeout as alternative to Backup and Sync
#1185 Create a link on sobre.arquivo.pt with all information about SavePageNow
#1183 Create new cards for homepage: Arquivo.pt Awards
#1182 Create new cards for homepage: Memorial
#1180 Create new cards for homepage: SavePageNow
#1179 Create new cards for homepage: Sugira site
#1166 Message in English presents date in Portuguese
#1164 Create new roll-up for Arquivo.pt Award
#1146 Check interconnection between logs and front-end
#1145 Save Page Now save videos?
#1135 "Last updated on XX" needs to go to the bottom of the page (sobre.arquivo.pt)
#1134 recomendacoes shortlink on footer is broken
#1133 search box and button aren't completely visible after searching on sobre.arquivo.pt
#1132 Back Button does not work (Image View)
#1125 Duplicate entries in cdxj (Save Page Now)
#1119 Problem replaying images
#1117 Replace footer on Wordpress with new one
#1112 Problems collecting the response from POST URLs
#1111 Page Advanced Search should support multiple format search
#1110 Advanced Search with multiple sites should be indicated
#1095 Exclude ".open" WARCs from Image and Page Indexing
#1088 Image that generate 404/500 errors should be hidden from the results
#1062 Revise how to process quoted queries in page and image search
#1046 Wrong year on the left sidebar
#1019 Add regression/integration tests to page search full text API
#1011 The page search result logging service is removing query string parameters from the Wayback redirect
#927 WGET is enclosing the WARC-TARGET-URI with <>
#914 Improve FAWP crawling
#897 Inconsistent naming in Text Search API output JSON fields
#534 List versions - show list view by default and not table view
#417 Add navigation icons to internal search engine results page
Francisco release
Revert to two separate pywbs (framed, unframed).
#1165 Designing Arquivo.pt Award 2022 graphics for dissemination
#1203 Arquivo404 Should point towards the oldest version of a website
#1205 Arquivo404 does not display in framed replay
#1207 Drop-down components must be consistent across UIs
#1210 Arquivo.pt Titles should reflect the page we're on
Eros release
- #1202 Change the language twice does not work
- #1194 Image Advanced Search does not work
- #1193 Keep same Apache Log Format with the new front end
- #1192 Apache fails after waiting more than 60 seconds
- #1191 Update Memorial template
- #1186 Harmonize Arquivo.pt logs directories
- #1177 Add card with book instead of awards 2021
- #1175 Create regression tests for Advanced Page Search page
- #1174 Create regression tests for Images Search page
- #1173 Create regression tests for Advanced Image Search page
- #1172 Create regression tests for URL Search page
- #1171 Create regression tests for Replay page
- #1170 Create regression tests for Pages Search page
- #1169 Add new option when the URL is not in Arquivo.pt
- #1168 Broken "Replay with old browser" links to classic.oldweb.today that was deactivated
- #1167 Broken link in the footer and different link text
- #1163 Create Mockups (HTML/CSS) CitationSaver
- #1162 Save Page Now with URL parameter
- #1160 Review robots.txt
- #1159 Advance search button incorrect layout
- #1158 Update Arquivo.pt presentation video - english version
- #1155 Activate temporal narrative
- #1154 Update arquivo.pt presentation video
- #1153 Deploy latest memorial template on preprod
- #1152 Broken link in the footer of arquivo.pt page
- #1150 Update User Agent from crawlers
- #1148 Collection on replay page should link to our google sheet rather than Archive.org
- #1143 Layout problems on URL search
- #1142 Suggestion not aligned
- #1140 Generate and deploy graphical artes for “Conheça os vencedores do Prémio Arquivo.pt 2021”
- #1139 Design Save Page Now submission form
- #1136 New form for Page Save Now (arquivo.pt)
- #1127 Improve QA of Arquivo.pt releases
- #1126 change email on Apache HTTPd error messages
- #1124 Recover all possible information from .warc.gz.open files
- #1121 Revise image search page on Sobre
- #1118 "Subscribe to mailing list" link not working properly
- #1117 Replace footer on Wordpress with new one
- #1116 Replace footer on www.arquivo.pt
- #1115 Page search filtered by site is not showing enough results
- #1114 Add frame while Completing the page
- #1113 Screenshot service is not working for some pages
- #1109 Wildcard expansion not working properly on Arquivo.pt
- #1108 API parameters name differences between query string and get parameter
- #1107 Create "Advanced Image search" about page
- #1106 Revise " Pesquisa avançada de páginas" about page
- #1105 Change earliest date in the date filter
- #1104 Replace old icon
- #1102 Change labels on Image Advanced search
- #1100 Lack of documentation in the API about the parameter linkToOriginalFile
- #1099 Maintain consistency in home page design
- #1092 Replay list versions problem
- #1080 Date format must be consistent across page and image SERP
- #1077 White images with transparent background show up as white rectangles in image search results
- #1070 Deploys 2019 collections
- #1068 White Page when user clicks on the browser back button (Image Search)
- #1067 Left sidebar without versions
- #1064 URLs with underscore
- #1056 Memorial (image links)
- #1055 Design/develop a new Pages Search page results structure: HTML+ CSS
- #1054 Design/develop a new Replay page structure: HTML+CSS
- #1053 Design/develop a new URL Search page structure: HTML+CSS
- #1052 Design/develop a new Advanced Image Search page structure: HTML+CSS
- #1051 Design/develop a new Images Search page structure: HTML+ CSS
- #1050 Design/develop a new Advanced Page Search page structure: HTML+ CSS
- #1041 Generate new Google Analytic codes for the different environments
- #1040 Develop a new homepage interface structure: HTML+CSS.
- #1036 Format of lighthouse message in "Pesquisar noutros arquivos" should be consistent with others
- #1018 Add SVG support
- #1015 Create regression tests for the image indexing information extractor
- #1006 Prepare infrastructure to support API versioning
- #976 Link to contamehistorias.pt
- #968 Refactor CSS/HTML code to facilitate maintenance
- #904 Display in some mobile devices doesn't fit
- #879 Order of the image search results is wrong on the two column result layout
- #842 Amplify Brozzler Cluster with Docker Swarm
- #805 Change dedupField to URL when searching for site
- #791 CDX output below expected
- #702 Save Page now: create similar service on Arquivo.pt
- #692 image modal arrows need to be hidden on first and last position
- #640 The version table years must be fixed when scrolling down.
- #275 Accessibility level A WCAG 2.0
Dionisius release
ImageSearchApi
https://github.com/arquivo/image-search-api/releases/tag/Dionisius-release
- Index base64 images", #423
- Improve exception handling", #732
- Add better message when Servlet can't connect to SOLR", #734
- Add domain level spam filter to image search", #771
- Log the search results of each image search API call", #929
- Change tstamp field on the API to reflect that it is the crawling date", #939
- Site search: add subdomain expansion to by default to image and page search", #987
- Prepare infrastructure to support API versioning", #1006
- API queries with site restrictions are displaying images from all subdomains", #1014
- Revise Image Search API log format", #1065
- Update ImageSearch API documentation", #1069
- Image Search API SimpleDateFormat is not thread safe", #1071
PageSearch
https://github.com/arquivo/page-search/releases/tag/Dionisius-release
- Revise Page Search API log format", #1066
PyWb
https://github.com/arquivo/pywb-arquivo/releases/tag/Dionisius-release
- Problem replaying page, which may be related to jquery", #743
Webapp
https://github.com/arquivo/arquivo-webapp/releases/tag/Dionisius-release
- Link to contamehistorias.pt", #976
- Site search: add subdomain expansion to by default to image and page search", #987
- Information about image with no labels", #1002
- Create an image resizing service", #1004
- Submit query button only works if you click the "Pesquisar" text", #1101
- Update ImageSearch API documentation", #1069
- Merge log information in one single line (Page View and Image View).", #1073
- Inconsistent image alignment in image SERP", #1074
- Page and Image SERP results are not being redirected correctly when %20 is present in the URL", #1075
- Image SERP page does not stop showing loading spinner on error", #1076
- Date format must be consistent across page and image SERP", #1080
- Image detail SERP must have the exact API labels", #1081
- Image SERP are misaligned due to original image sizes", #1082
- The "Tabela/Lista" shows up on replay SERP, but results do not appear.", #1085
- Setting the safeSearch=off parameter breaks next page flow", #1087
- Image that generate 404/500 errors should be hidden from the results", #1088
Spellchecker
https://github.com/arquivo/PwaSpellchecker/releases/tag/Dionisius-release
Basileus release
- ugh is this time. site boost to 0.01f from 0, so it is not removed by LuceneQueryOptimizer
- revert url boost and add some logging
- set url boost to 0.01f so the LuceneQueryOptimizer doesn't remove the URL Clause in some situations (eawp14)
- Someone changed directly the NutchAnalysis.java instead of running the javaCC... updating the NutchAnalysis.jj with what was changed directly.
- Changed the lexical analyzer to accept '*' as a word punctuation instead of transforming it into a white space. So for now * will be considered part of a word and the query probably will not return any results. On SolrServer the asterisk will have query meaning.
Caronte release
- change title on Arquivo.pt #1060
- Improve messages on Complete page (English) #1039
- Number of results are not being presented (Images) #1038
- Advanced search does not recognize quoted queries correctly #1034
- Query highlight not working for title on quoted queries #1033
- Refactor CSS/HTML code to facilitate maintenance #968
Webapp release
Winterfell release mobile
- Cropped cards on the side on mobile version #701
- Encoding Error: URL not found on URL Search results #705
- Apply site:$HOST instead of site:$URL when the query is an URL #552
- "Select One" default and "Excluír" typo #679
- some URLs are not working in syntax wayback/*/url #386
- Validate if begin date is always earlier than end date on date range restriction #630
- Same expression in the advanced image search form #691
- Dates component slider laziness make rest of the screen slide down #698
- Date sliders is over the datepickers #666
- The replay options shouldn't be visible on the not archived page #686
- Make query suggestion API location configurable #696
- Next on image results jump to the homepage #688
- "Did you mean" does not appears on search image #687
Winterfell release desktop
- Replay options disappear with lower resolutions (Desktop version) #670
- Encoding Error: URL not found on URL Search results #705
- Apply site:$HOST instead of site:$URL when the query is an URL #552
- Change logo with a new one related to Arquivo.pt Award 2020 #710
- Ortographic error on desktop version preprod.arquivo.pt #707
- some URLs are not working in syntax wayback/*/url #386
- Button disappear when switching from Page Search to Image Search (Desktop Version) #697