Skip to content

Releases: arquivo/pwa-technologies

Godhelpus release

19 Dec 17:18
528f903
Compare
Choose a tag to compare

Main changes:

  • Update functional tests
  • Deploy of new CitationSaver service (beta version)
  • Upgrade to GAnalytics 4

Issues:
#1319 Update sobre.arquivo.pt footer to be equal to www.arquivo.pt
#1310 Memorial landing page title is wrong for archived subpages
#1305 Change arquivo404 arquivo.pt archiveApiUrl
#1304 Migrate to Google Analytics 4 property
#1300 Site recorded with SavePageNow giving error 429 - Too Many Requests
#1295 Segregate internal and external API
#1294 Implement monthly uptime reports for each service
#1293 Review content removal technical process
#1291 Create aliases for the Arquivo.pt Memorial websites
#1290 Replace existing FCT logo on sobre.arquivo.pt
#1289 Replace FCT logo on footer of www.arquivo.pt
#1284 SavePageNow can't record websites whose CA certificate is slightly misconfigured
#1268 Move arquivo.pt/collections short link to sobre.arquivo.pt and replace content with static HTML page
#1267 Review script qa.py on arquivo-operation
#1265 esquerda.net not archiving
#1253 Update PHP sobre.arquivo.pt
#1245 bportugal.pt can not be properly crawl
#1238 Update Arquivo.pt Alarms process to test availability of all collections
#1237 Update Jenkins
#1231 Replace links on home page to short links
#1226 "Complete page" should present URL, date and Back button
#1224 SavePageNow/Complete page does not archive embeded images
#1220 Verify fulltext indexing of the latest AWP collections AWP36, etc.
#1176 Mockups CitationSaver to HTML/CSS
#1175 Create regression tests for Advanced Page Search page
#1174 Create regression tests for Images Search page
#1173 Create regression tests for Advanced Image Search page
#1172 Create regression tests for URL Search page
#1171 Create regression tests for Replay page
#1170 Create regression tests for Pages Search page

Fortuna release

28 Sep 13:30
528f903
Compare
Choose a tag to compare

Main changes:

  • Deployed new CDXJ indexes after cleaning warc/revisits created by Brozzler that were crashing pyWB due to their high volume
  • Fix minor UI issues
  • Deployed new broker servers

Closed issues:
#1303 Create template SavePageNow URL no found.
#1301 "Advanced search" label get cropped on the button only on English version
#1292 On replay, the "Table" button sometimes does not work
#1288 "Visit" button is broken
#1285 Some files can only be accessed via noFrame
#1282 URL search Table results should center on available links
#1281 Broker high CPU usage on certain requests
#1280 Create new card on Arquivo.pt home page: Notícias
#1272 Review 429 errors and API thresholds
#1271 Review CryptoCurrency news and publish
#1266 Setup letsencrypt SSL certificates for contamehistorias.pt
#1264 SavePageNow check URL
#1258 No version is being presented on CDX
#1256 Install patiki-client on all servers
#1255 Configure new servers - 105 -> 108
#1254 Block external connections to Wordpress administration console
#1252 Review robots.txt
#1251 Analyze results of security audit performed by RCTS CERT
#1249 Segregate the SavePageNow service from Broker
#1248 Memorial template needs improvement
#1247 Labels in English when we have set the Portuguese language (Footer, Mouseover)
#1246 SavePageNow, problems with the replay of the URL "rnca.fccn.pt"
#1244 Improve Arquivo404 documentation
#1240 Configure the new brokers
#1236 Image Advanced Search should support multiple format search
#1235 Advanced search parameters disappear between Image Advanced Search UI and SERP
#1234 Click on image should show image in original size
#1233 Side-swipe does not move Table of Versions
#1231 Replace links on home page to short links
#1229 Harmonize label for confirmation message of "Copy link"
#1228 Add new links to arquivo404 and Open datasets on www.arquivo.pt and sobre.arquivo.pt footers
#1227 Save Page Now UI should present URL being archived
#1223 Address not found Save Page Now
#1221 Deploy 2020 collections
#1218 Use letsencrypt in Memorial
#1217 Banner and Logo from SavePageNow are not positioned correctly
#1216 Integrate new cards in home page
#1212 Put limits on Qos and set rules on robots.txt (SavePageNow)
#1209 Some URLs not recognized as valid URL on SavePageNow
#1206 Review 404 error page on Apache httpd
#1201 www.11cnef.pt was not crawled or cannot be replayed
#1195 Encoding URL on replay
#1190 Review advanced search labels
#1189 All blocked content should be monitored by Icinga
#1188 Page search Index small collections
#1187 Assess the impact of Google takeout as alternative to Backup and Sync
#1185 Create a link on sobre.arquivo.pt with all information about SavePageNow
#1183 Create new cards for homepage: Arquivo.pt Awards
#1182 Create new cards for homepage: Memorial
#1180 Create new cards for homepage: SavePageNow
#1179 Create new cards for homepage: Sugira site
#1166 Message in English presents date in Portuguese
#1164 Create new roll-up for Arquivo.pt Award
#1146 Check interconnection between logs and front-end
#1145 Save Page Now save videos?
#1135 "Last updated on XX" needs to go to the bottom of the page (sobre.arquivo.pt)
#1134 recomendacoes shortlink on footer is broken
#1133 search box and button aren't completely visible after searching on sobre.arquivo.pt
#1132 Back Button does not work (Image View)
#1125 Duplicate entries in cdxj (Save Page Now)
#1119 Problem replaying images
#1117 Replace footer on Wordpress with new one
#1112 Problems collecting the response from POST URLs
#1111 Page Advanced Search should support multiple format search
#1110 Advanced Search with multiple sites should be indicated
#1095 Exclude ".open" WARCs from Image and Page Indexing
#1088 Image that generate 404/500 errors should be hidden from the results
#1062 Revise how to process quoted queries in page and image search
#1046 Wrong year on the left sidebar
#1019 Add regression/integration tests to page search full text API
#1011 The page search result logging service is removing query string parameters from the Wayback redirect
#927 WGET is enclosing the WARC-TARGET-URI with <>
#914 Improve FAWP crawling
#897 Inconsistent naming in Text Search API output JSON fields
#534 List versions - show list view by default and not table view
#417 Add navigation icons to internal search engine results page

Francisco release

21 Jan 16:38
Compare
Choose a tag to compare

Revert to two separate pywbs (framed, unframed).

#1165 Designing Arquivo.pt Award 2022 graphics for dissemination
#1203 Arquivo404 Should point towards the oldest version of a website
#1205 Arquivo404 does not display in framed replay
#1207 Drop-down components must be consistent across UIs
#1210 Arquivo.pt Titles should reflect the page we're on

Eros release

21 Jan 16:37
6610dc5
Compare
Choose a tag to compare
  • #1202 Change the language twice does not work
  • #1194 Image Advanced Search does not work
  • #1193 Keep same Apache Log Format with the new front end
  • #1192 Apache fails after waiting more than 60 seconds
  • #1191 Update Memorial template
  • #1186 Harmonize Arquivo.pt logs directories
  • #1177 Add card with book instead of awards 2021
  • #1175 Create regression tests for Advanced Page Search page
  • #1174 Create regression tests for Images Search page
  • #1173 Create regression tests for Advanced Image Search page
  • #1172 Create regression tests for URL Search page
  • #1171 Create regression tests for Replay page
  • #1170 Create regression tests for Pages Search page
  • #1169 Add new option when the URL is not in Arquivo.pt
  • #1168 Broken "Replay with old browser" links to classic.oldweb.today that was deactivated
  • #1167 Broken link in the footer and different link text
  • #1163 Create Mockups (HTML/CSS) CitationSaver
  • #1162 Save Page Now with URL parameter
  • #1160 Review robots.txt
  • #1159 Advance search button incorrect layout
  • #1158 Update Arquivo.pt presentation video - english version
  • #1155 Activate temporal narrative
  • #1154 Update arquivo.pt presentation video
  • #1153 Deploy latest memorial template on preprod
  • #1152 Broken link in the footer of arquivo.pt page
  • #1150 Update User Agent from crawlers
  • #1148 Collection on replay page should link to our google sheet rather than Archive.org
  • #1143 Layout problems on URL search
  • #1142 Suggestion not aligned
  • #1140 Generate and deploy graphical artes for “Conheça os vencedores do Prémio Arquivo.pt 2021”
  • #1139 Design Save Page Now submission form
  • #1136 New form for Page Save Now (arquivo.pt)
  • #1127 Improve QA of Arquivo.pt releases
  • #1126 change email on Apache HTTPd error messages
  • #1124 Recover all possible information from .warc.gz.open files
  • #1121 Revise image search page on Sobre
  • #1118 "Subscribe to mailing list" link not working properly
  • #1117 Replace footer on Wordpress with new one
  • #1116 Replace footer on www.arquivo.pt
  • #1115 Page search filtered by site is not showing enough results
  • #1114 Add frame while Completing the page
  • #1113 Screenshot service is not working for some pages
  • #1109 Wildcard expansion not working properly on Arquivo.pt
  • #1108 API parameters name differences between query string and get parameter
  • #1107 Create "Advanced Image search" about page
  • #1106 Revise " Pesquisa avançada de páginas" about page
  • #1105 Change earliest date in the date filter
  • #1104 Replace old icon
  • #1102 Change labels on Image Advanced search
  • #1100 Lack of documentation in the API about the parameter linkToOriginalFile
  • #1099 Maintain consistency in home page design
  • #1092 Replay list versions problem
  • #1080 Date format must be consistent across page and image SERP
  • #1077 White images with transparent background show up as white rectangles in image search results
  • #1070 Deploys 2019 collections
  • #1068 White Page when user clicks on the browser back button (Image Search)
  • #1067 Left sidebar without versions
  • #1064 URLs with underscore
  • #1056 Memorial (image links)
  • #1055 Design/develop a new Pages Search page results structure: HTML+ CSS
  • #1054 Design/develop a new Replay page structure: HTML+CSS
  • #1053 Design/develop a new URL Search page structure: HTML+CSS
  • #1052 Design/develop a new Advanced Image Search page structure: HTML+CSS
  • #1051 Design/develop a new Images Search page structure: HTML+ CSS
  • #1050 Design/develop a new Advanced Page Search page structure: HTML+ CSS
  • #1041 Generate new Google Analytic codes for the different environments
  • #1040 Develop a new homepage interface structure: HTML+CSS.
  • #1036 Format of lighthouse message in "Pesquisar noutros arquivos" should be consistent with others
  • #1018 Add SVG support
  • #1015 Create regression tests for the image indexing information extractor
  • #1006 Prepare infrastructure to support API versioning
  • #976 Link to contamehistorias.pt
  • #968 Refactor CSS/HTML code to facilitate maintenance
  • #904 Display in some mobile devices doesn't fit
  • #879 Order of the image search results is wrong on the two column result layout
  • #842 Amplify Brozzler Cluster with Docker Swarm
  • #805 Change dedupField to URL when searching for site
  • #791 CDX output below expected
  • #702 Save Page now: create similar service on Arquivo.pt
  • #692 image modal arrows need to be hidden on first and last position
  • #640 The version table years must be fixed when scrolling down.
  • #275 Accessibility level A WCAG 2.0

Dionisius release

21 Jan 16:37
6610dc5
Compare
Choose a tag to compare

ImageSearchApi

https://github.com/arquivo/image-search-api/releases/tag/Dionisius-release

  • Index base64 images", #423
  • Improve exception handling", #732
  • Add better message when Servlet can't connect to SOLR", #734
  • Add domain level spam filter to image search", #771
  • Log the search results of each image search API call", #929
  • Change tstamp field on the API to reflect that it is the crawling date", #939
  • Site search: add subdomain expansion to by default to image and page search", #987
  • Prepare infrastructure to support API versioning", #1006
  • API queries with site restrictions are displaying images from all subdomains", #1014
  • Revise Image Search API log format", #1065
  • Update ImageSearch API documentation", #1069
  • Image Search API SimpleDateFormat is not thread safe", #1071

PageSearch

https://github.com/arquivo/page-search/releases/tag/Dionisius-release

  • Revise Page Search API log format", #1066

PyWb

https://github.com/arquivo/pywb-arquivo/releases/tag/Dionisius-release

  • Problem replaying page, which may be related to jquery", #743

Webapp

https://github.com/arquivo/arquivo-webapp/releases/tag/Dionisius-release

  • Link to contamehistorias.pt", #976
  • Site search: add subdomain expansion to by default to image and page search", #987
  • Information about image with no labels", #1002
  • Create an image resizing service", #1004
  • Submit query button only works if you click the "Pesquisar" text", #1101
  • Update ImageSearch API documentation", #1069
  • Merge log information in one single line (Page View and Image View).", #1073
  • Inconsistent image alignment in image SERP", #1074
  • Page and Image SERP results are not being redirected correctly when %20 is present in the URL", #1075
  • Image SERP page does not stop showing loading spinner on error", #1076
  • Date format must be consistent across page and image SERP", #1080
  • Image detail SERP must have the exact API labels", #1081
  • Image SERP are misaligned due to original image sizes", #1082
  • The "Tabela/Lista" shows up on replay SERP, but results do not appear.", #1085
  • Setting the safeSearch=off parameter breaks next page flow", #1087
  • Image that generate 404/500 errors should be hidden from the results", #1088

Spellchecker

https://github.com/arquivo/PwaSpellchecker/releases/tag/Dionisius-release

  • Solve problem without no suggestion #136
  • Fix log rotation #1063

Basileus release

11 Nov 11:32
Compare
Choose a tag to compare
  • ugh is this time. site boost to 0.01f from 0, so it is not removed by LuceneQueryOptimizer
  • revert url boost and add some logging
  • set url boost to 0.01f so the LuceneQueryOptimizer doesn't remove the URL Clause in some situations (eawp14)
  • Someone changed directly the NutchAnalysis.java instead of running the javaCC... updating the NutchAnalysis.jj with what was changed directly.
  • Changed the lexical analyzer to accept '*' as a word punctuation instead of transforming it into a white space. So for now * will be considered part of a word and the query probably will not return any results. On SolrServer the asterisk will have query meaning.

Caronte release

19 Jan 18:30
Compare
Choose a tag to compare
  • change title on Arquivo.pt #1060
  • Improve messages on Complete page (English) #1039
  • Number of results are not being presented (Images) #1038
  • Advanced search does not recognize quoted queries correctly #1034
  • Query highlight not working for title on quoted queries #1033
  • Refactor CSS/HTML code to facilitate maintenance #968

Webapp release

15 Apr 11:02
Compare
Choose a tag to compare
  • Remove total_items from Arquivo.pt API #520
  • Date range limitation parameters are not communicated to Advanced Search form #678
  • Add Israblog EAWP19 rapid search page #750
  • "From" and "To" parameters are not working well on SearchPage API #891

Winterfell release mobile

09 Jan 11:46
Compare
Choose a tag to compare
  • Cropped cards on the side on mobile version #701
  • Encoding Error: URL not found on URL Search results #705
  • Apply site:$HOST instead of site:$URL when the query is an URL #552
  • "Select One" default and "Excluír" typo #679
  • some URLs are not working in syntax wayback/*/url #386
  • Validate if begin date is always earlier than end date on date range restriction #630
  • Same expression in the advanced image search form #691
  • Dates component slider laziness make rest of the screen slide down #698
  • Date sliders is over the datepickers #666
  • The replay options shouldn't be visible on the not archived page #686
  • Make query suggestion API location configurable #696
  • Next on image results jump to the homepage #688
  • "Did you mean" does not appears on search image #687

Winterfell release desktop

09 Jan 11:46
Compare
Choose a tag to compare
  • Replay options disappear with lower resolutions (Desktop version) #670
  • Encoding Error: URL not found on URL Search results #705
  • Apply site:$HOST instead of site:$URL when the query is an URL #552
  • Change logo with a new one related to Arquivo.pt Award 2020 #710
  • Ortographic error on desktop version preprod.arquivo.pt #707
  • some URLs are not working in syntax wayback/*/url #386
  • Button disappear when switching from Page Search to Image Search (Desktop Version) #697