Bug: Search times out on v0.7.2 installed on Synology using Portainer #1318

Closed
MyNameIsOka opened this issue Jan 12, 2024 · 2 comments

MyNameIsOka commented Jan 12, 2024

Describe the bug

Searching doesn't work. No matter what I search for, the page loads for a while and then shows the Synology error page "Sorry, the page you are looking for is not found."
I only just noticed this, and I started using ArchiveBox recently, so I don't know whether search worked before.
I also tried different browsers (Brave, Mullvad Browser, Chrome), but the result is the same.

Steps to reproduce

  • installed via Portainer on Synology
  • env vars:
      - ALLOWED_HOSTS=*
      - MEDIA_MAX_SIZE=2000m
      - PUID=1029
      - PGID=100
      - PUBLIC_INDEX=False
      - PUBLIC_SNAPSHOTS=False
      - PUBLIC_ADD_VIEW=False
      - SAVE_ARCHIVE_DOT_ORG=False
      - SAVE_FAVICON=False
      - CHROME_BINARY=/usr/bin/chromium-browser

Screenshots or log output

Screenshot 2024-01-12 at 13 07 12

ArchiveBox version

0.7.2
ArchiveBox v0.7.2+editable COMMIT_HASH=e888869 BUILD_TIME=2024-01-05 03:55:52 1704426952
IN_DOCKER=True IN_QEMU=False ARCH=x86_64 OS=Linux PLATFORM=Linux-4.4.302+-x86_64-with-glibc2.36 PYTHON=Cpython
FS_ATOMIC=True FS_REMOTE=True FS_USER=1029:100 FS_PERMS=644
DEBUG=False IS_TTY=True TZ=UTC SEARCH_BACKEND=ripgrep LDAP=False

[i] Dependency versions:
 √  PYTHON_BINARY         v3.11.7         valid     /usr/local/bin/python3.11                                                   
 √  SQLITE_BINARY         v2.6.0          valid     /usr/local/lib/python3.11/sqlite3/dbapi2.py                                 
 √  DJANGO_BINARY         v3.1.14         valid     /usr/local/lib/python3.11/site-packages/django/__init__.py                  
 √  ARCHIVEBOX_BINARY     v0.7.2          valid     /usr/local/bin/archivebox                                                   

 √  CURL_BINARY           v8.5.0          valid     /usr/bin/curl                                                               
 √  WGET_BINARY           v1.21.3         valid     /usr/bin/wget                                                               
 √  NODE_BINARY           v20.10.0        valid     /usr/bin/node                                                               
 √  SINGLEFILE_BINARY     v1.1.46         valid     /app/node_modules/single-file-cli/single-file                               
 √  READABILITY_BINARY    v0.0.11         valid     /app/node_modules/readability-extractor/readability-extractor               
 √  MERCURY_BINARY        v1.0.0          valid     /app/node_modules/@postlight/parser/cli.js                                  
 √  GIT_BINARY            v2.39.2         valid     /usr/bin/git                                                                
 √  YOUTUBEDL_BINARY      v2023.12.30     valid     /usr/local/bin/yt-dlp                                                       
 √  CHROME_BINARY         v120.0.6099.28  valid     /browsers/chromium-1091/chrome-linux/chrome                                 
 √  RIPGREP_BINARY        v13.0.0         valid     /usr/bin/rg                                                                 

[i] Source-code locations:
 √  PACKAGE_DIR           23 files        valid     /app/archivebox                                                             
 √  TEMPLATES_DIR         3 files         valid     /app/archivebox/templates                                                   
 -  CUSTOM_TEMPLATES_DIR  -               disabled  None                                                                        

[i] Secrets locations:
 -  CHROME_USER_DATA_DIR  -               disabled  None                                                                        
 -  COOKIES_FILE          -               disabled  None                                                                        

[i] Data locations:
 √  OUTPUT_DIR            8 files @       valid     /data                                                                       
 √  SOURCES_DIR           101 files       valid     ./sources                                                                   
 √  LOGS_DIR              2 files         valid     ./logs                                                                      
 √  ARCHIVE_DIR           102 files       valid     ./archive                                                                   
 √  CONFIG_FILE           162.0 Bytes     valid     ./ArchiveBox.conf                                                           
 √  SQL_INDEX             872.0 KB        valid     ./index.sqlite3 

pirate (Member) commented Jan 12, 2024

The default search method uses ripgrep to scan over the files on disk (with no index), which can be slow if your backing storage is slow. It works well on local SSDs (see https://demo.archivebox.io) but not on HDDs or NFS/SMB mounts.

What type of drive are you using to host your archive data? If it's an HDD or network mount, it's likely too slow to use ripgrep.
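
One rough way to check whether storage speed is the bottleneck is to time a full read of the archive folder. The sketch below is only an illustration and not part of ArchiveBox: `time_full_read` is a hypothetical helper, and `./archive` is taken from the ARCHIVE_DIR line in the version output above. ripgrep skips some files and the OS caches recently read data, so treat the result as a ballpark figure from a first (cold) run.

```python
import os
import time


def time_full_read(root, chunk_size=1 << 20):
    """Read every file under root once.

    Returns (files_read, bytes_read, elapsed_seconds).
    """
    files = 0
    total = 0
    start = time.perf_counter()
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            try:
                with open(os.path.join(dirpath, name), "rb") as fh:
                    # Read in chunks so large media files do not fill memory.
                    while chunk := fh.read(chunk_size):
                        total += len(chunk)
            except OSError:
                continue  # skip unreadable files and broken symlinks
            files += 1
    return files, total, time.perf_counter() - start


if __name__ == "__main__":
    # Assumed path: ARCHIVE_DIR relative to the data folder; adjust as needed.
    files, total, elapsed = time_full_read("./archive")
    print(f"{files} files, {total / 1e6:.1f} MB read in {elapsed:.1f} s")
```

If a single pass takes longer than the web server or reverse proxy is willing to wait, an unindexed search over the same files will probably time out as well.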

Instead, I recommend switching to the more advanced Sonic search backend, which does full-text indexing and works well even on older drives. Instructions are here: #956 (comment)
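
To illustrate why an index helps on slow storage, here is a toy comparison of an unindexed scan with an inverted index. It is a simplified sketch, not how ripgrep or Sonic are actually implemented: the function names and sample documents are made up, and matching is done on whole words for simplicity.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def scan_search(docs, term):
    """Unindexed search: every document is re-read on every query."""
    term = term.lower()
    return sorted(doc_id for doc_id, text in docs.items() if term in tokenize(text))


def build_index(docs):
    """Build a word -> set of document ids mapping once, up front."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in tokenize(text):
            index[word].add(doc_id)
    return index


def index_search(index, term):
    """Indexed search: a single lookup, no document is re-read."""
    return sorted(index.get(term.lower(), set()))


# Hypothetical sample data standing in for archived snapshots.
docs = {
    "snap1": "ArchiveBox saves web pages to disk",
    "snap2": "Sonic is a search backend",
    "snap3": "Search over pages stored on disk",
}
index = build_index(docs)
print(scan_search(docs, "disk"))    # ['snap1', 'snap3']
print(index_search(index, "disk"))  # ['snap1', 'snap3']
```

Both searches return the same result, but the scan touches every document per query while the indexed lookup pays the reading cost once, when the index is built.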

pirate changed the title from "Bug: Search is not working" to "Bug: Search times out on v0.7.2 installed on Synology using Portainer" on Jan 12, 2024

MyNameIsOka (Author) commented

Thanks a lot for your quick response! Yes, I am using HDDs. After defining SEARCH_BACKEND_ENGINE=sonic in the docker-compose file, search started working 👍

pirate closed this as completed on Jan 12, 2024