Help switching to ripgrep-all #1091

Closed
kylemclaren opened this issue Jan 30, 2023 · 3 comments
Labels
touches: dependencies/packaging Issues or changes that add/remove/affect dependencies type: support why: functionality Intended to improve ArchiveBox functionality or features

Comments

@kylemclaren

Problem

I would like to use RIPGREP_BINARY=rga to make PDF snapshots searchable in my ArchiveBox instance, but I get the following error:

[!] Warning: Missing 1 recommended dependencies
    ! RIPGREP_BINARY: rga (unable to detect version)

Steps to Reproduce

  1. Have an ArchiveBox v0.6.2 (dev branch) installation running on Ubuntu 22.04.1 LTS
  2. Run sudo docker-compose run archivebox archivebox config --set RIPGREP_BINARY=rga
  3. Run archivebox --version and observe the error message shown above

Expected Result

PDF snapshots should be searchable

Actual Result

I receive the error message: RIPGREP_BINARY: rga (unable to detect version)

Environment

ArchiveBox v0.6.2
Cpython Linux Linux-5.15.0-1028-aws-x86_64-with-glibc2.28 x86_64
IN_DOCKER=True DEBUG=False IS_TTY=True TZ=UTC SEARCH_BACKEND_ENGINE=sonic

[i] Dependency versions:
 √  ARCHIVEBOX_BINARY     v0.6.2          valid     /usr/local/bin/archivebox                                                   
 √  PYTHON_BINARY         v3.9.5          valid     /usr/local/bin/python3.9                                                    
 √  DJANGO_BINARY         v3.1.10         valid     /usr/local/lib/python3.9/site-packages/django/bin/django-admin.py           
 √  CURL_BINARY           v7.64.0         valid     /usr/bin/curl                                                               
 √  WGET_BINARY           v1.20.1         valid     /usr/bin/wget                                                               
 √  NODE_BINARY           v15.14.0        valid     /usr/bin/node                                                               
 √  SINGLEFILE_BINARY     v0.3.16         valid     /node/node_modules/single-file/cli/single-file                              
 √  READABILITY_BINARY    v0.0.2          valid     /node/node_modules/readability-extractor/readability-extractor              
 √  MERCURY_BINARY        v1.0.0          valid     /node/node_modules/@postlight/mercury-parser/cli.js                         
 √  GIT_BINARY            v2.20.1         valid     /usr/bin/git                                                                
 √  YOUTUBEDL_BINARY      v2021.04.26     valid     /usr/local/bin/youtube-dl                                                   
 √  CHROME_BINARY         v90.0.4430.93   valid     /usr/bin/chromium                                                           
 X  RIPGREP_BINARY        ?               invalid   rga                                                                         

[i] Source-code locations:
 √  PACKAGE_DIR           22 files        valid     /app/archivebox                                                             
 √  TEMPLATES_DIR         3 files         valid     /app/archivebox/templates                                                   
 -  CUSTOM_TEMPLATES_DIR  -               disabled                                                                              

[i] Secrets locations:
 -  CHROME_USER_DATA_DIR  -               disabled                                                                              
 -  COOKIES_FILE          -               disabled                                                                              

[i] Data locations:
 √  OUTPUT_DIR            8 files         valid     /data                                                                       
 √  SOURCES_DIR           4 files         valid     ./sources                                                                   
 √  LOGS_DIR              1 files         valid     ./logs                                                                      
 √  ARCHIVE_DIR           2 files         valid     ./archive                                                                   
 √  CONFIG_FILE           123.0 Bytes     valid     ./ArchiveBox.conf                                                           
 √  SQL_INDEX             208.0 KB        valid     ./index.sqlite3                                                             

[!] Warning: Missing 1 recommended dependencies
    ! RIPGREP_BINARY: rga (unable to detect version)

I have followed the instructions for setting up rga as per the configuration wiki, but I still receive this error message.

Additionally, I would like more context about using sonic as a backend search engine. Do I use it in tandem with rga, or is it one or the other?

I would greatly appreciate any help in resolving this issue. Thank you!

@kylemclaren changed the title from "Help switching to rigrep-all" to "Help switching to ripgrep-all" Jan 30, 2023
@kylemclaren
Author

I have now run sudo docker-compose run archivebox archivebox config --set RIPGREP_BINARY=/usr/local/bin/rga

but I am still getting:

 X  RIPGREP_BINARY        ?               invalid   /usr/local/bin/rga   

[!] Warning: Missing 1 recommended dependencies
    ! RIPGREP_BINARY: /usr/local/bin/rga (unable to detect version)

I have ensured the binary is installed:

root@6b77e2a2875e:/data# /usr/local/bin/rga --version
ripgrep-all 0.9.6

@pirate
Member

pirate commented Jan 31, 2023

Unfortunately it's one or the other: sonic or rga, not both. The sonic instructions are WIP, more info here: #956 (comment)

Can you confirm that you installed the binary inside the docker container? (not just mounted an rga binary from outside docker to that path)

Can you run env DEBUG=True archivebox config --get RIPGREP_VERSION and post the output?
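
For the first question, a rough sketch of a check that can be pasted into a shell inside the container (for example the root@...:/data# prompt shown earlier). check_binary is a hypothetical helper, not an ArchiveBox command, and it only assumes that the binary answers to --version, as rga does in the output above. It does not reproduce how ArchiveBox itself detects versions.

```shell
#!/bin/sh
# check_binary: hypothetical helper, not part of ArchiveBox or ripgrep-all.
# Resolves a binary name or path the way the shell would, then prints
# the first line of its --version output.
check_binary() {
    bin="$1"
    # command -v prints the resolved path, and fails if the name is not
    # on PATH or the given path is not an executable file.
    path="$(command -v "$bin" 2>/dev/null)" || {
        echo "not found or not executable: $bin"
        return 1
    }
    echo "resolved: $path"
    # Keep only the first line of whatever the binary reports.
    "$path" --version 2>&1 | head -n 1
}
```

Running check_binary rga and check_binary /usr/local/bin/rga in the same container that ArchiveBox runs in shows whether the name and the absolute path both resolve there. If either one reports "not found or not executable", the binary is probably only present on the host or in a different container.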

@pirate
Member

pirate commented Jun 13, 2023

Closing due to inactivity. Comment back here if you still need help!

@pirate pirate closed this as completed Jun 13, 2023
@pirate pirate added why: functionality Intended to improve ArchiveBox functionality or features type: support touches: dependencies/packaging Issues or changes that add/remove/affect dependencies labels Jun 13, 2023