Guide for creating full text search capabilities on audio files within your object store

mixpeek/audio-search-s3

Text Search of Audio files in S3 bucket(s) using Tika and OpenSearch

Use Tika and OpenSearch to search the contents of audio files across S3 bucket(s).

Medium Article Tutorial

Installation

OpenSearch:

brew update
brew install opensearch
opensearch
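Before moving on, it can help to confirm that the node actually started. The following is a minimal sketch, assuming OpenSearch listens on its default HTTP port 9200 on localhost without TLS or authentication (an install with the security plugin enabled would need https and credentials instead). The function name opensearch_is_up is hypothetical and not part of this repository.

```python
import json
import urllib.request


def opensearch_is_up(url="http://localhost:9200", timeout=2.0):
    """Return True if an OpenSearch-style info document is served at `url`.

    Assumption: plain HTTP on the default port, no authentication.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            info = json.load(response)
    except (OSError, ValueError):
        # OSError covers refused connections and timeouts;
        # ValueError covers a response body that is not valid JSON.
        return False
    # The root endpoint normally reports a "version" object.
    return "version" in info


if __name__ == "__main__":
    print("OpenSearch reachable:", opensearch_is_up())
```

If this prints False, check the terminal where opensearch was started for startup errors before running the steps below.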

App dependencies:

pip install -r requirements.txt

Setup

  • Add your AWS keys to config.py
  • Set the s3_file_name variable in insert.py to the name of the file you want to search
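The layout of config.py is not shown here, so the following is only a sketch of one way to supply the keys without hard-coding them. It assumes the credentials live in the standard AWS environment variables; the helper name load_aws_settings and the fallback region are hypothetical. The returned dictionary uses the keyword names that boto3.client accepts.

```python
import os


def load_aws_settings(env=os.environ):
    """Collect AWS settings from environment variables, failing early if any are missing."""
    required = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY")
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return {
        "aws_access_key_id": env["AWS_ACCESS_KEY_ID"],
        "aws_secret_access_key": env["AWS_SECRET_ACCESS_KEY"],
        # Hypothetical default; use the region that holds your bucket.
        "region_name": env.get("AWS_DEFAULT_REGION", "us-east-1"),
    }
```

A caller could then write boto3.client("s3", **load_aws_settings()), which keeps secrets out of version control.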

Run

  1. Create the index by running python insert.py create_index
  2. Download the file, extract its contents, and insert them into OpenSearch by running python insert.py download_file
  3. Run the app via python app.py
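The steps above can be sketched roughly as follows. This is not the code in insert.py or app.py, only an illustration of the index, insert, and search calls. It assumes `client` is an opensearchpy.OpenSearch instance and that the text has already been downloaded from S3 and extracted by Tika. The index name, field names, and function names are all hypothetical.

```python
INDEX_NAME = "audio-files"  # hypothetical index name


def index_body():
    """Index definition: one exact-match key field and one full-text field."""
    return {
        "settings": {"index": {"number_of_shards": 1, "number_of_replicas": 0}},
        "mappings": {
            "properties": {
                "s3_key": {"type": "keyword"},
                "content": {"type": "text"},
            }
        },
    }


def create_index(client):
    """Create the index unless it already exists."""
    if not client.indices.exists(index=INDEX_NAME):
        client.indices.create(index=INDEX_NAME, body=index_body())


def insert_document(client, s3_key, extracted_text):
    """Store the extracted text, using the S3 key as the document id."""
    doc = {"s3_key": s3_key, "content": extracted_text.strip()}
    # refresh=True makes the document searchable immediately (fine for a demo).
    client.index(index=INDEX_NAME, id=s3_key, body=doc, refresh=True)
    return doc


def search_query(terms, size=10):
    """Build a full-text match query against the content field."""
    return {"size": size, "query": {"match": {"content": terms}}}


def search(client, terms):
    """Return the S3 keys of the documents that match `terms`."""
    response = client.search(index=INDEX_NAME, body=search_query(terms))
    return [hit["_source"]["s3_key"] for hit in response["hits"]["hits"]]
```

Using the S3 key as the document id means re-running the insert for the same file overwrites the earlier document instead of creating a duplicate.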
