opensearch-export

Simple script that queries OpenSearch logs and exports them to CSV or JSON.

To run the script:

Create virtual environment

python -m venv .venv

Activate virtual environment

.venv\Scripts\activate

Install dependencies

python -r pip install ./requirements.txt
or
pip install -r requirements.txt

How it Works

The script connects to an OpenSearch cluster using the credentials and connection details provided in parameters.json. It then executes a query based on the configuration in the same file, fetching data within a specified time range and matching defined criteria. The results are streamed and saved to either a JSON or CSV file, as configured.

Configuration (`parameters.json`)

The parameters.json file contains all the necessary settings for the script to run. Here's a breakdown of the main sections:

connection: Specifies the OpenSearch host, port, username, password, and SSL settings.
index: The index pattern to query (e.g., your-index-pattern-*).
timespan: Defines the start and end time for the data query in YYYY-MM-DDTHH:mm:ss format.
query: Contains the specific query details (see below).
output: Configures the output format (json or csv), file path, and batch size for fetching data.
scroll: Sets the scroll time for fetching large datasets.

Defining a Query

The query object within parameters.json allows you to specify the search criteria using the OpenSearch Query DSL.

_source: (Optional) A list of fields to include in the results. If omitted, all fields are returned.
bool_conditions: (Optional) Defines boolean clauses (must, should, must_not, filter) to combine multiple query criteria. You can nest boolean queries and use various query types like term, match, range, wildcard, exists, etc.

Example Query Structure:

"query": {
  "_source": [
    "timestamp",
    "applicationName",
    "fields.eventCode"
  ],
  "bool_conditions": {
    "must": [
      {
        "bool": {
          "should": [
            {
              "bool": {
                "must": [
                  {"wildcard": {"applicationName": "app-prefix*"}},
                  {"term": {"fields.eventCode.keyword": "EVENT_CODE_1"}}
                ]
              }
            },
            {
              "bool": {
                "must": [
                  {"wildcard": {"applicationName": "another-app-prefix*"}},
                  {"exists": {"field": "fields.correlationId"}}
                ]
              }
            }
          ],
          "minimum_should_match": 1
        }
      }
    ]
  }
}

This example fetches specific fields (_source) for documents where the applicationName starts with app-prefix* AND has EVENT_CODE_1, OR where the applicationName starts with another-app-prefix* AND the fields.correlationId exists.

Running the Script

Once configured, run the script from your activated virtual environment:

python fetchData.py

You can optionally provide a path to a different configuration file:

python fetchData.py /path/to/your/custom_parameters.json

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
fetchData.py		fetchData.py
parameters.json		parameters.json
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

opensearch-export

To run the script:

Create virtual environment

Activate virtual environment

Install dependencies

How it Works

Configuration (`parameters.json`)

Defining a Query

Running the Script

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

polymons/opensearch-export

Folders and files

Latest commit

History

Repository files navigation

opensearch-export

To run the script:

Create virtual environment

Activate virtual environment

Install dependencies

How it Works

Configuration (parameters.json)

Defining a Query

Running the Script

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Configuration (`parameters.json`)

Packages