Skip to content
This repository has been archived by the owner on Aug 28, 2021. It is now read-only.

Commit

Permalink
documentation update
Browse files Browse the repository at this point in the history
  • Loading branch information
guilhemmarchand committed Mar 28, 2017
1 parent 272c10f commit f2a5aac
Show file tree
Hide file tree
Showing 2 changed files with 44 additions and 2 deletions.
14 changes: 12 additions & 2 deletions docs/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -37,8 +37,6 @@ Overview:
binaries
requirements
deployment_matrix
deployment
external

Processing:
-----------
Expand All @@ -48,3 +46,15 @@ Processing:

data_processing

Deployment and configuration:
-----------------------------

.. toctree::
:maxdepth: 2

deployment
external
json_indexing



32 changes: 32 additions & 0 deletions docs/json_indexing.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,32 @@
###############################
JSON indexing versus legacy CSV
###############################

Nmon data is basically generating CSV data (Comma Separated Value), and this is as well the case for the files generated by the TA-nmon

**By default, the TA-nmon generates several files to be indexed in the following directories:**

* $SPLUNK_HOME/var/log/nmon/var/csv_repository
* $SPLUNK_HOME/var/log/nmon/var/config_repository

In the case of the nmon performance data (the "csv_repository" 0, we generate one csv data file by nmon section. (basically per performance monitor)

Then, Splunk indexes the data using the CSV "INDEXED_EXTRACTIONS" mode, these parameters are visible in "default/props.conf" under the "nmon_data" sourcetype::

[nmon_data]

FIELD_DELIMITER=,
FIELD_QUOTE="
HEADER_FIELD_LINE_NUMBER=1

# your settings
INDEXED_EXTRACTIONS=csv

In this mode, Splunk identifies the fields name using the CSV header, then each field is indexed as an "indexed fields", to be opposed to fields extraction at search time. (like Key Value data for instance)

The indexed CSV mode provides great performances at search time, and CSV data generates a low level of data volume which saves Splunk licensing costs.

However, the disadvantage of this is an higher cost in storage requirements as Splunk has to generate an higher volume of tsidx files (indexed files) versus rawdata files within the indexes storage.



0 comments on commit f2a5aac

Please sign in to comment.