
Allow to stripe the data location over multiple locations #1356

Closed
kimchy opened this issue Sep 22, 2011 · 7 comments

Comments

@kimchy
Member

kimchy commented Sep 22, 2011

Allow striping the data location over multiple locations. The striping is simple: whole files are placed in one of the locations, and the location is chosen based on which one has the greatest free space. Note that there are no multiple copies of the same data; in that sense it is similar to RAID 0. Though simple, it should provide a good solution for people who don't want to mess with RAID setups and the like. Here is how it is configured:

   path.data: /mnt/first,/mnt/second

Or in an array format:

   path.data: ["/mnt/first", "/mnt/second"]
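
For illustration, here is a minimal Python sketch of the selection rule described above (whole files land on whichever configured location currently has the most free space). The function name and paths are made up for the example; this is not the actual Elasticsearch implementation:

   import shutil

   def pick_data_location(locations):
       # Illustrative only: return the configured location with the most free bytes.
       # shutil.disk_usage(path).free reports free space on that path's filesystem.
       return max(locations, key=lambda path: shutil.disk_usage(path).free)

   # Mirrors path.data: ["/mnt/first", "/mnt/second"] (example mount points)
   target = pick_data_location(["/mnt/first", "/mnt/second"])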
kimchy closed this as completed in 8d7aaa7 Sep 22, 2011
@medcl
Contributor

medcl commented Sep 23, 2011

Hi @kimchy, I was wondering: after setting the data location to multiple locations, can I change them later? And do these locations contain the same copy of the data?

@kimchy
Member Author

kimchy commented Sep 23, 2011

You can change them later, but it requires restarting the node. The locations do not share the same copy; the data is striped à la RAID 0.

@deinspanjer

What is the expected failure mode if a disk dies or otherwise becomes inaccessible? Will ES continue to write to the remaining volumes? Will the data on the failed node be recognized and recovered by the cluster?

@arsonak47

I configured multiple folders in my elasticsearch.yaml as follows:

   path.data: /home/esdata/part1, /home/esdata/part2, /home/esdata/part3, /home/esdata/part4, /home/esdata/part5, /home/esdata/part6, /home/esdata/part7, /home/esdata/part8, /home/esdata/part9, /home/esdata/part10, /home/esdata/part11, /home/esdata/part12, /home/esdata/part13, /home/esdata/part14, /home/esdata/part15, /home/esdata/part16, /home/esdata/part17, /home/esdata/part18, /home/esdata/part19, /home/esdata/part20, /home/esdata/part21, /home/esdata/part22, /home/esdata/part23, /home/esdata/part24, /home/esdata/part25

After inserting a huge amount of data (around 7.4 GB), I checked my data directories to see the distribution pattern. I got the following output:
[screenshot: per-directory sizes showing the data unevenly distributed across the configured paths]

I am using Elasticsearch-0.90.3. My Elasticsearch cluster has a single node and my index has a single shard. It is clear from the screenshot that my data is unevenly distributed among the directories. Is there any configuration option by which I can ensure even data distribution among all the configured directories?
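
For reference, the per-directory sizes above can be reproduced with a short Python sketch like the following (the paths are the ones configured above; this is purely illustrative):

   import os

   def dir_size(path):
       # Sum the sizes of all regular files under path, in bytes.
       total = 0
       for root, _dirs, files in os.walk(path):
           for name in files:
               total += os.path.getsize(os.path.join(root, name))
       return total

   for i in range(1, 26):
       path = "/home/esdata/part%d" % i
       print(path, dir_size(path))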

@aholbreich

This functionality is quite interesting because it can potentially improve the I/O throughput of ES on machines with several disks, but there is a lack of documentation on it. What is the pattern of distribution between the locations? Is one shard split over them? Or can one shard only go to one data.path?

@antonbormotov

antonbormotov commented Oct 20, 2017

According to the v2.0 breaking changes, a specific shard goes to a certain data path.
Check this issue as well: #9498

@dakrone
Member

dakrone commented Oct 20, 2017

Or can one shard only go to one data.path?

Yes, that's correct. A shard will be entirely on one data path; multiple shards are distributed across different data paths.
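
So with a single-shard index, all of the data ends up under one path; spreading data over the configured paths requires more shards. As an illustration only (host, index name, and shard count are example values, not anything prescribed in this thread), an index with several primary shards can be created through the standard create-index API, for example with Python's standard library:

   import json
   import urllib.request

   # Illustrative sketch: create an index with 4 primary shards so they can be
   # spread across the configured data paths.
   body = json.dumps({"settings": {"index": {"number_of_shards": 4}}}).encode("utf-8")
   req = urllib.request.Request(
       "http://localhost:9200/my_index",
       data=body,
       method="PUT",
       headers={"Content-Type": "application/json"},
   )
   print(urllib.request.urlopen(req).read().decode("utf-8"))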

williamrandolph pushed a commit to williamrandolph/elasticsearch that referenced this issue Jun 4, 2020