High performance local storage #1242

Open
lomori opened this issue Apr 23, 2020 · 20 comments
Labels: kind/feature (Feature request, new feature), kind/question (Please use `discussion` to ask questions instead)

Comments

lomori commented Apr 23, 2020

Hi,

Let's say I have a high-performance storage drive, e.g. an NVMe SSD, attached to a host. We can mount it somewhere and configure Longhorn to use it (a configuration sketch follows the questions below). A few questions:

  1. Since replicas are done synchronously, to obtain the best performance possible, we have to set the number of replicas to 1, right? Otherwise, network traffic would slow down access.
  2. Is there any particular reason replicas are done synchronously? If a Longhorn volume is RWO, there won't be anybody on a different node accessing a replica. Or could there be?
  3. Do you have an idea of the overhead Longhorn imposes on local disk access?
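To make the setup concrete, here is a rough sketch of registering such a mount as an extra Longhorn disk; the node name `node-1`, the disk name `nvme-disk`, and the mount point `/mnt/nvme` are just examples, and the same thing can be done through the Longhorn UI instead:

```bash
# Assumes the NVMe drive is already formatted and mounted at /mnt/nvme on node "node-1".
# Open the Longhorn Node resource for editing...
kubectl -n longhorn-system edit nodes.longhorn.io node-1

# ...then add an entry like this under spec.disks so Longhorn can schedule replicas onto it:
#   nvme-disk:
#     path: /mnt/nvme
#     allowScheduling: true
#     storageReserved: 0
```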

Regards,
Luiz

@lomori lomori added the kind/question label Apr 23, 2020
stale bot commented Jun 22, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the wontfix label Jun 22, 2020
unixfox commented Jun 22, 2020

I'm still interested in this feature.

@yasker yasker removed the wontfix label Jun 22, 2020
stale bot commented Aug 22, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the wontfix label Aug 22, 2020
unixfox commented Aug 22, 2020

I'm still interested in this feature.

@yasker yasker removed the wontfix label Aug 25, 2020
yasker (Member) commented Aug 25, 2020

We have recently published a performance report at https://longhorn.io/blog/performance-scalability-report-aug-2020/

  1. Since replicas are done synchronously, to obtain the best performance possible, we have to set the number of replicas to 1, right? Otherwise, network traffic would slow down access.

For the best write performance, yes, the number of replicas should be 1. But more replicas can provide better read performance.
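
As a concrete illustration of the single-replica setting, here is a minimal StorageClass sketch, assuming the standard Longhorn CSI provisioner name `driver.longhorn.io`; the class name is arbitrary, and `numberOfReplicas` and `staleReplicaTimeout` are standard Longhorn StorageClass parameters:

```bash
# Minimal sketch: a StorageClass whose volumes keep a single replica,
# trading redundancy for the best write performance.
kubectl apply -f - <<'EOF'
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: longhorn-single-replica
provisioner: driver.longhorn.io
parameters:
  numberOfReplicas: "1"
  staleReplicaTimeout: "2880"
EOF
```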

  2. Is there any particular reason replicas are done synchronously? If a Longhorn volume is RWO, there won't be anybody on a different node accessing a replica. Or could there be?

Because we need to ensure crash consistency. Replicating asynchronously means the software acknowledges the request before it is completely written to the disk, e.g. the data is stored in memory or some local cache. So in the case of a node failure, any data in that cache is lost without the pod knowing, because it thought the data had already been written. Replicating synchronously is normally not a problem for the storage system, because Linux is designed to issue multiple requests at the same time, as determined by the IO depth.
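
To make the IO depth point concrete, here is a rough fio sketch (illustrative, not from the report); `/mnt/test` is assumed to be the mount point of a Longhorn volume:

```bash
# Queue depth 1: every 4k write waits for the previous one to be acknowledged,
# so the synchronous replication latency is fully visible in the result.
fio --name=qd1 --filename=/mnt/test/fio.dat --rw=randwrite --bs=4k --size=1g \
    --ioengine=libaio --direct=1 --iodepth=1 --runtime=60 --time_based

# Queue depth 32: many requests are in flight at once, so per-request latency
# is largely hidden and aggregate IOPS gets much closer to the disk's capability.
fio --name=qd32 --filename=/mnt/test/fio.dat --rw=randwrite --bs=4k --size=1g \
    --ioengine=libaio --direct=1 --iodepth=32 --runtime=60 --time_based
```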

  3. Do you have an idea of the overhead Longhorn imposes on local disk access?

Yes, see https://longhorn.io/blog/performance-scalability-report-aug-2020/.

We're still working on various enhancements for performance, e.g. #508. Also, #1045 might have some impact on performance as well, though it's mainly a stability feature.

t3hmrman commented Aug 27, 2020

Hey @yasker, I just got a chance to watch your presentation (webinar) from earlier this year and took a look at the recently released perf/scalability report as well.

The numbers are pretty staggering: IOPS at 20%-30% of baseline (so a reduction to roughly a fifth or a third) is a pretty large hit. I totally understand the value of crash consistency and the instant migration it enables (it means Longhorn just works for most workloads, with no fears when failovers occur), but I am wondering whether there are any plans to allow looser/configurable consistency levels in the future?

For example, in a situation where I'm running a single Postgres instance with a single "sync" replica (on the local node) for performance and 2 "async" replicas, I might be willing to take the trade-off of potentially seconds or minutes of data loss in exchange for that 5x increase in IOPS, if entire nodes going down (especially nodes dedicated to running databases) is relatively rare. Trade-offs like this get even easier to make in architectures fronted by some sort of upstream WAL mechanism (e.g. Kafka), where I know that relatively recent state could be recovered, and the old state (as of the last asynchronous replication) would save some time when failing over.

yasker (Member) commented Aug 27, 2020

@t3hmrman Yes, we have in fact considered having one local "sync" replica with crash consistency plus multiple "async" replicas to help with performance. But we haven't put it on the schedule yet because:

  1. It would be pretty complex from a technical perspective.
  2. As you can see from https://longhorn.io/blog/performance-scalability-report-aug-2020/#benchmark-result, the main bottleneck right now is not the multiple replicas but the efficiency of a single replica. That's what we are aiming to improve first, via e.g. [FEATURE] Provide a choice to replace revision counter #508 and [POC] SPDK #760. So adding an async kind of replica won't help much until we improve the sync replica first.

We're aiming to include #508 in v1.1.0. We will publish an updated perf/scalability report then.

@t3hmrman

@yasker Thanks for the explanation and I appreciate the hard work.

stale bot commented Oct 27, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the wontfix label Oct 27, 2020
t3hmrman commented Nov 2, 2020

Still interested

@stale stale bot removed the wontfix label Nov 2, 2020
stale bot commented Jan 1, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the wontfix label Jan 1, 2021
unixfox commented Jan 1, 2021

bump

@stale stale bot removed the wontfix label Jan 1, 2021
stale bot commented Feb 12, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the wontfix label Feb 12, 2021
@t3hmrman

bump

@stale stale bot removed the wontfix label Feb 12, 2021
@joshimoo joshimoo added the kind/feature label Feb 12, 2021
@stale stale bot added the wontfix label Mar 19, 2021
unixfox commented Mar 19, 2021

Bump

@stale stale bot removed the wontfix label Mar 19, 2021
@longhorn longhorn deleted a comment from stale bot Mar 20, 2021
@joshimoo joshimoo added this to the Planning milestone Apr 14, 2021
@joshimoo (Contributor)

I have added this to planning, to keep the bot from closing it :)
We are definitely interested in improving the IO performance for Longhorn in the future.

gp187 commented Apr 19, 2022

Any fixes?

@minhnguyenvan95

thumbs up

minhnguyenvan95 commented Apr 21, 2022

I have tested Longhorn vs hostpath vs NFS, uploading a 133 MB file through WAF -> Ocelot -> .NET app:

| Upload path   | With Longhorn | No Longhorn | hostpath  | NFS        |
| ------------- | ------------- | ----------- | --------- | ---------- |
| WAF           | 20.057172s    | 7.311976s   | 7.419208s | 11.251932s |
| Direct Ocelot | 18.607835s    | 3.974725s   | 4.153185s | 12.396772s |
| Direct app    | 15.613427s    | 2.080060s   | 1.802004s | 8.722210s  |
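
A hypothetical way to reproduce a timing comparison like this with curl; the actual endpoints and upload form field were not specified above, so the URLs and field name here are invented for illustration:

```bash
# Create a ~133 MB test file.
dd if=/dev/urandom of=upload.bin bs=1M count=133

# Time the same multipart upload against each entry point and print total seconds.
for url in https://waf.example.com/upload \
           https://ocelot.example.com/upload \
           https://app.example.com/upload; do
  curl -s -o /dev/null -w "${url}: %{time_total}s\n" -F "file=@upload.bin" "$url"
done
```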

@t3hmrman

It's pretty impressive for Longhorn to be within shouting distance of NFS! NFS has had a lot of man hours poured into it. Excited to see this project get even better and faster!

@innobead innobead modified the milestones: Planning, Backlog Sep 14, 2023