Question on stdlib cache? #19187

ramith · 2019-09-25T08:21:37Z

Description:
So I was looking at the cache implementation at :
https://github.com/ballerina-platform/ballerina-lang/blob/release-1.0.1/stdlib/cache/src/main/ballerina/src/cache/cache.bal

I have following questions:

I see there is synchronization in put() function but not in get function. is this intentional?
Further, if you look at line 173 to 175 (following code fragment), I would argue there can be a put() operation which could happen in parallel busting the logic. e.g. think about a scenario where !self.hasKey(key) becomes true and a put() happens before the return ();

      // Check whether the requested cache is available.
        if (!self.hasKey(key)) {
            return ();
        }

I suppose we can deduce a couple of more such scenarios just by looking at get() and put()

But I could be wrong given that I don't have a clear understanding of how ballerina's concurrency is handled.

Steps to reproduce:
N/A
Affected Versions:
v1.0, v 1.0.1
OS, DB, other environment details and versions:
MacOS, Jdk 1.8

Suggested Assignees (optional):
@wggihan @pubudu91 @a5anka

The text was updated successfully, but these errors were encountered:

a5anka · 2019-09-25T08:28:53Z

Another issue is that we are locking all the put() functions of multiple caches using the global object cacheMap. This adds an unnecessary performance overhead.

wggihan · 2019-09-25T10:09:29Z

Description:
So I was looking at the cache implementation at :
https://github.com/ballerina-platform/ballerina-lang/blob/release-1.0.1/stdlib/cache/src/main/ballerina/src/cache/cache.bal

I have following questions:

I see there is synchronization in put() function but not in get function. is this intentional?
Yes, this is intentional. The main idea behind the reason is to improve cache performance. Basically get doesn't need any handling in term of concurrently accessing.

Further, if you look at line 173 to 175 (following code fragment), I would argue there can be a put() operation which could happen in parallel busting the logic. e.g. think about a scenario where !self.hasKey(key) becomes true and a put() happens before the return ();
      // Check whether the requested cache is available.
        if (!self.hasKey(key)) {
            return ();
        }

IINM, cache misses can happen due to this. But since this is a simple cache. I believe that is acceptable.

I suppose we can deduce a couple of more such scenarios just by looking at get() and put()

But I could be wrong given that I don't have a clear understanding of how ballerina's concurrency is handled.

Steps to reproduce:
N/A
Affected Versions:
v1.0, v 1.0.1
OS, DB, other environment details and versions:
MacOS, Jdk 1.8

Suggested Assignees (optional):
@wggihan @pubudu91 @a5anka

wggihan · 2019-09-25T10:10:30Z

Another issue is that we are locking all the put() functions of multiple caches using the global object cacheMap. This adds an unnecessary performance overhead.

With the current implementation, we can't omit the global cache map.

chethiya · 2019-10-22T13:58:21Z

I assume original post is regarding data corruption that can occur due to race condition. But the given example there (i.e. hasKey) doesn't cause any such race conditions it seems. Ballerina map<> implementation seems to be thread safe (assuming it's implemented using MapValueImpl class)

But I think following line in get() of Cache can cause data races

            cacheEntry.lastAccessedTime = time:currentTime().time;

So this can cause cache to evict wrong keys in case the values in cacheEntry.lastAccessedTime get corrupted due to data races.

hasithaa assigned wggihan Sep 25, 2019

hasithaa added the Team/StandardLibs All Ballerina standard libraries label Sep 25, 2019

anupama-pathirage added the Type/Improvement label Sep 27, 2019

ldclakmal mentioned this issue Jan 29, 2020

Redesign Ballerina Cache API #20794

Closed

ldclakmal mentioned this issue Feb 28, 2020

Implement Ballerina Cache API (v2.0.0) [master] #21308

Merged

13 tasks

ldclakmal added this to the Ballerina 1.2.0 milestone Mar 11, 2020

wggihan closed this as completed in #21308 Mar 11, 2020

ldclakmal mentioned this issue Mar 12, 2020

Implement Ballerina Cache API (v2.0.0) #21051

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question on stdlib cache? #19187

Question on stdlib cache? #19187

ramith commented Sep 25, 2019 •

edited

a5anka commented Sep 25, 2019 •

edited

wggihan commented Sep 25, 2019

wggihan commented Sep 25, 2019

chethiya commented Oct 22, 2019

Question on stdlib cache? #19187

Question on stdlib cache? #19187

Comments

ramith commented Sep 25, 2019 • edited

a5anka commented Sep 25, 2019 • edited

wggihan commented Sep 25, 2019

wggihan commented Sep 25, 2019

chethiya commented Oct 22, 2019

ramith commented Sep 25, 2019 •

edited

a5anka commented Sep 25, 2019 •

edited