
check memory consumption #230

Closed
oblonski opened this issue Apr 9, 2016 · 31 comments
@oblonski
Member

oblonski commented Apr 9, 2016

and whether there is room to reduce it

@oblonski oblonski added the todo label Apr 9, 2016
@anurag1paul

Can you suggest any areas where we can focus to reduce memory consumption?

@rajendrauppal

@oblonski Marvelous library. I too have the same pain point: memory consumption is too high at present. Can you please suggest any areas where the memory footprint can be brought down? Is it already done for jsprit, or is it in the pipeline? Was profiling done for memory and time consumption? Would highly appreciate any pointers or starting points. Thanks!

@oblonski
Member Author

Hi, why is it a pain point? How do you use jsprit?

@PGWelch
Contributor

PGWelch commented Jun 24, 2016

@rajendrauppal Would you be able to run profiling using VisualVM (comes in the java jdk bin directory) to identify the part of the code causing the large memory consumption?

@oblonski I did once notice an out-of-memory exception running a larger problem (1000 jobs?) with limited memory available, but I'm afraid I didn't note where it was coming from, although as I recall it might have been connected with one of the ruin implementations. Do any of the ruin implementations (or a structure you use to record nearby jobs for ruins?) have O(n³) memory allocation? Or O(n²) but not using jagged arrays?

@oblonski
Member Author

oblonski commented Jun 24, 2016

There are various sources consuming significant memory.

  • The neighborhood is definitely something we should improve. There might be ways to a) compute it faster and b) compute it with less memory consumption. The neighborhood is used by the radial ruin strategy to select neighboring jobs faster.
  • Due to its generic nature, the StateManager uses Objects rather than primitive types.
  • In general we should look at Objects we can actually turn into primitives and
  • Maps/Sets we can turn into arrays (the latter not only makes it more memory-efficient but usually also much faster).

If memory really is or becomes an issue, we can also think about storing states and other attributes with a very low-level approach, as @karussell does in GraphHopper: he stores all attributes of an edge in a single long value using low-level bit operations. I think we should only do this if it is really an issue.

Edit: "From Java code to Java heap" is - I think - a nice article for better understanding the memory consumption of Objects, Maps/Sets, etc.
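
For illustration, a minimal sketch of that kind of bit packing (this is not GraphHopper's actual API, and the field layout and widths below are just assumptions): two attributes of a relation, e.g. distance and time, are packed into one long instead of being held as separate boxed objects.

    // Minimal bit-packing sketch; the 32/16-bit field widths are assumptions.
    public final class PackedAttributes {

        private static final long DISTANCE_MASK = 0xFFFFFFFFL; // lower 32 bits
        private static final long TIME_MASK = 0xFFFFL;         // 16 bits above them

        // distance in metres and time in seconds packed into a single long
        public static long pack(long distanceMetres, long timeSeconds) {
            return (distanceMetres & DISTANCE_MASK) | ((timeSeconds & TIME_MASK) << 32);
        }

        public static long distance(long packed) {
            return packed & DISTANCE_MASK;
        }

        public static long time(long packed) {
            return (packed >>> 32) & TIME_MASK;
        }
    }

The trade-off is readability and a hard upper bound per field, which is why this should only be done if memory really is the bottleneck.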

@oblonski
Member Author

oblonski commented Jun 24, 2016

This https://github.com/graphhopper/jsprit/blob/master/jsprit-core/src/main/java/com/graphhopper/jsprit/core/problem/VehicleRoutingProblem.java#L571 is a very good example where we can use an Array of Arrays since at this point every job has an index. It would make it more memory efficient and faster to retrieve activities.
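
To make the idea concrete, a hypothetical sketch (class and method names are made up for this example and are not the actual jsprit code): once every job has an index, the per-job activities can live in a plain array indexed by the job index, so a lookup is an array access rather than a HashMap lookup.

    import java.util.ArrayList;
    import java.util.List;

    // Hypothetical sketch: activities stored per job index in an array,
    // so a lookup is a plain array access instead of hashing the Job object.
    class ActivitiesByJobIndex<A> {

        private final List<A>[] activities;

        @SuppressWarnings("unchecked")
        ActivitiesByJobIndex(int nuOfJobs) {
            // slot 0 stays unused, assuming job indices start at 1 (assumption for this sketch)
            activities = new List[nuOfJobs + 1];
        }

        void put(int jobIndex, List<A> jobActivities) {
            activities[jobIndex] = jobActivities;
        }

        List<A> get(int jobIndex) {
            List<A> acts = activities[jobIndex];
            return acts != null ? acts : new ArrayList<>();
        }
    }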

@karussell
Member

@PGWelch What does memory-restricted mean? 2 GB? I think we should not invest time here if e.g. 4 GB is sufficient even for largish ~5k problems. Although on the other hand: a reduced memory usage often leads to a good speed-up (better caching, less GC, ...).

@PGWelch
Contributor

PGWelch commented Jun 24, 2016

Probably around 1 GB, although as I said I didn't record details at the time. I agree that there's no point reducing memory consumption if the benefits are low. Really we need some profiling and to find out the circumstances in which this becomes a problem ( @rajendrauppal ?), then make the call whether it's worth doing anything.

@anurag1paul

anurag1paul commented Jun 25, 2016

Hi,
We were trying to solve a problem with 1000 shipments (both pickup and delivery), 200 vehicles, multiple depots, and time window constraints. We were using an array-based cost matrix, with fast regret. We tried to solve this problem on different servers but faced an 'out of memory' error on servers with up to 16 GB of RAM. Finally we were able to solve it on an 8-core, 16 GB RAM, 4 GB swap server with a max heap size of 17.5 GB. We used 15 threads with 1024 iterations and got a run time of 29 minutes. The run time can be reduced further if we use more cores, but we were wondering if it is possible to bring memory usage down.

@anurag1paul

anurag1paul commented Jun 25, 2016

I used VisualVM to do a memory profile for a 200-shipment, 50-vehicle, MPMD CVRPTW problem using an array-based cost matrix, fast regret, and 1024 iterations. You can see the top memory-using objects in the attached image.
[attached image: memory profile]

@karussell
Member

We were using an array-based cost matrix, with fast regret.

How exactly does it look: int[][], double[][], or List<List<Double>>?

Can you find out where the many and large ArrayList$Itr instances come from?

@karussell
Member

Also, we would be interested in listing your use case here - let us know via email if you would like this too :)

@anurag1paul

anurag1paul commented Jun 25, 2016

We are basically creating the cost matrix as an object of class jsprit.core.util.FastVehicleRoutingTransportCostsMatrix; internally it is a double[noLocations][noLocations][2]:

    private Builder(int noLocations, boolean isSymmetric) {
        this.isSymmetric = isSymmetric;
        // one inner array of length 2 per (from, to) pair, holding distance and time
        matrix = new double[noLocations][noLocations][2];
    }

In my case, isSymmetric is false. I will try to find out where ArrayList$Itr comes from.
Regarding listing, I will contact you via mail.

@oblonski
Member Author

oblonski commented Jul 6, 2016

Commit a807024 optimizes memory consumption significantly. With 5000 services and 100 vehicles, it reduces the memory consumption of the neighborhood calculation from 800 MB to 50 MB. Additionally, it reduces computation time from 9 sec to 5 sec (which is - given the problem size - not that important ;)).

oblonski added a commit that referenced this issue Jul 6, 2016
@PGWelch
Contributor

PGWelch commented Jul 6, 2016

@anurag1paul (and @oblonski ) The code:

matrix = new double[noLocations][noLocations][2];

Looks to me like it allocates a single block of memory noLocations x noLocations x 2 x 8 bytes (for double) big. I believe this will create OutOfMemory exceptions a long time before the system actually runs out of memory, because of memory fragmentation - e.g. the system might have 1 GB available but the longest contiguous piece of free memory is only 10 MB.

To be safe, the allocation should be jagged:

matrix = new double[noLocations][][];
for (int i = 0; i < noLocations; i++) {
    matrix[i] = new double[noLocations][2];
}

@PGWelch
Contributor

PGWelch commented Jul 6, 2016

P.S. @oblonski - great work on bringing the neighbourhood memory consumption down. I think that was where I hit the out-of-memory error when running in a limited-memory environment.

@karussell
Member

I believe this will create OutOfMemory exceptions a long time before the system actually

And I doubt that it requests a contiguous block from the JVM - where did you read about this?

Also, for 5K locations this is just 25 MB, and for 10K it is 800 MB, where probably other tweaks like using float instead of double should be tried.

@PGWelch
Contributor

PGWelch commented Jul 6, 2016

@karussell The Java spec doesn't state whether it's a contiguous block or not, so it may well be contiguous, and if that is the case the non-jagged version of the allocation is potentially dangerous. I know in C++ this kind of allocation would be contiguous (so I'm mainly guessing based on parallels with C++).

I calculate 5K locations as 381 MB using 5000 * 5000 * 2 * 8 / (1024 * 1024), and 10K as 1.5 GB. How do you get 25 MB? Am I missing something obvious here?

@karussell
Member

Oops, missed the factor 8, but why 2?

Did you ever have an issue related to contiguous vs. non-contiguous allocation?

@oblonski
Member Author

oblonski commented Jul 7, 2016

I assume the 2 is related to time AND distance, i.e. you need to store two double values for each relation.

@PGWelch
Contributor

PGWelch commented Jul 7, 2016

I've had an issue with a contiguous 1D array in Java causing an out-of-memory error with a few hundred thousand entries, when there was easily enough memory available for it (just not contiguous memory). Of course new double[noLocations][noLocations][2] is multi-dimensional, but (AFAIK) the Java spec doesn't say whether this would be one contiguous block or not (so I would consider it dangerous). Memory fragmentation is a well-known issue in high-performance C++, but of course the Java memory model is different.

@karussell
Member

Just thought about the big matrix. For really big ones we could use a GraphHopper DataAccess object (or a similar on-disk approach) and access the matrix via memory mapping (or, if there is enough memory, via RAM). This way the access will be a bit slower, but the OS will make it faster for frequently used entries.
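
To sketch that idea with the standard library (this is not GraphHopper's DataAccess API, just the plain java.nio equivalent of the approach): the time/distance matrix lives in a file, and the OS pages frequently used entries into RAM. Note that a single mapping is limited to 2 GB, so a really large matrix would have to be split across several mappings.

    import java.io.IOException;
    import java.nio.DoubleBuffer;
    import java.nio.channels.FileChannel;
    import java.nio.file.Path;
    import java.nio.file.StandardOpenOption;

    // Rough sketch of the memory-mapping idea: the matrix is backed by a file,
    // and the OS keeps frequently used pages in RAM.
    class MappedCostMatrix {

        private final int noLocations;
        private final DoubleBuffer matrix;

        MappedCostMatrix(Path file, int noLocations) throws IOException {
            this.noLocations = noLocations;
            // note: a single mapping is limited to Integer.MAX_VALUE bytes (~2 GB)
            long bytes = (long) noLocations * noLocations * 2 * Double.BYTES;
            try (FileChannel channel = FileChannel.open(file,
                    StandardOpenOption.CREATE, StandardOpenOption.READ, StandardOpenOption.WRITE)) {
                matrix = channel.map(FileChannel.MapMode.READ_WRITE, 0, bytes).asDoubleBuffer();
            }
        }

        void set(int from, int to, double distance, double time) {
            int idx = (from * noLocations + to) * 2;
            matrix.put(idx, distance);
            matrix.put(idx + 1, time);
        }

        double getDistance(int from, int to) {
            return matrix.get((from * noLocations + to) * 2);
        }

        double getTime(int from, int to) {
            return matrix.get((from * noLocations + to) * 2 + 1);
        }
    }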

@PGWelch
Contributor

PGWelch commented Jul 11, 2016

Interesting. I think for problems that big, you'd need to apply clustering techniques anyway before running in jsprit (though the clustering techniques would need a big matrix).

@oblonski
Member Author

close this for now

@karussell
Member

@oblonski Maybe it could be interesting to comment on the 'large vrp' discussion too? Was it this one? https://discuss.graphhopper.com/t/very-large-vrp/766/5

@oblonski oblonski reopened this Aug 2, 2016
@oblonski
Member Author

oblonski commented Aug 2, 2016

Jari Kytöjoki, Teemu Nuortio, Olli Bräysy, and Michel Gendreau [p. 2748] proposed a way to reduce the memory consumption of storing a 20k x 20k time and distance matrix from 5.96 GB to 0.186-0.378 GB (case dependent).

@karussell
Member

karussell commented Aug 2, 2016

I would think 6 GB is okayish, but probably the matrix is not the (only) issue :)

@muzuro
Contributor

muzuro commented Sep 28, 2016

Maybe it is related to #282.

@rajendrauppal

@oblonski any tentative timeline on releasing this?

@oblonski
Member Author

oblonski commented Dec 7, 2016

It is partly contained in master already. We plan to make a new release this month.

@oblonski
Member Author

We made huge improvements here, also thanks to @muzuro. I'll close this for now.
