Various checks on malloc #1781

wiredfool · 2016-03-26T21:02:33Z

I'd like feedback on this.

i think this is a good approach -- it follows the other changes that we've made to check for overflow on the mallocs. The biggest change is switching to calloc(n,size) instead of malloc(size) in some cases -- calloc does the multiplication, (is supposed to) check for overflow, and zeros the memory. I don't think that the zeroing is ever a bad thing, and there are some reports of the performance of calloc zeroing the memory faster than the memset that we use in a few places.

i will probably rebase/squash this prior to merging.

coveralls · 2016-03-26T21:15:04Z

Coverage decreased (-0.03%) to 76.852% when pulling 0239830 on wiredfool:malloc_check into 34c80ef on python-pillow:master.

coveralls · 2016-04-01T10:20:02Z

Changes Unknown when pulling 0564fb8 on wiredfool:malloc_check into * on python-pillow:master*.

hugovk · 2016-04-03T09:22:05Z

_imaging.c

-    int n, i;
-    int bands;
+    Py_ssize_t n;
+    int i, bands;


Should i also be Py_ssize_t?

No, n is set to constant values well within the range of int in different locations in this function, and i counts up to that. The reason that n is a Py_ssize_t is that it's address is passed as an argument to getlist, which needs it to be a Py_ssize_t to match the result type of the python get sequence length function.

hugovk · 2016-04-03T09:32:59Z

👍 calloc rather than malloc sounds sensible.

wiredfool · 2016-04-19T09:23:57Z

undone -

~~rebase~~
~~check coverage to make sure we're hitting the checks.~~

wiredfool · 2016-06-08T12:48:29Z

@homm Can you take a look at this PR and comment?

homm · 2016-06-08T13:05:44Z

Ok. I will in couple of days.

…Length

…nt about leaking memory from prior to when we had the cleanup mechanisim

…er uses of xsize/ysize

homm · 2016-06-08T13:36:11Z

In general, I'm against replacing malloc with calloc everywhere just for the overflow check. calloc not only does the check, it also guarantees that memory is filled with zeroes, and sometimes this can be slower.

I will take a close look at every case later.

wiredfool · 2016-06-08T15:12:17Z

My argument for calloc is:

It is safer WRT overflow.
It eliminates an entire class of error with uninitialized memory.
It's not significantly slower, at least tests aren't taking a significantly different time.

I think that the only possible drawback is performance, but from my reading elsewhere calloc is likely faster than malloc + memset, and not significantly slower than malloc due to tricks that can be played with page mapping.

wiredfool · 2016-06-08T15:26:23Z

Also, when looking at this, I think there's a significant speed gain to be made for larger images by combining Storage.c:ImagingNewBlock and ImagingNewArray so that we're always allocating blocks rather than switching from one block at <16M and line by line for images >16M. For example, in the current layout if you had a 2047x2048 RGBA image, that would be one big allocation of ~16M. a 2049x2048 RGBA would be 2049 allocations of 8K each.

I think it would be relatively transparent to the rest of the code if: char *im->block became char ** im->blocks. We'd add a count of blocks in the struct, and fill the im->image as normal.

homm · 2016-06-08T15:33:37Z

so that we're always allocating blocks

Thinking about this long time, haven't implemented yet :)

You are right that it should be transparent for most of C code except the code which directly uses ImagingNewBlock (in map.c).

homm · 2016-06-08T15:40:00Z

I haven't researched, but my thought are optimal block size can be about:

block_size = 2 * 2**20
mem_size = ((block_size + line_size - 1) // line_size) * line_size

wiredfool · 2016-06-08T16:06:16Z

Looks like the only call to it is through the _imaging.c new_block interface, from ImageTk.py. Which bypasses the threshold check to ensure that they get one big chunk of memory to pass to tk.

So the comment in ImagingNewBlock is wrong, and we should probably be returning an error where we're returning NULL from the overflow check. And ImagingDelete might need to be called on im to prevent a leak.

homm · 2016-06-14T22:10:56Z

_imaging.c

@@ -376,12 +387,14 @@ getlist(PyObject* arg, int* length, const char* wrong_length, int type)
    }

    n = PyObject_Length(arg);
-    if (length && wrong_length && n != *length) {
+    if (length && wrong_length && n !=  *length) {


Inadvertent

homm · 2016-06-14T23:38:12Z

I've stopped in the middle. It's too large for one pass.

wiredfool · 2016-06-15T16:05:03Z

No worries.

I've done a quick benchmark on malloc vs calloc, where I opened and loaded a jpeg 100x, one of them was over the block limit and one was under. The difference between this branch (calloc) and master (malloc) is essentially lost in the noise. I'm seeing variances from run to run of ~1% with no clear pattern of one being faster. Calloc seems faster on the larger image, malloc on the smaller, but still pretty well within the noise level.

wiredfool · 2016-06-16T08:11:57Z

Comments addressed.

homm · 2016-06-20T23:56:32Z

Ok, I finished with the rest of changes.

wiredfool added Needs Review Do Not Merge labels Mar 26, 2016

wiredfool force-pushed the malloc_check branch from 0239830 to 0564fb8 Compare April 1, 2016 10:09

wiredfool modified the milestone: 3.3.0 Apr 1, 2016

hugovk reviewed Apr 3, 2016
View reviewed changes

wiredfool force-pushed the malloc_check branch 2 times, most recently from 0239830 to 29ab316 Compare April 15, 2016 18:51

wiredfool force-pushed the malloc_check branch from 29ab316 to 2f4e5f0 Compare April 19, 2016 18:14

wiredfool force-pushed the malloc_check branch 2 times, most recently from e88ca01 to bdd9b93 Compare May 24, 2016 12:32

wiredfool force-pushed the malloc_check branch from f0397f2 to c8bb1c5 Compare May 30, 2016 10:30

wiredfool removed the Do Not Merge label May 30, 2016

wiredfool added 11 commits June 8, 2016 06:21

mixed 8c tabs+spaces -> spaces

b1a190a

Change return type of PyPath_Flatten to Py_ssize_t to match PyObject_…

c589ae6

…Length

Malloc check, python-pillow#1715

52d60cd

Malloc check, realloc, python-pillow#1715

4b4ef5f

added cleanup to free dictionary memory in ZipEncode, fixes old comme…

49566b2

…nt about leaking memory from prior to when we had the cleanup mechanisim

Convert xsize/ysize to ints in function declarations to match all oth…

d48e5cd

…er uses of xsize/ysize

overflow check for im->linesize

768936f

Rework block allocator

54a9797

Replace SIZE_MAX with type specific _MAX

5369d8e

MSVC doesn't define UINT32_MAX

7660563

Malloc check merge/rebase

92a13d9

homm reviewed Jun 14, 2016
View reviewed changes

wiredfool added 5 commits June 16, 2016 00:52

extraneous space

b0ec525

added check to prevent arcs > 360 degrees

d0ae5bc

We're not actually multiplying out the bytes, only the indexes

ce57e6a

added comment closer to malloc

8aedf8b

removed redundant check

95f464f

wiredfool mentioned this pull request Jun 19, 2016

3.3.0 Release July 1, 2016 #1800

Closed

wiredfool merged commit bdd0a6a into python-pillow:master Jun 21, 2016

wiredfool deleted the malloc_check branch October 2, 2017 13:29

radarhere mentioned this pull request Nov 13, 2023

MemoryError with large georef image #7534

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Various checks on malloc #1781

Various checks on malloc #1781

wiredfool commented Mar 26, 2016

coveralls commented Mar 26, 2016

coveralls commented Apr 1, 2016

hugovk Apr 3, 2016

wiredfool Apr 3, 2016

hugovk commented Apr 3, 2016

wiredfool commented Apr 19, 2016 •

edited

Loading

wiredfool commented Jun 8, 2016

homm commented Jun 8, 2016

homm commented Jun 8, 2016

wiredfool commented Jun 8, 2016

wiredfool commented Jun 8, 2016

homm commented Jun 8, 2016

homm commented Jun 8, 2016

wiredfool commented Jun 8, 2016

homm Jun 14, 2016

wiredfool Jun 14, 2016

homm commented Jun 14, 2016

wiredfool commented Jun 15, 2016

wiredfool commented Jun 16, 2016

homm commented Jun 20, 2016

Various checks on malloc #1781

Various checks on malloc #1781

Conversation

wiredfool commented Mar 26, 2016

coveralls commented Mar 26, 2016

coveralls commented Apr 1, 2016

hugovk Apr 3, 2016

Choose a reason for hiding this comment

wiredfool Apr 3, 2016

Choose a reason for hiding this comment

hugovk commented Apr 3, 2016

wiredfool commented Apr 19, 2016 • edited Loading

wiredfool commented Jun 8, 2016

homm commented Jun 8, 2016

homm commented Jun 8, 2016

wiredfool commented Jun 8, 2016

wiredfool commented Jun 8, 2016

homm commented Jun 8, 2016

homm commented Jun 8, 2016

wiredfool commented Jun 8, 2016

homm Jun 14, 2016

Choose a reason for hiding this comment

wiredfool Jun 14, 2016

Choose a reason for hiding this comment

homm commented Jun 14, 2016

wiredfool commented Jun 15, 2016

wiredfool commented Jun 16, 2016

homm commented Jun 20, 2016

wiredfool commented Apr 19, 2016 •

edited

Loading