Plus, if you repeatedly call malloc/free for chunks of widely varying, very large sizes, malloc will gladly fragment the heap to the point where it becomes less efficient. This is due in part to the fact that some OS flavors may reclaim memory after a free call, especially if other processes are asking for memory chunks. NUMA also plays into big-chunk operations.
Several years ago we ran a test on a non-prod Solaris 10 box with 64GB of memory. We malloced one single giant chunk up front and never called malloc again, reusing that chunk over and over for buffers of varying sizes. When we added the malloc/free calls back in, so that every operation got a "new" chunk, the same test code ran about 15% slower and spent most of that extra time in kernel mode.
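The reuse pattern was essentially this (a minimal sketch, not the original test code; POOL_SIZE and process() are just placeholders for the worst-case buffer size and the real per-buffer work):

    #include <stdlib.h>
    #include <string.h>

    #define POOL_SIZE (256UL * 1024 * 1024)   /* sized for the largest buffer you will ever need */

    static void process(char *buf, size_t len)
    {
        memset(buf, 0xAB, len);               /* stand-in for the real work */
    }

    int main(void)
    {
        char *pool = malloc(POOL_SIZE);       /* one malloc, up front */
        if (!pool)
            return 1;

        for (size_t i = 0; i < 10000; i++) {
            size_t len = (i % 1000 + 1) * 1024;   /* varying sizes, all <= POOL_SIZE */
            process(pool, len);               /* reuse the same chunk every time */
        }

        free(pool);                           /* one free, at the end */
        return 0;
    }

The slow variant is the same loop with a malloc(len)/free() pair inside it, which is where the extra kernel time showed up.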
NUMA really slows down access to large memory allocations because of locality issues: the system cannot relocate gigantic memory chunks to a more convenient node. Since you have a commodity CPU (multicore x86), NUMA is a concern.
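If you are on Linux, libnuma gives you some control over where a big chunk ends up. A rough sketch, assuming libnuma is installed (link with -lnuma); the test above was Solaris, which has its own locality APIs:

    #include <numa.h>      /* libnuma */
    #include <stdio.h>

    int main(void)
    {
        size_t size = 1UL << 30;                /* 1 GB, for illustration */

        if (numa_available() < 0) {
            fprintf(stderr, "no NUMA support on this box\n");
            return 1;
        }

        /* place the chunk on the node local to the calling thread,
           so the work does not go through a remote memory controller */
        void *chunk = numa_alloc_local(size);
        if (!chunk)
            return 1;

        /* ... run the memory-heavy work from threads on that same node ... */

        numa_free(chunk, size);
        return 0;
    }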
You need to look into CPU affinity for threads.
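On Linux/glibc that looks roughly like this (pthread_setaffinity_np is non-portable; Solaris has processor_bind() instead). Combined with the node-local allocation above, it keeps each thread next to the memory it touches:

    #define _GNU_SOURCE
    #include <pthread.h>
    #include <sched.h>
    #include <stdio.h>

    static int pin_to_cpu(int cpu)
    {
        cpu_set_t set;
        CPU_ZERO(&set);
        CPU_SET(cpu, &set);

        /* pin the calling thread; each worker thread would pin itself similarly */
        return pthread_setaffinity_np(pthread_self(), sizeof(set), &set);
    }

    int main(void)
    {
        if (pin_to_cpu(0) != 0) {
            fprintf(stderr, "failed to pin thread to CPU 0\n");
            return 1;
        }
        /* ... do the memory-heavy work here, staying on CPU 0 ... */
        return 0;
    }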
If you are reading from and then writing to memory regions that are far apart, pay attention to the order in which you touch neighboring memory, rather than doing something like copying the contents of arr[0] into arr[2000000] and then reading arr[1000000]. Each of those actions can mean reloading the L2 cache, as an example. These days, memory is an order of magnitude or more slower than your CPUs.
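A classic illustration of the same point (a toy example, not your code): both loops below touch exactly the same 64MB, but the second one jumps 16KB between consecutive accesses, so nearly every access pulls in a fresh cache line:

    #include <stdlib.h>

    #define ROWS 4096
    #define COLS 4096

    int main(void)
    {
        int *a = malloc((size_t)ROWS * COLS * sizeof *a);
        if (!a)
            return 1;

        /* cache-friendly: walk memory in the order it is laid out */
        for (int r = 0; r < ROWS; r++)
            for (int c = 0; c < COLS; c++)
                a[r * COLS + c] = r + c;

        /* cache-hostile: same work, but each step lands COLS * sizeof(int) = 16KB away */
        for (int c = 0; c < COLS; c++)
            for (int r = 0; r < ROWS; r++)
                a[r * COLS + c] = r + c;

        free(a);
        return 0;
    }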
Edit: You really should consider this article:
http://www.akkadia.org/drepper/cpumemory.pdf
It is somewhat old, but still completely applicable.