author | Catalin Marinas | 2015-06-25 01:58:37 +0200
committer | Linus Torvalds | 2015-06-25 02:49:45 +0200
commit | 93ada579b0eea06f808aef08ead64bb230fb7851 (patch)
tree | 83a4e03631e64f0c120b0a6a358822e74c7233fc /mm/mmap.c
parent | mm: kmemleak: avoid deadlock on the kmemleak object insertion error path (diff)
mm: kmemleak: optimise kmemleak_lock acquiring during kmemleak_scan
The kmemleak memory scanning uses fine-grained object->lock spinlocks,
primarily to avoid races with memory block freeing. However, the
pointer lookup in the rb tree requires the kmemleak_lock to be held.
This is currently done in the find_and_get_object() function for each
pointer-like location read during scanning. While this keeps latency
low for kmemleak_*() callbacks on other CPUs, it slows the memory
scanning down.
This patch moves the kmemleak_lock outside the scan_block() loop,
acquiring/releasing it only once per scanned memory block. The
allow_resched logic is moved outside scan_block() and a new
scan_large_block() function is implemented which splits large blocks
into MAX_SCAN_SIZE chunks with cond_resched() calls in between. A
redundant (object->flags & OBJECT_NO_SCAN) check is also removed from
scan_object().
With this patch, the kmemleak scanning performance is significantly
improved: at least 50% with lock debugging disabled and over an order
of magnitude with lock proving (lockdep) enabled (on an arm64 system).
Signed-off-by: Catalin Marinas <catalin.marinas@arm.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
Diffstat (limited to 'mm/mmap.c')
0 files changed, 0 insertions, 0 deletions