Ultra Kernel Samepage Merging
This repo contains the latest UKSM patches.
The current release number: 0.1.2.6
This release includes two bug fixes from Huawei. Many thanks to their engineers, and especially to @colo-ft, who submitted the patches.
Changelog for all versions is in Documentation/vm/uksm.txt.
Ultra KSM. Copyright (C) 2011-2016 Nai Xia
This is an improvement upon KSM. Some basic data structures and routines
are borrowed from ksm.c.
Its new features:
Full system scan:
It automatically scans all user processes’ anonymous VMAs. Kernel-user
interaction to submit a memory area to KSM is no longer needed.
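For contrast, the kernel-user interaction that stock KSM requires is an madvise(2) call with MADV_MERGEABLE on each region. The following is a minimal user-space sketch of that opt-in step; the helper name map_mergeable is made up for illustration, and the call may fail on kernels built without KSM support.

```c
#define _GNU_SOURCE
#include <assert.h>
#include <stddef.h>
#include <sys/mman.h>

/*
 * Map `len` bytes of anonymous memory and ask stock KSM to consider the
 * region for merging. Returns the mapping, or NULL if mmap() fails.
 * `*advised` is set to 1 if madvise(MADV_MERGEABLE) succeeded and to 0
 * otherwise (for example on a kernel built without CONFIG_KSM).
 * map_mergeable is a hypothetical helper name, not part of any API.
 */
static void *map_mergeable(size_t len, int *advised)
{
    assert(len > 0 && advised != NULL);

    void *area = mmap(NULL, len, PROT_READ | PROT_WRITE,
                      MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (area == MAP_FAILED)
        return NULL;

    /* This explicit opt-in is the step that a full system scan removes. */
    *advised = (madvise(area, len, MADV_MERGEABLE) == 0);
    return area;
}
```

With UKSM's full system scan, an application would simply map its memory and skip the madvise() call.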
Rich area detection:
It automatically detects rich areas containing abundant duplicated
pages. Rich areas are scanned at full speed. Poor areas are
sampled at a reasonable speed with very low CPU consumption.
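The rich/poor distinction can be pictured as a per-area decision driven by how many duplicates the last sampling round found. The sketch below is only an illustration of that idea: the struct, the function name, and all three percentages are assumptions and are not taken from uksm.c.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical per-area bookkeeping; these names are not from uksm.c. */
struct area_stats {
    unsigned long pages_sampled; /* pages examined in the last round */
    unsigned long dups_found;    /* of those, how many had a duplicate */
};

/* Illustrative constants only; the real tuning values are not stated here. */
enum {
    SAMPLE_PERCENT = 5,          /* share of a poor area scanned per round */
    FULL_PERCENT = 100,          /* share of a rich area scanned per round */
    RICH_THRESHOLD_PERCENT = 20  /* duplicate ratio that marks an area rich */
};

/* Return the percentage of the area's pages to scan in the next round. */
static unsigned int next_scan_percent(const struct area_stats *s)
{
    assert(s != NULL && s->dups_found <= s->pages_sampled);

    if (s->pages_sampled == 0)
        return SAMPLE_PERCENT; /* nothing known yet: keep sampling lightly */

    /* Integer form of dups_found / pages_sampled >= threshold / 100. */
    if (s->dups_found * 100 >= s->pages_sampled * RICH_THRESHOLD_PERCENT)
        return FULL_PERCENT;

    return SAMPLE_PERCENT;
}
```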
Ultra Per-page scan speed improvement:
A new hash algorithm is proposed. As a result, on a machine with a
Core(TM)2 Quad Q9300 CPU in 32-bit mode and 800MHz DDR2 main memory, it
can scan memory areas that do not contain duplicated pages at
627MB/sec to 2445MB/sec and can merge duplicated areas at
477MB/sec to 923MB/sec.
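To relate these throughput figures to per-page work, they can be converted into pages per second. The small helper below does that arithmetic under two assumptions that the text does not state: that 1 MB means 2^20 bytes and that pages are 4 KiB.

```c
#include <assert.h>
#include <limits.h>

/* Assumptions, not stated in the text: 1 MB = 2^20 bytes, 4 KiB pages. */
enum { BYTES_PER_MB = 1024 * 1024, PAGE_BYTES = 4096 };
enum { PAGES_PER_MB = BYTES_PER_MB / PAGE_BYTES }; /* 256 */

/* Convert a throughput in MB/sec into pages/sec. */
static unsigned long pages_per_sec(unsigned long mb_per_sec)
{
    assert(mb_per_sec <= ULONG_MAX / PAGES_PER_MB); /* guard against overflow */
    return mb_per_sec * PAGES_PER_MB;
}
```

Under these assumptions, the scan range of 627MB/sec to 2445MB/sec corresponds to 160512 to 625920 pages per second, and the merge range of 477MB/sec to 923MB/sec to 122112 to 236288 pages per second.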
Thrashing area avoidance:
A thrashing area (a VMA that has frequent KSM page break-outs) can be
filtered out. My benchmark shows this is more efficient than KSM’s per-page
hash value based volatile page detection.
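One way to picture per-VMA filtering is to compare how many merged pages were later broken out again by writes against how many were merged in total. The sketch below is an assumption-laden illustration: the counters, the function name, and the threshold parameter are hypothetical and do not describe the actual UKSM heuristic.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical per-VMA counters; these names are not from uksm.c. */
struct vma_merge_stats {
    unsigned long pages_merged; /* pages merged in the observation window */
    unsigned long pages_broken; /* merged pages later split again by writes */
};

/*
 * Return true if more than `max_break_percent` of the merged pages were
 * broken out again, in which case the whole VMA would be skipped.
 */
static bool is_thrashing(const struct vma_merge_stats *s,
                         unsigned int max_break_percent)
{
    assert(s != NULL && max_break_percent <= 100);

    if (s->pages_merged == 0)
        return false; /* no merges yet, so nothing to judge */

    /* Integer form of pages_broken / pages_merged > max_break_percent / 100. */
    return s->pages_broken * 100 > s->pages_merged * max_break_percent;
}
```

Keeping the decision at VMA granularity means a single pair of counters covers every page in the area, rather than one stored hash value per page.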
Misc changes upon KSM:
Full Zero Page consideration (contributed by Figo Zhang)
uksmd now treats full zero pages as special pages and merges them into a
special unswappable uksm zero page.
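Detecting a full zero page amounts to checking that every byte of the page is zero. The following is a user-space sketch of one common way to do that check; the function name is made up for illustration, and this is not claimed to be how uksmd performs the test.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/*
 * Return true if all `page_size` bytes at `page` are zero.
 * is_full_zero_page is a hypothetical name used only for this sketch.
 */
static bool is_full_zero_page(const unsigned char *page, size_t page_size)
{
    assert(page != NULL && page_size > 0);

    /*
     * If the first byte is zero and every byte equals its successor,
     * then every byte is zero. memcmp() only reads, so the two
     * overlapping ranges are safe to compare.
     */
    return page[0] == 0 && memcmp(page, page + 1, page_size - 1) == 0;
}
```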
[FAST ‘18] UKSM: Swift Memory Deduplication via Hierarchical and Adaptive Memory Region Distilling [PDF] [Slides]