Non-moving segment allocation strategy might be leading to fragmentation

changed the description

assigned to @teo

changed the description

Could you do some analysis with ghc-debug to check the amount of fragmentation?

Yeah that sounds good. I'll do something like that hopefully this/next week

I'm gathering some data on this using ghc-debug.

I'm doing this by labelling each free block small or big based on whether it's in a free group of less than 15 blocks or not.

Here are my results so far:

live blocks	total blocks	small	big
131001	418816	75680	210499
478184	2004480	253998	1264468
748726	3449344	290613	2396531
1037579	5251584	387682	3805809

And in the form of a graph:

As you can see, small blocks that cannot be used by the non-moving GC make up quite a bit of space, approx 10-20% of total. But the vast majority of fragmentation isn't coming from that source. It just seems to be common-or-garden fragmentation.

I'll keep running some more test cases and see if this changes on bigger heaps. But so far it seems to me like this specific source of fragmentation isn't a high priority issue.

It's strange but it looks like most megablocks have a bunch of non-moving segments in the middle and then plenty of space (around 80ish) block at the start and end.

Here's some more data:

As you can see the small gaps amount to less than 10% of total memory usage. But general fragmentation is quite high.

live	total	small	big
184910	218112	16359	16843
287570	402176	41233	73373
330646	485888	55852	99390
356826	633344	65372	211146
551176	1017856	104265	362415
501486	1123584	111088	511010
655133	1486848	130528	701187
947464	1956608	194375	814769
987185	2305280	210088	1108007
1101904	2730496	236433	1392159
1195574	3107840	254522	1657744
1332443	3534592	295726	1906423
1578214	4252416	345145	2329057
1618509	4616448	360146	2637793
1676187	4911104	376935	2857982
1587992	5085696	370824	3126880
1771691	5473536	402382	3299463
1827270	5812736	431117	3554349
1861145	6181632	439558	3880929
2259238	6707712	512793	3935681
2309672	7254784	523616	4421496
2606380	7843328	601112	4635836
2644519	8444672	621293	5178860
2816185	9019392	673626	5529581
2726459	9551616	678416	6146741
3047845	10162432	767349	6347238
3062144	10710272	754068	6894060
2962559	11115264	787508	7365197
3179979	11281408	805430	7295999
3746406	12244736	931399	7566931
3167541	12415744	858189	8390014
4103549	13189120	1013392	8072179
4244075	13257472	1021921	7991476
3655042	13606912	966848	8985022
3501572	13810432	937536	9371324
4015827	14162688	1072941	9073920
4231291	14750464	1135328	9383845
4484000	15640576	1192378	9964198
4648291	16192768	1243090	10301387
4718051	16195072	1241413	10235608
5032282	17058560	1324353	10701925

Here is a graph of memory usage generated with eventlog2html:

What's very confusing is why megablock usage keeps going up despite block usage being nowhere near the limit. As we can see from the other data I've posted in this thread, we have plenty of free blocks and plenty of free blocks that the non-moving allocator should be able to use...

Ok I've been looking around in gdb and it seems like my app is allocating a lot of big pinned byte arrays. Presumably they don't live for long so they don't turn up in profiles or the heap graph, but they must be allocated so they require allocating a megablock.

So, that explains the very high fragmentation I think.

That still leaves the 10%ish waste from the aligning logic

10% certainly sounds worth addressing. Implementing a segment allocator building upon the megablock allocator sounds like a worthwhile thing to try. It is not clear to me that the overheads involved will be worth worrying about for most programs which reach a steady state of allocation.

I've fixed the pinned ByteString based fragmentation and re-run my ghc-debug script. Now we see that this small block group fragmentation seems to slowly increase as a percentage of total memory it starts around 10% and grows to about 23% by the end of my data.

live	total	small	big
211312	236288	21790	3186
257032	303616	40984	5600
334928	412928	63633	14367
442348	555008	96541	16119
509421	644864	118272	17171
573928	742144	139882	28334
651953	841216	162044	27219
745165	976128	196642	34321
855837	1114624	230465	28322
930814	1225728	256821	38093
1038814	1385216	287765	58637
1154061	1544704	327829	62814
1208916	1626880	344432	73532
1403073	1884416	407286	74057
1489833	1999104	433063	76208
1604410	2148352	470496	73446
1703020	2303232	503084	97128
1835437	2480128	545165	99526
1891365	2557440	569917	96158
2026550	2721536	612267	82719
2198937	2982400	665594	117869
2319906	3171328	706979	144443
2387206	3221760	733486	101068
2508103	3405824	777689	120032
2614502	3595264	809822	170940
2687222	3634688	837202	110264
2899876	3928064	918790	109398
3052992	4161536	961673	146871
3251467	4428032	1029962	146603
3423483	4671232	1087983	159766
3596941	4906752	1155673	154138
3801897	5197056	1215758	179401
4029231	5517312	1304312	183769
4326727	5944064	1402505	214832
4564580	6315008	1484994	265434

removed needs triage label

added Ttask label

mentioned in issue #24246

added runtime perf label

I'm gonna start working on this.

Here's what I'm thinking:

have a list of free segments owned by the nonmoving generation
if the list is empty when we want to allocate a segment, then request a new megablock and cut it up into segment sized block groups.
when we free a segment, we add the block group to our list. (we keep the current 16 segments are cached for each size logic, for now)
after sweeping, we sort/traverse the list and return any fully unused megablocks to the allocator. (hopefully I can reuse some existing logic to do this)

I think this is all quite uncontroversial but I thought I'd sketch a design before starting work in earnest

mentioned in issue #24492 (closed)

mentioned in commit bff85001

mentioned in merge request !12152 (closed)

mentioned in commit c0cddc18

mentioned in commit 8680656b

mentioned in commit 447752c9

mentioned in commit dbd8b095

This strategy sounds similar to what we do to avoid creating fragmentation with pinned blocks. See pinned_object_block in the RTS.

mentioned in commit 8d638a38

mentioned in commit 70f445cd

mentioned in commit 29a06cc9

mentioned in commit 0a3717b5

mentioned in commit 1757ae34

mentioned in commit 8d857b78

mentioned in commit f89868f8

mentioned in commit 79813509

mentioned in commit d094d8db

mentioned in commit 5e36e7ce

mentioned in commit d2bca83f

mentioned in commit d0dc8d92

closed with commit b38dcf39

mentioned in commit b38dcf39

mentioned in commit c4ad0034

mentioned in merge request !13023 (closed)

mentioned in commit c9799fe6

mentioned in commit 8958612c

Non-moving segment allocation strategy might be leading to fragmentation

Summary

Issue

Proposed solution

Implementation

Child items ...

Activity