#moarvm on 27 March 2020 - Raku Programming Language Log

github.com/moarvm/moarvm | IRC logs at colabti.org/irclogger/irclogger_logs/moarvm Set by AlexDaniel on 12 June 2018.
00:37 Kaiepi left, Kaiepi joined 01:08 dogbert17 joined 01:11 dogbert11 left 02:35 JessiWilde joined 02:39 JessiWilde left 02:44 [Coke] joined 05:02 linkable6 joined, evalable6 joined 07:40 Kaiepi left 07:41 Kaiepi joined
nine	It looks more and more like my memory leaks are in fact a rakudo issue rather than caused by Inline::Perl5	08:09
MasterDuke	rakudo or moarvm?	08:13
nine	Well rakudo includes moarvm and the leak could be caused by anything in there	08:20
	Now the question is: how do I find out what those things are that are never freed?	08:22
	heaptrack could tell me if I could disable the nursery (thus keeping us from moving stuff around in memory)
MasterDuke	i feel that's a setup for a whole bunch of jokes, but i think they'd all be pretty dark (re finding things that are never freed)	08:23
	didn't you do a whole bunch of related experiments recently? setting a giant gen2 or something like that?	08:24
nine	Well I did spend a lot of time with the GC, indeed.	08:25
	Actually disabling the nursery seems trivial. I just have to override the allocate_in_gen2 check in MVM_gc_allocate. But then I need a way to trigger a GC run (which is usually done when the nursery is full), otherwise it'll show all memory as leaked.	08:26
MasterDuke	nqp::force_gc(), right?	08:27
nine	Ha! What an excellent idea :)	08:29
MasterDuke	jnthn: talking about heaptrack reminds me that i checked what valgrind reports as leaking when doing `raku --full-cleanup -e ''`. most seem to be spesh related. there are a couple small things, but then two big ones (12k and 411k) seem to be spesh worker related	08:33
	the comment at github.com/MoarVM/MoarVM/blob/mast...#L560-L562 suggests workers are being cleaned up, but is there more that should be done?	08:35
	oh, and if i run valgrind with MVM_SPESH_DISABLE=1 it only reports leaks in two places (instead of the nine with spesh enabled)	08:42
	nine: don't want to distract you from finding big leaks, but you may know something about the two remaining ones. one is related to serialization, there's a chain of rakudo_scalar_set_container_spec, deserialize_stable, ... . the other is just MVM_load_bytecode_buffer_to_cu	08:45
08:57 zakharyas joined
nwc10	good *, #moarvm	09:11
Geth	MoarVM: MasterDuke17++ created pull request #1260: Free compunits loaded from bytecode upon dealloc	09:39
09:53 Geth left, Geth joined
MasterDuke	oh, the one remaining leak (with spesh disabled) is from rakudo's src/vm/moar/ops/container.c	10:12
10:31 sena_kun joined 10:36 AlexDaniel left 10:50 Altai-man_ joined 10:53 sena_kun left
MasterDuke	huh, rakudo_scalar_set_container_spec github.com/rakudo/rakudo/blob/mast...#L173-L177 is being called twice, but rakudo_scalar_gc_free_data github.com/rakudo/rakudo/blob/mast....c#L82-L84 is only being called once, when running `raku --full-cleanup -e ''`	10:53
nwc10	MoarVM oops: Malformed DU chain: writer takenextdispatcher of 5(1) in BB 86 is incorrect	10:55
	Spesh of 'EVAL' (cuid: 12669, file: SETTING::src/core.c/ForeignCode.pm6:27)
	That's t/spec/integration/eval-and-threads.t
	master/master/296fbcf46 (because Rakudo has just moved on...)	10:56
MasterDuke	vrurg: ^^^	10:58
jnthn	ooh, that's a good catch	11:02
	(DU chain checker)++
MasterDuke	what is a DU chain?	11:03
nwc10	jnthn: there were a couple of other spectest failures but I didn't dig into them when I hit that one	11:05
	and at times, spectests seem to be left in a state of fail, which irks me
	well, the spec test is probably correct, but it isn't TODO	11:06
jnthn	MasterDuke: Define-Use	11:12
	Basically links an SSA var with all of the places that use it	11:13
11:33 MasterDuke left 12:39 MasterDuke joined
MasterDuke	jnthn: does a du-chain error usually indicate a problem in the implementation or optimization?	12:42
12:51 sena_kun joined 12:53 Altai-man_ left 13:05 MasterDuke left
jnthn	MasterDuke: It means that something in spesh is transforming the program graph inconsistently, e.g. it doesn't update things properly	13:09
tellable6	jnthn, I'll pass your message to MasterDuke
jnthn	And so leaves the graph in an inconsistent state, which can cause other analyses/optimizations to do the wrong thing
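
To make jnthn's description concrete, here is a minimal C sketch of a define-use consistency check. It is not MoarVM's spesh code: the `Ins` and `Facts` structs and every function name below are hypothetical, chosen only to show how a transformation that edits the instruction list without updating its bookkeeping gets caught by a checker.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, simplified instruction: writes at most one SSA variable
 * and reads at most two. An id of -1 means "slot unused". */
typedef struct {
    int writes;
    int reads[2];
    int deleted;
} Ins;

/* Hypothetical per-variable bookkeeping kept alongside the graph. */
typedef struct {
    int writer;    /* index of the defining instruction, or -1 */
    int num_uses;  /* number of reads the optimizer believes exist */
} Facts;

/* Recompute writer and use count from the instructions and compare them
 * with the recorded facts. Returns 0 when consistent, 1 on a wrong
 * writer, 2 on a wrong use count. */
int check_du_chains(const Ins *ins, size_t n_ins,
                    const Facts *facts, size_t n_vars) {
    for (size_t v = 0; v < n_vars; v++) {
        int writer = -1, uses = 0;
        for (size_t i = 0; i < n_ins; i++) {
            if (ins[i].deleted) continue;
            if (ins[i].writes == (int)v) writer = (int)i;
            for (int r = 0; r < 2; r++)
                if (ins[i].reads[r] == (int)v) uses++;
        }
        if (facts[v].writer != writer) return 1;
        if (facts[v].num_uses != uses) return 2;
    }
    return 0;
}

/* A transformation that removes an instruction but forgets the facts. */
void delete_ins_inconsistently(Ins *ins, size_t idx) {
    ins[idx].deleted = 1;
}

/* The same removal with the bookkeeping kept in sync. */
void delete_ins_consistently(Ins *ins, size_t idx, Facts *facts) {
    assert(!ins[idx].deleted);
    for (int r = 0; r < 2; r++)
        if (ins[idx].reads[r] >= 0)
            facts[ins[idx].reads[r]].num_uses--;
    if (ins[idx].writes >= 0)
        facts[ins[idx].writes].writer = -1;
    ins[idx].deleted = 1;
}
```

In this toy model the checker only flags the first variant; a later pass that trusted the stale use count would be the one to misbehave, which is the failure mode described above.
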
13:22 MasterDuke joined
MasterDuke	fyi, the du chain error in t/spec/integration/eval-and-threads.t disappears if you set MVM_SPESH_INLINE_DISABLE=1	13:26
tellable6	2020-03-27T13:09:23Z #moarvm <jnthn> MasterDuke: It means that something in spesh is transforming the program graph inconsistently, e.g. it doesn't update things properly
jnthn	Then there's a high chance the mistake is in inline.c	13:29
MasterDuke	yep. if i comment out just these lines github.com/MoarVM/MoarVM/blob/mast...#L653-L656 the error stays, just is reported in a different location (Spesh of 'AT-KEY' (cuid: 6608, file: SETTING::src/core.c/Hash.pm6:35)), but if i comment out the else if entirely, everything is fine	13:31
Geth	MoarVM: edb6f87261 | (Daniel Green)++ | src/core/loadbytecode.c Free compunits loaded from bytecode upon dealloc Before this, `MVM_SPESH_DISABLE=1 valgrind --leak-check=full raku --full-cleanup -e ''` would report 'definitely lost: 1,560 bytes in 2 blocks', after it reports 'definitely lost: 32 bytes in 1 blocks'.	13:45
	MoarVM: 8ed5e7f5c6 | niner++ (committed using GitHub Web editor) | src/core/loadbytecode.c Merge pull request #1260 from MasterDuke17/cleanup_compunits_loaded_from_bytecode Free compunits loaded from bytecode upon dealloc
nine	MasterDuke: good catch!
13:46 MasterDuke left
[Coke]	... I saw "free compunits" and was thinking something else. :)	13:58
	like, $0.
13:58 MasterDuke joined
MasterDuke	nine: thanks. btw, did you see my question about the rakudo container spec?	14:00
nine	See yes, understand no :)	14:16
	Who calls rakudo_scalar_set_container_spec?	14:17
MasterDuke	i think 6model code in moarvm	14:18
	maybe just nqp::setcontspec and deserialize_stable()	14:19
	however, by far the bigger leaker is the jit	14:26
	wait, hmm	14:31
jnthn	Does it leak as it works, or just not free pages at process end?
14:33 lucasb joined
MasterDuke	i'm just checking `valgrind raku --full-cleanup -e ''`	14:36
	and i get oddly different reports with MVM_JIT_DISABLE or MVM_SPESH_DISABLE	14:39
	so i'm not 100% sure it's the jit. i think spesh workers are definitely causing problems, and maybe also casting blame on the jit	14:41
	e.g., 411,444 definitely lost, with a stack trace of MVM_vm_create_instance, MVM_spesh_worker_start, MVM_thread_new, MVM_tc_create, calloc	14:42
jnthn	Hm, does full cleanup actually close down the spesh worker, so there's a chance to clear its thread context up?	14:43
	Normally we'd just never care, 'cus it runs all process
MasterDuke	that's what these are supposed to do, right? github.com/MoarVM/MoarVM/blob/mast...#L561-L562	14:44
jnthn	Yeah, that joins it.	14:45
	And the GC global cleanup comes after that
14:50 Altai-man_ joined
Geth	MoarVM: 48da6b2918 | (Stefan Seifert)++ | src/6model/reprs/CArray.c Implement calculation of unmanaged size for CArray	14:51
14:53 sena_kun left
Geth	MoarVM: MasterDuke17++ created pull request #1261: Free some memory calloc'ed when making spesh plans	16:15
	MoarVM: 09b88efb00 | (Daniel Green)++ | src/spesh/plan.c Free some memory calloc'ed when making spesh plans	16:27
	MoarVM: b178623ce5 | niner++ (committed using GitHub Web editor) | src/spesh/plan.c Merge pull request #1261 from MasterDuke17/cleanup_leak_when_planning_for_certain_specialization Free some memory calloc'ed when making spesh plans
MasterDuke	thanks	16:28
	i think most of the rest is from add_nodes_for_typed_argument()	16:29
	github.com/MoarVM/MoarVM/blob/mast...ard.c#L189 and github.com/MoarVM/MoarVM/blob/mast...ard.c#L195	16:31
16:51 sena_kun joined 16:53 Altai-man_ left
Geth	MoarVM: MasterDuke17++ created pull request #1262: Destroy vectors that were previously init'ed	17:03
MasterDuke	a little less sure of ^^^, i've never used MVM_VECTOR_* before	17:04
	valgrind does still report `indirectly lost: 410,767 bytes in 36 blocks` and `still reachable: 1,677 bytes in 5 blocks`	17:05
17:07 zakharyas left
MasterDuke	that's with jit disabled though	17:08
	`MVM_SPESH_BLOCKING=1 MVM_SPESH_NODELAY=1 valgrind --leak-check=full ./install/bin/raku --full-cleanup -e ''` has `definitely lost: 3,064 bytes in 20 blocks` and `indirectly lost: 427,165 bytes in 95 blocks`
17:18 zakharyas joined
Geth	MoarVM: c82ff9d739 | (Daniel Green)++ | src/spesh/arg_guard.c Destroy vectors that were previously init'ed	18:18
	MoarVM: ed3e7cb984 | niner++ (committed using GitHub Web editor) | src/spesh/arg_guard.c Merge pull request #1262 from MasterDuke17/cleanup_leak_when_creating_nodes_to_check_argument_types Destroy vectors that were previously init'ed
	MoarVM: ae71ac80dd | (Stefan Seifert)++ | src/6model/reprs/NativeCall.c Fix NativeCall leaking sym_name on cleanup	18:28
18:50 Altai-man_ joined 18:53 sena_kun left, patrickb joined 18:54 zakharyas left 18:59 patrickb left 19:01 lucasb left 19:07 AlexDaniel joined 19:08 AlexDaniel left, AlexDaniel joined
nine	I know why the spesh thread's tc doesn't get freed!	19:32
MasterDuke	oh!
19:33 MasterDuke left
nine	A thread object's tc doesn't get freed in its gc_free as you'd expect. Instead it's handled by logic in the GC itself. It actually takes 2 GC runs to reach that as we need to keep stuff alive for finalizers to run. This second run simply doesn't happen for the spesh thread.	19:38
	It only affects the spesh thread as we run the GC after joining a thread. So all threads but the very last to be joined will get at least 2 runs. This means that when spesh is disabled, we'd just leak another thread (unless there is none).	19:39
19:39 MasterDuke joined
MasterDuke	so we need to run another gc after what now is the last?	19:41
	or is there a better solution?
Geth	MoarVM: ada1cadaa7 | (Stefan Seifert)++ | src/moar.c Don't leak a (spesh) thread when running with --full-cleanup A thread object's tc doesn't get freed in it's gc_free as you'd expect. Instead it's handled by logic in the GC itself. It actually takes 2 GC runs to reach that as we need to keep stuff alive for finalizers to run. This second run simply doesn't happen for the spesh thread. ... (6 more lines)	19:42
nine	Yeah, we'll just run the GC one more time. As we do this only with --full-cleanup this won't hurt
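
The behaviour nine describes can be modelled with a toy collector. This is a sketch under stated assumptions, not MoarVM's GC: the stage names, `toy_gc_run` and `toy_shutdown` are invented for illustration, and the only property modelled is that a joined thread's context is released on the second collection after the join, with one collection running after each join.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical lifecycle of a thread object in this toy model. */
typedef enum { TOY_RUNNING, TOY_JOINED, TOY_PENDING, TOY_FREED } ToyStage;

typedef struct {
    ToyStage stage;
    void *tc;  /* stands in for the thread context */
} ToyThread;

void toy_threads_init(ToyThread *threads, size_t n) {
    for (size_t i = 0; i < n; i++) {
        threads[i].stage = TOY_RUNNING;
        threads[i].tc = malloc(64);
        assert(threads[i].tc != NULL);
    }
}

/* One collection: a freshly joined thread is only marked (kept alive so
 * finalizers could run); a thread marked by an earlier run is freed. */
void toy_gc_run(ToyThread *threads, size_t n) {
    for (size_t i = 0; i < n; i++) {
        if (threads[i].stage == TOY_JOINED) {
            threads[i].stage = TOY_PENDING;
        } else if (threads[i].stage == TOY_PENDING) {
            free(threads[i].tc);
            threads[i].tc = NULL;
            threads[i].stage = TOY_FREED;
        }
    }
}

/* Join every thread, collecting after each join. Returns how many
 * thread contexts are still allocated afterwards. */
size_t toy_shutdown(ToyThread *threads, size_t n, int extra_gc_run) {
    for (size_t i = 0; i < n; i++) {
        threads[i].stage = TOY_JOINED;
        toy_gc_run(threads, n);
    }
    if (extra_gc_run)
        toy_gc_run(threads, n);  /* the "one more time" from the fix */
    size_t leaked = 0;
    for (size_t i = 0; i < n; i++)
        if (threads[i].tc != NULL) leaked++;
    return leaked;
}
```

Under these assumptions, shutting down three threads without the extra run leaves exactly the last-joined context allocated, and the extra run brings the count to zero.
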
[Coke]	nine++	19:45
MasterDuke	nine++	20:08
	not quite 0, but much less now. `definitely lost: 2,192 bytes in 19 blocks`, `indirectly lost: 16,398 bytes in 59 blocks`	20:10
nwc10	Stage parse : 393.582	20:12
	that is ASAN, spesh no delay and full paranoia.
	I don't remember it being below 400. It used to be above 500	20:13
	Someone(s) have been doing good stuff
	(probably not nine, who does other good stuff)
nine	Haha! We leak all JIT code because.....of an off by 1 error in the reference count (yes we do that)
nwc10	failure to down-karma the reference :-)	20:14
MasterDuke	ha. it did look to me like stuff was supposed to be freed, didn't think to look for off-by-ones
nine	And fixing that uncovered a segfault...	20:15
MasterDuke	doh
20:30 patrickb joined
Geth	MoarVM: 92f1935d7a | (Stefan Seifert)++ | src/jit/compile.c Fix potential segfault when failing to compile JIT code When we abort JIT compilation (due to a negative offset for dynamic label) not all fields in the MVMJitCode struct are initialized, so the MVM_free call on them may cause invalid memory access. Fix by initializing the struct with 0 on allocation.	20:31
	MoarVM: 282989882d | (Stefan Seifert)++ | src/jit/compile.c No longer leak all JIT code JIT code starts with a reference count of 1. AO_fetch_and_sub1 used to check the reference count in MVM_jit_code_destroy returns the original value. So we need to compare with 1 instead of 0.
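
The off-by-one is easy to reproduce with C11 atomics, whose `atomic_fetch_sub` also returns the value held before the decrement. The sketch below is illustrative only: `ToyJitCode` and both destroy functions are hypothetical stand-ins, not the code in src/jit/compile.c, and it uses `<stdatomic.h>` rather than libatomic_ops. The struct is allocated with `calloc`, in the spirit of the zero-initialization fix in the first commit.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdlib.h>

/* Hypothetical stand-in for a reference-counted JIT code object. */
typedef struct {
    atomic_int ref_cnt;
    void *labels;  /* stays NULL unless compilation gets far enough to set it */
} ToyJitCode;

/* Zero-initialize on allocation so that freeing never-set fields is safe,
 * then start the reference count at 1. */
ToyJitCode *toy_code_new(void) {
    ToyJitCode *code = calloc(1, sizeof *code);
    assert(code != NULL);
    atomic_init(&code->ref_cnt, 1);
    return code;
}

/* Buggy: fetch-and-subtract returns the value *before* the decrement,
 * so the last owner sees 1 here, never 0, and nothing is freed.
 * Returns 1 if the object was freed, 0 otherwise. */
int toy_code_destroy_buggy(ToyJitCode *code) {
    if (atomic_fetch_sub(&code->ref_cnt, 1) == 0) {
        free(code->labels);
        free(code);
        return 1;
    }
    return 0;
}

/* Fixed: compare the returned (original) value with 1. */
int toy_code_destroy_fixed(ToyJitCode *code) {
    if (atomic_fetch_sub(&code->ref_cnt, 1) == 1) {
        free(code->labels);  /* free(NULL) is a no-op thanks to calloc */
        free(code);
        return 1;
    }
    return 0;
}
```

With a single owner, the buggy variant returns 0 and leaves the object allocated, which matches the "leak all JIT code" symptom; the fixed variant frees it on the same call.
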
MasterDuke	nine++++, down to only leaking `32 bytes in 1 blocks`!	20:34
timotimo	that's -e '', right?	20:36
MasterDuke	yep, with `MVM_SPESH_BLOCKING=1 MVM_SPESH_NODELAY=1`
timotimo	right
	very cool
	perhaps this should be put on a schedule	20:37
	a week or so before an upcoming release or so
MasterDuke	though now it's the same if you leave them off
timotimo	cool
MasterDuke	you mean, not put it in the release scheduled for tomorrow?
timotimo	oh, no that's not what i mean
MasterDuke	oh, check for new leaks?	20:38
timotimo	yes
Geth	MoarVM: 77484098ff | (Stefan Seifert)++ | 2 files Fix leaking native callback cache on thread exit
MasterDuke	could be part of ci
	i would like a series of performance ci tests/benchmarks run regularly	20:39
timotimo	performance on publicly offered CI infrastructure like travis or appveyor is probably a poor choice, unless we measure with like callgrind or cachegrind
MasterDuke	we do have some credits for aws/azure/gcp, right? even just run weekly or something not quite as frequent as every commit would be good	20:41
nine	That won't make performance tests much more useful	20:43
MasterDuke	it's a start at least. almost always easier to improve something existing than create something new	20:46
20:51 sena_kun joined 20:53 Altai-man_ left
nine	That leak in rakudo_scalar_set_container_spec seems simple enough to plug by just freeing a preexisting st->container_data. That can hardly hurt as we overwrite it anyway. But I'd like jnthn's input on how this was meant to work architecture-wise and if multiple calls to that function are to be expected.	21:00
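
A minimal sketch of the idea nine floats here, assuming the leak comes from a second call overwriting the pointer stored by the first. Everything below is hypothetical (`ToySTable`, `ToyContData`, the `toy_*` functions and the allocation counter); it is not the code in rakudo's container.c, and whether repeated calls are expected at all is exactly the open question put to jnthn.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-ins for an STable and its container data. */
typedef struct { int placeholder; } ToyContData;
typedef struct { ToyContData *container_data; } ToySTable;

/* Counts live allocations so a leak is observable without valgrind. */
static int toy_live_allocs = 0;

static ToyContData *toy_alloc_data(void) {
    ToyContData *data = calloc(1, sizeof *data);
    assert(data != NULL);
    toy_live_allocs++;
    return data;
}

static void toy_free_data(ToyContData *data) {
    if (data) {
        free(data);
        toy_live_allocs--;
    }
}

/* Leaky: a second call overwrites the pointer from the first call. */
void toy_set_container_spec_leaky(ToySTable *st) {
    st->container_data = toy_alloc_data();
}

/* Proposed plug: release any preexisting data before overwriting it. */
void toy_set_container_spec_fixed(ToySTable *st) {
    toy_free_data(st->container_data);
    st->container_data = toy_alloc_data();
}

/* Called once at cleanup, mirroring the single free seen in the log. */
void toy_gc_free_data(ToySTable *st) {
    toy_free_data(st->container_data);
    st->container_data = NULL;
}

int toy_live_count(void) {
    return toy_live_allocs;
}
```

Two sets followed by one free leave one allocation live in the leaky variant and none in the fixed one, which is the two-calls-versus-one-free pattern MasterDuke observed earlier.
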
MasterDuke	i'm also running a rakudo compile (just CORE.c) through valgrind to see if it picks up anything else. not sure how long this is going to take...	21:04
nine	I'd probably give it a night :)	21:06
MasterDuke	943s for stage parse	21:11
nine	That's not all that bad
MasterDuke	this ryzen 7 is a great cpu	21:12
	164s for stage optimize	21:14
	236s for stage mast	21:19
	10s for stage mbc
	and of course i forgot to run with --leak-check=full
nine	Good thing it doesn't take a night then :)	21:20
Geth	MoarVM: 768e6c2f30 | (Stefan Seifert)++ | src/core/threadcontext.c Fix invalid reads when freeing native callback cache Hashes use the fixed size allocator, so we must free the cache before tearing down the allocator.
21:50 zakharyas joined
MasterDuke	i'll gist the results, there definitely still are some leaks	21:51
	gist.github.com/MasterDuke17/e9ef9...28cf6795a7	21:52
	tldr, `definitely lost: 17,312 bytes in 30 blocks`, `indirectly lost: 206,515 bytes in 3,735 blocks`, `possibly lost: 6,038,832 bytes in 1 blocks`, `still reachable: 1,708 bytes in 6 blocks`	21:53
nine	MasterDuke: I have a fix for those MVM_cu_callsite_add issues	21:57
MasterDuke	nice	21:58
nine	Will create a PR, as though it works and I do have a vague understanding of why it may be correct, I'm much less certain than with the other fixes	21:59
MasterDuke	what about the giant one at the end? looks like you've touched that very line in fact	22:02
Geth	MoarVM: niner++ created pull request #1263: Fix interned callsites getting leaked when fixing up inlines	22:04
MasterDuke	btw, looks like github.com/MoarVM/MoarVM/blob/mast...#L904-L906 is missing an `MVM_free(output)`, though that isn't the cause of the leak	22:09
nine	Yeah, it does
22:10 Kaiepi left 22:11 Kaiepi joined
nine	Is it me or is tc->serialized actually never read anywhere?	22:11
MasterDuke	github.com/MoarVM/MoarVM/blob/mast...ode.c#L213 ?	22:12
lizmat	feels like a MoarVM bump is in order? or too soon?	22:14
nine	MasterDuke: that's a different thing. tc->serialized used to be read by the mast compiler, which no longer exists	22:15
MasterDuke	but that's only setting something also. i never see a `serialized` being read...
nine	true	22:17
MasterDuke	doing another valgrind run with your PR and github.com/MoarVM/MoarVM/blob/mast...ion.c#L925 commented out	22:18
22:19 harrow joined
nine	Btw. t/spec/MISC/bug-coverage-stress-6.d.t fails with "continuationinvoke expects an MVMContinuation" when the loop's iterations are bumped like the comments describe	22:23
Geth	MoarVM: 21fa394a8f | (Stefan Seifert)++ | 3 files Remove remnants of the old mast compiler to fix memory leak tc->serialized* were actually no longer used by anything since the mast compiler was removed. In addition, the memory pointed to by them never got freed.	22:26
nine	There seems to be only one leak left according to the valgrind gist. And for that we even have a comment showing how to get rid of it: /* XXX make all the statics members of the global MVM instance instead? */	22:29
22:42 nine joined 22:44 zakharyas left 22:45 camelia joined 22:50 Altai-man_ joined
MasterDuke	huh. still the one in concatenate_outputs	22:53
22:53 sena_kun left 23:10 Altai-man_ left
MasterDuke	yep. did another run and still the same	23:19
