Welcome to the main channel on the development of MoarVM, a virtual machine for NQP and Rakudo (moarvm.org). This channel is being logged for historical purposes.
Set by lizmat on 24 May 2021.
vrurg [Coke]: to clone you need the original. But it's too late to steal it! 00:35
Nicholas good *, #moarvm 07:19
nine Good sunrise! 07:22
Geth MoarVM/fix_phi_out_of_bounds_read: bf106de221 | (Stefan Seifert)++ | 2 files
Fix out of bounds read of PHI facts in spesh

During spesh optimization, we remove reads of registers with dead writers from PHI nodes. It could happen that the PHI node ended up with no registers to read at all. However the following analysis code assumed that we'd always have at least 1 register to read from, resulting in an array read out of bounds error and a variety of failure modes. ... (5 more lines)
08:55
MoarVM: niner++ created pull request #1610:
Fix out of bounds read of PHI facts in spesh
nine dogbert17: fix for complex.t ^^^
Geth MoarVM/fix_phi_out_of_bounds_read: 8a684b3304 | (Stefan Seifert)++ | 2 files
Fix out of bounds read of PHI facts in spesh

During spesh optimization, we remove reads of registers with dead writers from PHI nodes. It could happen that the PHI node ended up with no registers to read at all. However the following analysis code assumed that we'd always have at least 1 register to read from, resulting in an array read out of bounds error and a variety of failure modes. ... (7 more lines)
08:56
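The shape of the fix, as a hedged sketch rather than the actual patch (merge_phi_facts is a made-up name; only the guard is the point): after dead-writer elimination a PHI may have no read operands left, and the facts analysis has to tolerate that.

```c
#include "moar.h"

/* Hedged sketch, not the real spesh code: merge_phi_facts is a hypothetical
 * name. Operand 0 of a PHI is the written register; operands 1..n-1 are the
 * reads. The pre-fix code assumed at least one read always remained. */
static void merge_phi_facts(MVMThreadContext *tc, MVMSpeshGraph *g, MVMSpeshIns *phi) {
    MVMuint16 num_reads = phi->info->num_operands - 1;

    /* After removing reads whose writers are dead, a PHI can end up with zero
     * reads; without this guard the merge below would index past the operand
     * array and pick up bogus facts. */
    if (num_reads == 0)
        return;

    /* ... merge the facts of operands 1 .. num_operands-1 into operand 0 ... */
}
```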
nine (just added a reference to the GH issue to the commit message)
MasterDuke nice 08:59
dogbert17 nine++, was it an easy fix? 09:17
nine A few hours in total 09:22
Suddenly made a lot of sense when I dumped the spesh graph and saw a PHI that was not actually accessing the register that we got the bogus facts for
dogbert17 I wonder how often this actually happen 09:38
*happens 09:39
lizmat moarning! 09:47
so, with regards to the release... are we in agreement to postpone the release to 4 Dec ? 09:48
nine agrees
jnthnwrthngtn moarning o/ 09:54
Nicholas \o 09:55
lizmat and if we are in agreement on the postponement, does that also mean that the attrinited work by jnthn should still go in after that release 09:56
or that we move that forward ?
jnthnwrthngtn I locally ran blin on that branch over the weekend, didn't look at the results yet 09:58
dogbert17 dogbert@dogbert-VirtualBox:~/repos/oo-monitors$ perl6 -Ilib t/basic.t 11:37
===SORRY!=== Error while compiling /home/dogbert/repos/oo-monitors/t/basic.t
Missing or wrong version of dependency 'gen/moar/stage2/NQPHLL.nqp' (from '/home/dogbert/repos/oo-monitors/lib/OO/Monitors.pm6 (OO::Monitors)')
at /home/dogbert/repos/oo-monitors/t/basic.t:1
what would be needed in order to figure out why this problem occurs after a bump?
MasterDuke jnthnwrthngtn: an interesting change in behavior from new-disp, referenced in my most recent comment on github.com/rakudo/rakudo/pull/4650 12:05
committable6: 2021.09,2021.10 my Int $a; try { $a.VAR.log; CATCH { default { say $_.typename } } } 12:07
committable6 MasterDuke, ¦2021.09: «Int␤» ¦2021.10: «Scalar␤»
lizmat MasterDuke: I guess that enforces my point about improving the error message about containers :-) 12:25
jnthnwrthngtn MasterDuke: That looks more like a fix to me than anything? :) 12:37
MasterDuke heh, yeah. just wondering if there are any other places we should look for such things 12:38
though, tbh, i'm not sure why the invocant is `Int`. shouldn't it be `Scalar` also? 12:39
lizmat and yet another Rakudo Weekly News hits the Net: rakudoweekly.blog/2021/11/23/2021-...adler-rip/ 13:30
timo: in answer to your question, a golf of the declining CPU usage on race 14:14
say (^5000000).roll(20).race(:1batch).map: { [+] ^$_ .map: { $_ * $_ } } 14:15
the answer is not important, the CPU usage is
timo having only four cores makes this possibly not easy to reproduce? 14:18
lizmat only has 8 though 14:19
saw similar behaviour on a 4 core machine 14:21
gist.github.com/lizmat/2e5bb69739d...24fdc0dd47 # snapper 14:22
MasterDuke yeah, i see pretty much the same thing 14:27
lizmat hmmm... maybe this is not a good example... 14:39
playing with some debug messages in core
hmmm... is nqp::time threadsafe ? 14:43
jnthnwrthngtn Struggle to imagine it not being; it calculates and returns a simple numeric value? 14:44
lizmat I'm just seeing strange values in debug messages, like "13: completed in 7936936 msecs" 14:45
that would be more than 2 hours :-)
I basically added: 14:46
my $from = nqp::time;
say "$*THREAD.id(): starting task";
evalable6 1: starting task
lizmat to !run-one
and:
say "$*THREAD.id(): completed in { ((nqp::time() - $from) / 1000).Int } msecs";
at the end
jnthnwrthngtn Are the units of nqp::time micros or nanos? 14:51
lizmat yeah, the golf is flawed
nanos
jnthnwrthngtn Ah, maybe you intended msecs to be micro rather than milli, and I assumed milli... 14:52
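For reference, a self-contained sketch (not the actual MoarVM op body) of what nqp::time boils down to; it only samples the system clock, so there is no shared state to make it thread-unsafe, and since it returns nanoseconds, dividing by 1000 gives microseconds, not milliseconds.

```c
#include <stdint.h>
#include <time.h>

/* Sketch only: roughly what a nanosecond-resolution time op does.
 * No shared mutable state, hence trivially thread-safe. */
static uint64_t time_nanos_sketch(void) {
    struct timespec ts;
    clock_gettime(CLOCK_REALTIME, &ts);
    return (uint64_t)ts.tv_sec * 1000000000u + (uint64_t)ts.tv_nsec;
}

/* nanos / 1000      == microseconds (what the "msecs" debug output actually showed)
 * nanos / 1000000   == milliseconds                                                 */
```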
lizmat say (^5000000).roll(20).race(:1batch).map: { [+] ^$_ .race.map: { $_ * $_ } } # better golf 15:00
and this better golf shows no decline in CPU usage, so I guess it *is* my log loading algorithm that is to blame
jnthnwrthngtn Is your loading CPU or I/O bound, ooc? 15:02
lizmat well, I'd say CPU bound, as the IO is just a .slurp
timo: please disregard my golf, until I have a better one 15:31
dogbert17 nine: I have now been running complex.t in a loop for a couple of hours and it hasn't crashed, so your PHI fix works perfectly 16:26
dogbert17 wonders if nine's PR might have fixed the hyper bug as well 16:30
nine \o/ 16:38
MasterDuke nine: btw, did you see dev.azure.com/MoarVM/MoarVM/_build...559d8f7fdf ? 16:39
nine oh no 16:40
MasterDuke and while there's some recent talk about releases and whether to delay merging branches, anyone have thoughts/comments/suggestions on github.com/MoarVM/MoarVM/pull/1608 ?
nine MasterDuke: I actually have an idea about that failure 16:44
MasterDuke oh? 16:45
nine res is uninitialized here: github.com/MoarVM/MoarVM/blob/mast...ffi.c#L217 but added to frame roots here: github.com/MoarVM/MoarVM/blob/mast...ffi.c#L326 16:46
MasterDuke probably unrelated, but `values` leaks here github.com/MoarVM/MoarVM/blob/mast...ffi.c#L220 16:52
Geth MoarVM: 0006714d07 | (Stefan Seifert)++ | src/core/nativecall_libffi.c
Fix use of uninitialized memory in native callbacks with libffi

We're GC rooting res.o but didn't initialize the local variable. This could cause memory corruption or segfaults when the GC was trying to process this object.
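For readers not following the links, the problem pattern looks roughly like this (a simplified sketch, not the verbatim libffi callback handler):

```c
#include "moar.h"

/* Sketch of the bug nine describes: a local MVMRegister was pushed onto the
 * temporary GC roots while still holding whatever garbage was on the stack,
 * so a GC run could chase a junk pointer. Zero-initialising it first (the
 * spirit of 0006714d07) makes the root harmless until it is actually set. */
static void callback_handler_sketch(MVMThreadContext *tc) {
    MVMRegister res;
    res.o = NULL;   /* previously uninitialized, yet rooted below */

    MVM_gc_root_temp_push(tc, (MVMCollectable **)&res.o);
    /* ... invoke the Raku-level callback, eventually filling in res ... */
    MVM_gc_root_temp_pop(tc);
}
```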
MasterDuke it isn't used at all in callback_handler 16:55
nine MasterDuke: indeed! Please just remove it :) 16:56
MasterDuke it's alloca'ed in MVM_nativecall_invoke and MVM_nativecall_dispatch, but the individual elements are MVM_malloc'ed, so there could be a leak if an exception is thrown partway through the `for (i = 0; i < num_args; i++) {` loops 16:57
oh wait, don't all the elements in `values` leak even if there's no exception? 17:41
nine MasterDuke: looks like, yes 17:44
Geth MoarVM: 66688b941e | (Daniel Green)++ | src/core/nativecall_libffi.c
Remove unused variable
18:19
MasterDuke that was simple enough to just do, the other stuff i'll do in a branch/pr 18:20
nine Looks to me like those little mallocs for arg handling could become allocas easily 18:21
MasterDuke ah, maybe i'll add them to github.com/MoarVM/MoarVM/pull/1608 18:22
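A sketch of what that could look like (simplified and hypothetical, not the actual MVM_nativecall_invoke code): taking the per-argument storage with alloca in the same function that performs the ffi_call means it is released with the stack frame, so nothing needs an explicit free even on an exceptional exit.

```c
#include <alloca.h>
#include <ffi.h>
#include "moar.h"

/* Hypothetical, simplified sketch of the idea: per-argument buffers come
 * from alloca instead of MVM_malloc, so they live exactly as long as the
 * enclosing call and cannot leak, with or without an exception. */
static void invoke_sketch(MVMThreadContext *tc, ffi_cif *cif,
                          void (*entry_point)(void), MVMint16 num_args) {
    void  **values = alloca(num_args * sizeof(void *));
    ffi_arg result;
    MVMint16 i;

    for (i = 0; i < num_args; i++) {
        /* before: values[i] = MVM_malloc(sizeof(ffi_sarg)); with no matching free */
        values[i] = alloca(sizeof(ffi_sarg));
        /* ... marshal the i-th argument into values[i] ... */
    }

    ffi_call(cif, entry_point, &result, values);
}
```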
nine Btw. I've managed to catch one of those segfaults in the p6-GLib build process in rr 19:32
lizmat nine++ 19:43
MasterDuke nice 19:44
nine Seems like none of the hypotheses so far fits this new data 19:49
MasterDuke m: use nqp; my num $a; $a = nqp::time_n for ^1_000_000; say now - INIT now 19:54
camelia 0.04672441
MasterDuke m: use nqp; my num $a; $a = now.Num for ^1_000_000; say now - INIT now # huh, i thought this was closer to nqp::time_n than it is...
camelia 2.437410707
timo well, what does it spesh to? 19:56
nine What I've got so far: we're doing a return_o. This calls MVM_frame_try_return which finds an exit handler on the current frame. It then calls MVM_frame_dispatch_from_c to run this exit handler. MVM_frame_dispatch_from_c sets the caller's return address: cur_frame->return_address = *(tc->interp_cur_op)
But cur_op was already advanced, so it actually points right at the end of the bytecode 19:57
When we return from that exit handler, we then start processing whatever follows the bytecode in memory.
Now this much is pretty clear. What isn't is why this only happens now and then, not more deterministically. 19:58
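A hedged sketch of the kind of runloop sanity check nine mentions adding a bit later (hypothetical form; it ignores speshed bytecode for simplicity): verify that cur_op still lies within the current frame's bytecode, so escaping past the end is caught immediately instead of silently interpreting whatever follows in memory.

```c
#include "moar.h"

/* Sketch only: checks that the interpreter is still inside the current
 * frame's (non-speshed) bytecode. A real check would also have to account
 * for spesh candidates, which have their own bytecode buffers. */
static void assert_in_bytecode(MVMThreadContext *tc, MVMuint8 *cur_op) {
    MVMStaticFrame *sf    = tc->cur_frame->static_info;
    MVMuint8       *start = sf->body.bytecode;
    MVMuint8       *end   = start + sf->body.bytecode_size;

    if (cur_op < start || cur_op >= end)
        MVM_panic(1, "Interpreter left the bytecode of the current frame");
}
```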
MasterDuke ugh. $.tai needing to be a Rational in Instant is really annoying 20:06
even the 'after' spesh of 'Num' has gcd_I 20:07
ah, but some manual inlining seems to help 20:12
got it down to 0.325s
nine Oooh....it has something to do with spesh. I added an assertion to the runloop to reliably catch when we exit the proper bytecode. With MVM_SPESH_BLOCKING=1 it fails every time while with MVM_SPESH_DISABLE=1 I haven't seen it fail yet. 21:09
But it's a quite unusual spesh issue. It happens even with MVM_SPESH_LIMIT=1 21:29
So it's not speshing of any particular frame that causes the issue, but spesh being active at all. But then, I don't understand how MVM_SPESH_BLOCKING=1 can make such a difference
MasterDuke NO_DELAY change anything? 21:31
japhb is looking forward to MasterDuke's PR speeding up `now` 22:44
japhb D'oh! Now I see it. Sigh. ETOOMANYCHANNELS 23:13