#moarvm on 16 June 2021 - Raku Programming Language Log

Welcome to the main channel on the development of MoarVM, a virtual machine for NQP and Rakudo (moarvm.org). This channel is being logged for historical purposes. Set by lizmat on 24 May 2021.
00:02 reportable6 left 00:06 reportable6 joined 02:11 japhb joined 02:23 frost joined 02:28 frost left 02:39 frost joined 02:40 frost left, frost joined, frost left 02:43 frost joined 04:32 vrurg joined 04:34 vrurg_ left 05:23 nebuchadnezzar joined 06:02 reportable6 left 06:05 reportable6 joined 08:40 linkable6 left 08:42 linkable6 joined 09:16 dogbert17 joined 09:20 squashable6 left 09:21 squashable6 joined 09:31 RakuIRCLogger left, AlexDaniel left, RakuIRCLogger joined 09:34 AlexDaniel joined 11:20 squashable6 left
nine	jnthnwrthngtn: spesh needs to retain instructions when their result or side effect is needed by deoptimized code. Would it be possible to run those instructions only when we actually deoptimize? I.e. have something like deoptimization handlers akin to exception handlers?	11:21	Copy link Message link Add to gist Remove
11:21 squashable6 joined 12:02 reportable6 left 12:03 reportable6 joined
jnthnwrthngtn	nine: Well, anything that really has side-effects can't be eliminated as dead anyway, unless it can prove the side-effects will be invisible. Results of pure things are another case, and there's options there.	12:42	Copy link Message link Add to gist Remove
	Emitting code to run when we deoptimize is one possibility, although quite a step up in complexity.		Copy link Message link Add to gist Remove
	Two other possibilities:	12:43	Copy link Message link Add to gist Remove
	1. Keep a table of registers to populate with wvals or other literals upon deopt, or perhaps to set from other registers. This may cover a lot of cases without leading to arbitrary complexity in deopt handlers.	12:44	Copy link Message link Add to gist Remove
	2. Figure out a way to move the deopt point further back in time, so we re-run a bit more of the original code during a deopt. This is...harder.		Copy link Message link Add to gist Remove
nine	Further back in time means to a point earlier?	12:52	Copy link Message link Add to gist Remove
jnthnwrthngtn	Yes, I mean deopt to an earlier bytecode address so that we replay more	12:53	Copy link Message link Add to gist Remove
	Of course once needs to be really sure we don't duplicate side-effects		Copy link Message link Add to gist Remove
nine	Well we do have information about which ops are pure and thus safe	12:54	Copy link Message link Add to gist Remove
	I do wonder though how much this could gain us		Copy link Message link Add to gist Remove
jnthnwrthngtn	Option 1 is at least easy to dump and understand what it will do (we already dump out the materializations we'll perform for example)		Copy link Message link Add to gist Remove
	I dunno, to me it looked like mostly cheap (especially after JIT) instructions	12:55	Copy link Message link Add to gist Remove
nine	Good news of the day: I got the Raku port of the non-trivial numerical simulation we have at work to perform as fast as the Perl + Inline C version that's in production!	12:56	Copy link Message link Add to gist Remove
	Of course I need a hyper to get there (which revealed the issue I posted in #raku-dev), but even without it's only a factor of 2 off.		Copy link Message link Add to gist Remove
jnthnwrthngtn	Oh, wow :)	12:57	Copy link Message link Add to gist Remove
	Is that Raku + nativecall to C?		Copy link Message link Add to gist Remove
nine	But...boy, one really needs to avoid NumLexRef and NumAttrRef like the plague. Which leads to odd code like foo(0e0 + $bar)		Copy link Message link Add to gist Remove
	No, pure Raku		Copy link Message link Add to gist Remove
	And the only nqp involved is nqp::div_n in 2 places, because the / op is so heavy, we cannot even inline it	12:58	Copy link Message link Add to gist Remove
jnthnwrthngtn	Oh, yes, that one wants better analyses in static opt and perhaps some new-disp work		Copy link Message link Add to gist Remove
nine	Also for example $distance - (my num $ = time_to_force(1e0 - $time_distance		Copy link Message link Add to gist Remove
	Even though time_to_force already returns a num		Copy link Message link Add to gist Remove
jnthnwrthngtn	.oO( At least we're all well experienced at trying to avoid plagues these days... )		Copy link Message link Add to gist Remove
nine	I guess it's time to update that idiom :D	12:59	Copy link Message link Add to gist Remove
jnthnwrthngtn	For sub calls that we can resolve, we can introspect the signature to see if there's rw things. Heck, even for multis we can often say "no candidate has anything that could ever need this rw"	13:02	Copy link Message link Add to gist Remove
nine	Sounds easy to detect such situations in the static optimizer. But what would it optimize the code to?	13:09	Copy link Message link Add to gist Remove
13:33 lizmat left 13:34 lizmat joined
jnthnwrthngtn	nine: Just reading the var rather than taking a ref to it	13:48	Copy link Message link Add to gist Remove
	(The QAST::Var would have attribute or lexical as the scope, not attributeref or lexref, iirc)	13:50	Copy link Message link Add to gist Remove
nine	Huh....looks like the static optimizer has exactly that kind of optimization already in place: simplify_refs # Looks through positional args for any lexicalref or attributeref, and if we find them check if the expectation is for an non-rw argument.	15:08	Copy link Message link Add to gist Remove
	Aaah, of course! It's the static optimizer. So it will only be able to do this if it can statically find a multi candidate. And it cannot if any arguments are method calls. In those cases the other arguments will remain (lexical\|attr)refs	15:31	Copy link Message link Add to gist Remove
MasterDuke	could a spesh plugin do better?	15:58	Copy link Message link Add to gist Remove
jnthnwrthngtn	nine: Yes, but all the multi candidates are statically known, so we could look at all of them and see that none want an rw arg	16:13	Copy link Message link Add to gist Remove
nine	Ooh... +5 insightful	16:14	Copy link Message link Add to gist Remove
jnthnwrthngtn	nine: Need to take care of anything with a \foo arg as well as explicitly raw/rw or a capture arg	16:15	Copy link Message link Add to gist Remove
	But other than those details it's not so bad		Copy link Message link Add to gist Remove
nine	Like.... multi sub infix:<->(Num:D \a, Num:D \b)	16:16	Copy link Message link Add to gist Remove
jnthnwrthngtn	Yes, but I've no idea why that's written \ and not $		Copy link Message link Add to gist Remove
	Oh, maybe I do		Copy link Message link Add to gist Remove
nine	To avoid the Scalar		Copy link Message link Add to gist Remove
jnthnwrthngtn	It was because historically we could emit code to produce a Scalar container in that case, but these days I'm quite sure we do not	16:17	Copy link Message link Add to gist Remove
	s/could/would/		Copy link Message link Add to gist Remove
	I think the Num type constraint tells us "ok, it's not iterable, so we'll get away with it"		Copy link Message link Add to gist Remove
nine	So type constraints do actually help with optimization?	16:18	Copy link Message link Add to gist Remove
jnthnwrthngtn	MasterDuke: (spesh plugin) They're not really in a position to help, I think. But new-disp is a bit better place.		Copy link Message link Add to gist Remove
	nine: Certainly in this case, yes :)		Copy link Message link Add to gist Remove
	new-disp alone isn't able to totally do it, because when we emit the call we either need to pass in a native ref or a native value	16:19	Copy link Message link Add to gist Remove
nine	How do we know it's not a WeirdNum is Num does Iterable?	16:20	Copy link Message link Add to gist Remove
jnthnwrthngtn	However, what new-disp will allow us to do is front-load the decont operations by making the part of th dispatch program		Copy link Message link Add to gist Remove
	nine: We don't. We just hope nobody is so awful. :P		Copy link Message link Add to gist Remove
	One of those cases of "I know it's a cheat but I'm pretty sure we'll get away with it".	16:21	Copy link Message link Add to gist Remove
nine	I didn't know we had those :)	16:22	Copy link Message link Add to gist Remove
jnthnwrthngtn	Anyway, on the new-disp thing: if the dispatch program looks at the signature, it can put the derefs (for Scalar, and with a few more ops, for native refs) into the dispatch program itself		Copy link Message link Add to gist Remove
	This has multiple benefits		Copy link Message link Add to gist Remove
	1. Less variance in argument tuples, so unlike today where for :(Int, Int) we may produce 4 specializations for all combinations of value and containered value, we'd only ever produce 1.	16:24	Copy link Message link Add to gist Remove
	2. If it's a native ref, then the taking of it and deref of it is all in the caller, so even if we don't end up inlining, it's visible for spesh to eliminate.		Copy link Message link Add to gist Remove
	3. Ditto for EA; the Scalar doesn't escape, so we can do more Scalar replacements	16:25	Copy link Message link Add to gist Remove
	Note that since new-disp is a runtime thing, we can do this not only for subs, for also for methods.		Copy link Message link Add to gist Remove
	*but also	16:26	Copy link Message link Add to gist Remove
	(Note also that the big reason that new-disp can do this is because unlike spesh plugins, where we have a 2 stage "pick a destination with these args", "invoke the chosen destination with these other args", new-disp is powerful enough to rewrite the argument capture entirely.)	16:28	Copy link Message link Add to gist Remove
17:48 frost left 18:02 reportable6 left 18:04 reportable6 joined
MasterDuke	anyone have questions/concerns/comments about github.com/MoarVM/MoarVM/pull/1508 ?	18:21	Copy link Message link Add to gist Remove
	it was inspired by the conversation in github.com/Raku/nqp/pull/728	18:23	Copy link Message link Add to gist Remove
nine	What are those paths relative to?	18:28	Copy link Message link Add to gist Remove
MasterDuke	in moarvm, `$(DESTDIR)/$(PREFIX)/include/`. in nqp's build, it'd be `@moar::prefix@/include/`	18:34	Copy link Message link Add to gist Remove
	this is not a 100% unbreakable solution, but it will let us exchange a hard-coded list of directories in nqp/rakudo with something a little more dynamic	18:40	Copy link Message link Add to gist Remove
nine	Why not put in the full paths from the start?	18:43	Copy link Message link Add to gist Remove
MasterDuke	i don't really care, but since this a space separated list it seems safer to not include the full path, which is more likely to have a space in it than just the final directory names	18:47	Copy link Message link Add to gist Remove
nine	Space separated? Oh, that sounds bad indeed	20:24	Copy link Message link Add to gist Remove
	At the end of the day the Raku version of the simulation is less than a factor of 2 slower than the Perl + C version. And with parallelization it's actually some 30 % faster!	20:32	Copy link Message link Add to gist Remove
	And the lovely thing about this is that instead of hardly readable C code like "double const geo_distance = SvNV(*(hv_fetch(distances_hv, name_pv, name_len, 0)));" the optimizations I did in the Raku version actually made the code more readable.	20:34	Copy link Message link Add to gist Remove
jdv	but what does it do?		Copy link Message link Add to gist Remove
nine	After all it's just adding type annotations to method arguments and splitting complex expressions into smaller ones so I could store the intermediary values in typed variables.	20:35	Copy link Message link Add to gist Remove
	jdv: I actually gave a lightning talk on this a few years ago: www.youtube.com/watch?v=XasKakffdGQ		Copy link Message link Add to gist Remove
jdv	i got some of it. from back in the day when talk recordings were barely usable;)	20:40	Copy link Message link Add to gist Remove
	thanks		Copy link Message link Add to gist Remove

Please report any issues / comments / feature requests as an issue on App::Raku::Log.

Thank you!