01:40 benchable6 joined 01:57 ilbot3 joined 02:49 greppable6 joined
Geth MoarVM/grisu: af2eb8a7f7 | (Zoffix Znet)++ | src/math/grisu.c
More post-Grisu3 Num renderer polishing

  - Fix a couple of rendering issues introed in
   earlier mod[^1]
  - Add more cases of handling decimal positions
   like we had before Grisu3 stuff.
  [1] github.com/MoarVM/MoarVM/commit/8841c4241b
03:44
MoarVM: zoffixznet++ created pull request #823:
Stringify Num using Grisu3 algo / Generalize mp_get_double() routine
04:06
MoarVM/master: 5 commits pushed by (Zoffix Znet)++
MoarVM: Kaiepi++ created pull request #824:
Allow NativeCall support for wchar_t
05:09
05:32 SourceBaby joined 05:35 SourceBaby joined 05:41 SourceBaby joined 05:42 SourceBaby joined 05:44 SourceBaby joined
samcv so I've decided after much thinking that while i'll retain the shiftjis name in the shiftjis decode/encode functions, the name of the encoding is going to be referenced as windows 06:43
windows-932 since that's the official name of the extensions that it uses, and make sure we don't encounter issues if or when i add support for the baseline shiftjis standard 06:44
though maybe not the most common name it's known by, it's accurate
Geth MoarVM: 4d3fc2818d | (Zoffix Znet)++ | src/math/bigintops.c
Fix handling of denormals in nqp::div_In

Makes 3.1x faster: nqp::div_In with 2**1020 Ints
   19.3x faster: nqp::div_In with 2**1020100020 Ints
Fixes RT#130155: rt.perl.org/Ticket/Display.html?id=130155 Fixes RT#130154: rt.perl.org/Ticket/Display.html?id=130154 Fixes RT#130153: rt.perl.org/Ticket/Display.html?id=130153 ... (19 more lines)
07:00
synopsebot RT#130155 [new]: rt.perl.org/Ticket/Display.html?id=130155 [BUG] Rat operations give bogus underflow
synopsebot RT#130154 [new]: rt.perl.org/Ticket/Display.html?id=130154 [BUG] Int/Int gives bogus underflow
RT#130153 [new]: rt.perl.org/Ticket/Display.html?id=130153 [9999][BUG] Int**Int yields bogus overflow
07:01 evalable6 joined
Geth MoarVM: a5ed7ea5ed | (Zoffix Znet)++ | src/math/bigintops.c
Tweak naming of double mantissa size define

Tis teh bits; ain't digits
07:08
MoarVM/master: 5 commits pushed by (Samantha McVey)++ 07:31
08:06 domidumont joined 08:12 domidumont joined 08:32 lizmat joined 09:48 evalable6 joined
MasterDuke timotimo: here's another --profile segv. `$ = (1,2,3,4,5).max for ^100_000` 10:18
backtrace (minus ~87k lines) gist.github.com/MasterDuke17/a0ced...445d90da23 10:32
10:34 Ven`` joined 10:57 Ven`` joined
timotimo 87k lines? holy crap. maybe that's actually a stack overflow? 11:29
MasterDuke it was just repeats of `#87351 0x00007ffff76c955c in dump_call_graph_node (tc=tc@entry=0x555555758c60, pds=pds@entry=0x7fffffffd540, pcn=0x5555557bb610) at src/profiler/instrument.c:420` 11:30
timotimo yeah
not the same addresses though, right?
MasterDuke correct, different addresses for pcn 11:31
timotimo i have no explanation for this deep of a call graph yet 11:33
MasterDuke runs just fine with MVM_SPESH_INLINE_DISABLE=1 11:36
timotimo i wonder if my recent inline fix was just bogus? 11:38
MasterDuke you definitely fixed something, right? maybe there's just more to fix that was uncovered by that change 11:41
timotimo the fix got in right *after* the release, though, right?
MasterDuke i think. fwiw, i'm at HEAD 11:43
timotimo ok so at least it's actually a stack overflow, not some other kind of crash 11:52
so, the good news is, i should probably be able to reduce the stack size of dump_call_graph_node a little 11:59
MasterDuke so we get more iterations in before it overflows? 12:00
timotimo yeah 12:02
it should definitely not grow that big, though
actually, perhaps i can't do better than 144 bytes 12:05
wait, did i just confuse bits and bytes again %) 12:06
OK, the one 144-byte frame is now two frames; the one that'll remain on the stack more often is 80 bytes, the other one is 128 12:19
not quite as good as i had hoped, maybe i can do something about it yet.
MasterDuke nice 12:21
timotimo cool, 64 and 128 now 12:23
MasterDuke nicer 12:24
timotimo it now reaches the "write profile file" stage 12:26
12:27 benchable6 joined
timotimo just 160 megs! 12:28
MasterDuke does it actually write it out?
timotimo yup
MasterDuke good deal 12:29
Geth MoarVM/profile_dump_less_stack_usage: 90b05c81b9 | (Timo Paulssen)++ | src/profiler/instrument.c
profile: extract recursion loop for smaller stack frames

dumping call graphs used to put 144 bytes onto the stack for every slice of recursion, now it'll deposit just 64 bytes for every slice and put a 128 byte frame on top to do most of the actual work.
12:31
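(For context: at roughly 87k recursion levels, 144 bytes per level works out to about 12 MB of C stack, comfortably past a typical 8 MB default, so shrinking the per-level frame directly raises the depth the dumper can survive. Below is a minimal sketch of the pattern the commit describes — a slim recursive walker plus a separate worker that holds the bulky locals — using hypothetical names and a toy node type, not MoarVM's actual structures.)

```c
#include <stdio.h>

/* Hypothetical call-graph node; stands in for the profiler's real node type. */
typedef struct CGNode {
    unsigned int    id;
    unsigned int    num_children;
    struct CGNode **children;
} CGNode;

/* Worker: holds the bulky locals needed to emit one node.  It returns before
 * any recursion happens, so its larger frame is never stacked once per level. */
static void dump_one_node(const CGNode *node, unsigned int depth) {
    char line[96];                                   /* bulky locals live here */
    snprintf(line, sizeof line, "%*snode %u", (int)(depth * 2), "", node->id);
    puts(line);
}

/* Recursion loop: kept as slim as possible, so each level of call-graph depth
 * only costs a couple of pointers and an int on the C stack. */
static void dump_call_graph(const CGNode *node, unsigned int depth) {
    dump_one_node(node, depth);                      /* heavy work elsewhere */
    for (unsigned int i = 0; i < node->num_children; i++)
        dump_call_graph(node->children[i], depth + 1);
}

int main(void) {
    CGNode  leaf   = { 2, 0, NULL };
    CGNode *kids[] = { &leaf };
    CGNode  root   = { 1, 1, kids };
    dump_call_graph(&root, 0);
    return 0;
}
```

(One caveat with this split: a compiler may inline a small static worker back into the walker and merge the frames again, so a real patch may need to keep them apart explicitly, e.g. with a noinline attribute.)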
timotimo MasterDuke: could you have a look at what the stack actually looks like? i'll try to enjoy the last bit of sun on the balcony 12:32
MasterDuke what do you mean? 12:33
with your branch?
timotimo: gist updated with output of `info frame` at the segv 12:36
timotimo oh, i meant the profiled call stack 12:52
so we have almost 100k of each pull-one, infix:<cmp>, infix:«>», return, max, iterator-and-first, is-lazy, iterator, ReifiedList, one anonymous routine, new, and SET-SELF, the last 4 out of Rakudo/Iterator 12:57
and almost 100k of the -e frame
it seems to think that the -e frame is calling itself over and over 12:58
MasterDuke huh. think it's a problem in how the profiler is recording the data? or in how spesh is doing inlining? 13:10
timotimo not sure yet 13:11
14:02 AlexDaniel joined 14:22 greppable6 joined 15:00 robertle joined
timotimo what was the second-to-last profile example that segfaulted? :| 15:58
15:58 committable6 joined
dogbert11 second to last? 15:59
from me or MasterDuke?
timotimo i'm not entirely sure :|
dogbert11 perhaps this gist.github.com/dogbert17/750ffbf9...72b70779d8 16:01
timotimo let's see 16:03
dogbert11 do you have a new theory? 16:04
timotimo that used to segfault?
dogbert11 yes
timotimo doesn't any more :)
dogbert11 timotimo++
it worked if you turned off inlining 16:05
timotimo same as the one today, then
dogbert11 perhaps your fix works there as well 16:06
timotimo it did
dogbert11 cooool
timotimo but it just makes it non-explosive
the call graph being so huge is still bogus
dogbert11 aha, one mystery left then
I have one small script where the profile get 150 megs 16:07
timotimo that's likely the same underlying issue 16:08
16:09 benchable6 joined
dogbert11 if it helps, I can tune a script so that the profile, albeit buggy, is quite small 16:10
timotimo 93.2% (5054407876196421ms) 16:11
*sigh*
Infinity% (14.95ms)
oops, did i say it doesn't crash any more 16:12
it just doesn't crash reliably
dogbert11 FWIW if I run the profile under valgrind it does not SEGV 16:22
and the generated profile doesn't look bogus 16:24
timotimo the call graph is suspiciously deep 16:27
dogbert11 are you referring to the long spike on that page 16:28
timotimo yeah
got something to show you
once i figure out how to work this GUI here
dogbert11 clicking it gives 'push-at-least SETTING::src/core/Iterator.pm6:49 ' 16:30
timotimo i.imgur.com/htmC15q.png
see how the structure is suspiciously similar?
dogbert11 yes 16:31
timotimo i think we're accidentally forgetting to handle a prof_exit and thus recursing too deep 16:32
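(A toy illustration of timotimo's theory — a hypothetical recorder, not MoarVM's actual profiler code: the cursor into the call graph only comes back up on an exit event, so one dropped exit per call turns repeated calls into ever-deeper nesting, which later blows the C stack when the tree is dumped recursively.)

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical call-graph recorder: a cursor walks a tree of nodes, one
 * level down per "enter" event and one level up per "exit" event. */
typedef struct Node {
    const char  *name;
    struct Node *parent;
    struct Node *child;                      /* one child slot suffices here */
    unsigned     calls;
} Node;

static Node *prof_enter(Node *cur, const char *name) {
    if (cur->child && strcmp(cur->child->name, name) == 0) {
        cur->child->calls++;                 /* repeated call: reuse the node */
        return cur->child;
    }
    Node *n = calloc(1, sizeof *n);          /* first call: add a child node  */
    n->name = name;
    n->parent = cur;
    n->calls = 1;
    cur->child = n;
    return n;
}

static Node *prof_exit(Node *cur) { return cur->parent; }

static unsigned depth(const Node *cur) {
    unsigned d = 0;
    while (cur->parent) { cur = cur->parent; d++; }
    return d;
}

int main(void) {
    Node  root = { "root", NULL, NULL, 0 };
    Node *cur  = &root;

    /* Properly paired events: the cursor returns to the root every time,
     * so 100k calls to the same frame stay one level deep. */
    for (int i = 0; i < 100000; i++) {
        cur = prof_enter(cur, "-e");
        cur = prof_exit(cur);
    }
    printf("paired enter/exit: depth %u\n", depth(root.child));  /* 1 */

    /* Drop the exits: every call nests under the previous one and the graph
     * gains a level per call - the same shape as the -e frame that appears
     * to call itself ~100k times in the broken profile. */
    for (int i = 0; i < 100000; i++)
        cur = prof_enter(cur, "-e");
    printf("exits dropped:     depth %u\n", depth(cur));         /* 100000 */
    return 0;
}
```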
dogbert11 the allocations page also looks a bit strange
there are many lines mentioning the same thing, e.g. Rakudo::Iterator::CountOnlyBoolOnlyDelegate is mentioned 40-50 times 16:34
timotimo that does seem wrong, yeah
dogbert11 instead of being aggregated
same with '<anon|23>+{Rakudo::Iterator::CountOnlyBoolOnlyDelegate[<anon|33>]}' and 'Rakudo::Iterator::CountOnlyBoolOnlyDelegate[<anon|33>]' 16:36
timotimo it's small enough that i can do it without --minimal, i.e. get the function names in the graph, too 16:42
dogbert11 does that give you any new clues? 16:43
timotimo let's see
hm, so, the implementation of List:D's ACCEPTS is deeply recursive 17:31
this list's accepts is called from junction's ACCEPTS 17:32
let's just say that at some point this code would have a stack trace that'd have a hundred or so ACCEPTS in a row 17:33
just replacing the ~~ there with eq makes the profile really tiny 17:38
i think we see that one type so many times because we actually create many types of that name 17:39
dogbert11 shouldn't they be aggregated? 17:41
timotimo not if they are different actual types 17:42
MasterDuke so does anything very recursive blow up profiles? 17:44
17:47 FROGGS_ joined
timotimo yep 17:47
thing is, we create one role for every time we mix in the CountOnlyBoolOnlyDelegate 17:50
which is a role that just forwards a call to bool-only and a call to count-only to another iter object's method of the same name
i'd think it'd be better to have an attribute mixed in by the role that stores this delegate target
i'm not sure why the name shows up as anon|23 and anon|33, like, how do these numbers get so short? 17:51
MasterDuke that would just help/fix that particular bit of code, right? 17:54
timotimo i expect this causes a bit of performance degradation in everything that uses this particular piece of rakudo iterator tech 17:55
MasterDuke i'm kind of impressed, `sub f($n) { $n * ($n < 2 ?? $n !! f($n - 1)) }; say f(40_000)` only created a 19mb profile
timotimo also, it's not the cause for the huge profile; the use of ~~ against lists/junctions is
MasterDuke ah
dogbert11 the program actually generates a MoarVM panic if MVM_GC_DEBUG=2 17:57
message is 'MoarVM panic: Invalid assignment (maybe of heap frame to stack frame?)'
timotimo yeah, i was hunting that yesterday
dinner preparation time now, though
No such method '!set-delegate-target' for invocant of type '<anon|23>+{Rakudo::Iterator::CountOnlyBoolOnlyDelegate}'. Did you mean '!set-delegate-target'? 18:11
:|
got it down 18:16
one Rakudo::Iterator::CountOnlyBoolOnlyDelegate
one <anon|23>+{Rakudo::Iterator::CountOnlyBoolOnlyDelegate}
so ... 18:24
mixin goes from 21.6ms inclusive time down to 2.77ms inclusive time
with the same amount of calls to it :)
notably because generate_mixin now gets called only once, rather than the 39 times that mixin itself is called
same for set_is_mixin and setup_mixin_cache 18:25
oh, wow
from 2 gc runs down to 1
dogbert11 what are you up to ? 18:27
timotimo i'm not sure if these two profiles were measuring the same code 18:28
18:36 robertle joined 18:52 Kaiepi joined
timotimo i'll probably change it back a tiny bit so that it's actually parameterized on something, but only on the target iterator's type 19:15
that could give us better specializations, i think 19:16
though we may not be calling bool-only or count-only a million times in regular code 19:17
19:22 bisectable6 joined
Geth MoarVM: 67e5093f0e | (Timo Paulssen)++ | src/debug/debugserver.c
only suspend on actually must-suspend breakpoints
20:12
FROGGS jnthn: I found something interesting when hunting the DBDish::mysql instability 20:37
this causes a double free quite often: github.com/MoarVM/MoarVM/blob/mast...rp.c#L5040
timotimo oh, huh! 20:38
jnthn Oops...but also, huh...shouldn't that only be set during type creation? 20:39
timotimo probably should, yeah 20:40
FROGGS aye
timotimo .o( put a lock on it )
FROGGS these two got in: 20:41
// debugname = 0x7fffe44d7010 "NativeCall::Types::Pointer[MoarVM::Guts::REPRs::MVMArrayB]"
// debugname = 0x7fff7c080fa0 "Array[DBDish::mysql::Native::MYSQL_BIND]"
20:42 lizmat joined
FROGGS okay, in Pointer.^parameterize we call .^set_name, which calls that op 20:42
that's still during type creation, right?
timotimo should be, yeah
the only way to double-free there is to get two threads into the tiny space between MVM_free(STABLE(obj)->debug_name) and STABLE(obj)->debug_name = debugname 20:43
which is possible if the encode_C_string causes GC i suppose
FROGGS that's what I thought too
timotimo this would be a time for helgrind if it weren't so noisy due to many false-positives :( 20:44
FROGGS wow, yes, it says a lot 20:52
hmmm 20:54
MVM_string_utf8_encode_C_string cannot cause GC, right? I mean, it returns a char* 20:55
timotimo oh, yes
jnthn It'd seem to mean that two threads are trying to concurrently set the name 21:00
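(A stripped-down sketch of the window described above, plus one possible way to close it with an atomic pointer swap. The struct and function names here are placeholders, not MoarVM's actual code, and a plain lock around name-setting — the "put a lock on it" option — would work just as well.)

```c
#include <stdatomic.h>
#include <stdlib.h>
#include <string.h>

/* Placeholder for the part of a type's stable that holds the debug name. */
typedef struct {
    _Atomic(char *) debug_name;
} FakeSTable;

/* The racy shape discussed above:
 *
 *     free(st->debug_name);          <-- a second thread can free the same
 *     st->debug_name = new_name;         pointer inside this window
 *
 * With an atomic exchange, exactly one thread obtains (and frees) any given
 * old pointer, so the double free goes away.  Readers still holding the old
 * name across the swap would need extra protection, hence the lock option. */
static void set_debug_name(FakeSTable *st, const char *name) {
    char *fresh = strdup(name);                              /* encode step   */
    char *old   = atomic_exchange(&st->debug_name, fresh);   /* publish       */
    free(old);                                               /* freed exactly once */
}

int main(void) {
    FakeSTable st;
    atomic_init(&st.debug_name, NULL);
    set_debug_name(&st, "NativeCall::Types::Pointer[MoarVM::Guts::REPRs::MVMArrayB]");
    set_debug_name(&st, "Array[DBDish::mysql::Native::MYSQL_BIND]");
    free(atomic_load(&st.debug_name));
    return 0;
}
```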
FROGGS the type named Array[DBDish::mysql::Native::MYSQL_BIND] is on DBDish::mysql::Connection, which gets need'ed, not use'ed... 21:06
but need just loads at compile time without doing any imports? 21:07
okay, changing it to "use" makes no difference, but that's no surprise 21:10
timotimo did you grab some tracebacks?
FROGGS a backtrace from gdb? 21:11
timotimo call MVM_dump_backtrace(tc)
^- literally type that into gdb
FROGGS right
(gdb) call MVM_dump_backtrace(tc) 21:14
Invalid cast.
dogbert11 is call the same as p ?
timotimo just to be sure, tc is available in your currently selected frame?
no, p is call + print
FROGGS (gdb) p tc 21:15
$1 = 1.4616321449683623412809166386416848
timotimo lolwat
tc is supposed to be a pointer to a MVMThreadContext :)
of course we don't have full control over everybody's code
so maybe you're inside mysql's code or something?
FROGGS (gdb) p MVM_dump_backtrace 21:16
$2 = {void (MVMThreadContext *)} 0x7ffff76ac870 <MVM_dump_backtrace>
so, that's in place at least
lizmat timotimo: re github.com/rakudo/rakudo/commit/9f...1f0bfR1430
that being a private method, is that correct?
timotimo oops, it is not correct! 21:17
FROGGS MVM_interp_run (tc=0x5cf6, tc@entry=0x7fffdc0f3290, initial_invoke=0x0, invoke_data=0x6, invoke_data@entry=0x7fffdc13c4b0) at src/core/interp.c:5040
lizmat also: github.com/rakudo/rakudo/commit/9f...1f0bfR1436
FROGGS which tc shall I use?
lizmat ok,
timotimo: will fix it :-)
timotimo too late
this does probably mean that this is untested by the spec test suite 21:18
otherwise i wouldn't have gotten a PASS
lizmat yeah :-)
timotimo lizmat++
i wish we could get the supervisor process to not allocate anything on the heap :) 21:19
FROGGS timotimo: look gist.githubusercontent.com/FROGGS/...tfile1.txt
timotimo oh, huh. i wonder if NativeHelpers needs to have its guts updated perhaps? 21:20
it does kind of do terrible things with internal structs if i remember correctly
the code start { }; sleep 60 will run the GC 8 times 21:21
ah, getrusage-total allocates, of course
lizmat timotimo: perhaps we shouldn't use a sub for that ? 21:22
timotimo no, it's the nqp::getrusage op 21:23
lizmat ah, ok
timotimo the profiler didn't add allocation logging to that op yet; it'll show up in the next profile i'll take
FROGGS okay, here we create types at runtime: github.com/salortiz/NativeHelpers-...ob.pm6#L27 21:24
timotimo hm. we don't actually have anything to log the allocation of nested objects. like getrusage will create a hash with a bunch of numbers in it, but we'll only count the hash 21:26
ah, not a hash, a BOOTIntArray 21:27
that's a lot better than i thought
5.8k objects in the 60 seconds i slept
in theory we could change getrusage to write to an int array you pass it, so we could re-use that … 21:28
lizmat that sounds like an excellent plan
this would sit well with race conditions I guess
timotimo how do you mean? 21:29
lizmat ah, no, somehow I was thinking it used one buffer internally (probably P5 thinking)
the whole issue was that it created a new one each time, right ?
timotimo oh, no, we'd keep the array in our "user" code 21:30
a perl6-level getrusage sub would allocate the array and fill it immediately
lizmat and giving it a list_i puts the responsibility in the hands of the dev
timotimo but the TPS could re-use the same object over and over
lizmat right
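(The reuse idea at the C level, using the real getrusage(2) call but a hypothetical fill-a-caller-buffer interface — the eventual nqp-level op may look different. The buffer is allocated once by the caller and refilled on every poll, so a supervisor loop that samples it causes no per-sample allocations.)

```c
#include <stdint.h>
#include <stdio.h>
#include <sys/resource.h>
#include <sys/time.h>

/* Fill a caller-provided buffer instead of allocating a fresh array on every
 * call; a polling loop (like the thread-pool supervisor) can keep one buffer
 * alive and reuse it for every sample. */
enum { RU_UTIME_US, RU_STIME_US, RU_MAXRSS_KB, RU_FIELDS };

static int fill_rusage(int64_t out[RU_FIELDS]) {
    struct rusage ru;
    if (getrusage(RUSAGE_SELF, &ru) != 0)
        return -1;
    out[RU_UTIME_US]  = (int64_t)ru.ru_utime.tv_sec * 1000000 + ru.ru_utime.tv_usec;
    out[RU_STIME_US]  = (int64_t)ru.ru_stime.tv_sec * 1000000 + ru.ru_stime.tv_usec;
    out[RU_MAXRSS_KB] = ru.ru_maxrss;        /* kilobytes on Linux */
    return 0;
}

int main(void) {
    int64_t buf[RU_FIELDS];                  /* allocated once, reused forever */
    for (int i = 0; i < 3; i++) {
        if (fill_rusage(buf) == 0)
            printf("user+sys: %lld us\n",
                   (long long)(buf[RU_UTIME_US] + buf[RU_STIME_US]));
    }
    return 0;
}
```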
timotimo FWIW, there are other objects we allocate a whole lot more of
lizmat such as ?
timotimo Num, Scalar, BOOTCode, NumLexRef, BOOTHash, IntLexRef, IntAttrRef are all above 10k 21:31
Num at 88k
lizmat hmmm... we shouldn't have any Nums :-(
timotimo you think?
lizmat aaaahhhh lemme see 21:32
timotimo we do have a bunch of native num attributes, those shouldn't be making Num objects, but that's simply a case of "we box just to immediately unbox" i bet
lizmat there *is* a infix:<*> involved 21:33
timotimo well, from the allocation numbers i see 41k from Int.pm6's Bridge method, 17.6k from Num.pm6's infix:</>, 17.5k from Real.pm6's infix:</> and *then* Num.pm6's infix:<*> 21:34
a big portion of Scalar allocations seem to come from iterating over a Range, it seems like 21:35
i.e. prefix:<^>, iterator from Range.pm6, line 1703 from Rakudo/Iterator.pm6, SET-SELF from 5 lines below that, 11.7k Scalars in Range.pm6's SET-SELF, and 17.6k from Range.pm6's new 21:36
i'll be AFK for a bit
anyway, the range in question is most probably from prod-affinity-workers 21:38
sounds like an easy win to me 21:39
lizmat looks 21:40
timotimo perl6 --profile -e 'start { }; sleep 60' and then look at the profiler's allocations tab 21:43
lizmat timo: I think I got a good idea what's going on there 21:59
timotimo: but am too tired now to tinker with this right now
(another 6 hours in the car today)
so, will look at it tomorrow 22:00
timotimo OK, maybe i'll just do it right now :) 22:09
you rubberducked good, though :) 22:10
just by replacing the for ^worker-list.items with a loop ( ) loop i got it down to 6 gc runs in the same time it did 8 before 22:20
i might need to run 2 minutes to get more precise measurements 22:21
MasterDuke is it faster?
timotimo it runs pretty much exactly 60 seconds :P 22:22
MasterDuke heh 22:23
timotimo i should have run it with "time" to get proper cpu time measurements
The profiled code ran for 60005.54ms. Of this, 29.98ms were spent on garbage collection (that's 0.05%).
The profiled code ran for 60005.6ms. Of this, 28.19ms were spent on garbage collection (that's 0.05%).
that's before -> after, so somehow it got the tiniest bit slower. which i'll just call "noise" :) 22:24
now i'm down to 5 collections
124238 (60.21%) 22:26
56587 (61.84%)
50127 (59.79%)
interesting development (this is call frames in total and percentage eliminated via inlining) 22:27
aha, i see why getrusage-total isn't being jitted. it's mostly being entered inlined via a frame that also has nqp::cpucores, which isn't jitted 22:30
Geth MoarVM: b1f64db89b | (Zoffix Znet)++ | src/core/coerce.c
Add missing include for Grisu3 dtoa function

Fixes github.com/MoarVM/MoarVM/issues/825 M#825
22:32
synopsebot M#825 [open]: github.com/MoarVM/MoarVM/issues/825 implicit function declaration compiler warning for function ‘dtoa_grisu3’
timotimo hah, now it's bailing on sleep 22:34
lovely! 22:42
jit-compiled frames: 96.93% (120908) 22:43
22:50 greppable6 joined
timotimo cool, got rid of some NumLexRefs 22:50
and Num on top of that 22:51
both prod-affinity-workers and .sum allocate a hash for named arguments even though they don't get passed any; wonder why that happens 22:53
probably deopt annoyingness 22:55
huh, prod-affinity-workers doesn't show up in the spesh log :| 22:57
MasterDuke nice. i've never really figured out how to reduce allocations of things
22:57 evalable6 joined
timotimo oh, no, it is in there, i just misspelt it 22:58
oh, wrong again 22:59
funny, it speshes prod-affinity-workers, but only if profiling is turned on. so maybe the profiling overhead makes the cpu usage go up a tiny bit and makes the scheduler decide to create an additional worker? 23:02
MasterDuke what if instead you start some other thread doing random work? 23:05
timotimo then the impact on gc runs won't be as visible 23:09
running a 120s profile now 23:10
changed a few / to nqp::div_n
oh, huh, 5 gc runs for 120s as well 23:13
oh wow
m: say [+] 53084, 17770, 17760, 13444, 6303, 5959, 5948
camelia 120268
23:13 committable6 joined
timotimo m: say [+] 24873, 12066, 11896, 11877, 11867, 11861, 11857 23:14
camelia 96297
timotimo m: say (120268 * 2) R/ 96297
camelia 0.4003434
timotimo so we're now only allocating 40% as many objects - though i didn't account for how big each object is
MasterDuke that's a big reduction in count! 23:15
timotimo aye
i'll run 5 minutes now 23:16
jnthn was rather not happy about the thought of making the ThreadPoolScheduler's code less readable, though 23:18
MasterDuke i would think if it's been proven bug-free for a while now he'd be more amenable to optimizing it 23:19
can always make a PR for comments 23:20
timotimo OK, so 8 GC runs over 5 minutes, that's not so bad 23:21
whoops, i made it no longer allocate as many BOOTCode (hardly any anymore) but also made prod-affinity-workers unjittable 23:27
MasterDuke "Timotimo's Choice" 23:28
timotimo queuepoll's not jitted :) 23:29
MasterDuke jit all the ops!! 23:30
timotimo yeah why not :P 23:33
vim shouldn't let me ctrl-p into the install/ folder %) 23:38
actually, there shouldn't be an install folder under moarvm/ anyway
got the frame jitted again, yay 23:40
23:43 Kaiepi joined
MasterDuke and not allocating as many BOOTCodes? 23:48
timotimo 71 over the course of the whole run 23:49
MasterDuke cool beans 23:53
timotimo i.imgur.com/3upvzFE.png
the tiniest difference %)
huh, the bytecode at the end is actually bigger 23:56
OK, the devirtualized calls are actually more arguments to put on the stack 23:57
but the call itself is more direct
m: say 30290 - 29954 23:59
camelia 336
timotimo that is not much
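(A generic illustration of that trade-off — nothing here is MoarVM's actual JIT or REPR interface: the indirect, "virtual" shape calls through a function pointer with a single packed argument, while the devirtualized shape calls the concrete function directly, so each call site has more argument setup but no indirect branch.)

```c
#include <stdio.h>

/* Indirect shape: call through a function pointer looked up at run time,
 * passing one packed argument block. */
typedef struct { void *tc, *obj; long index, value; } PackedArgs;
typedef void (*GenericOp)(PackedArgs *);

static void bind_pos_generic(PackedArgs *a) {
    printf("generic: obj=%p idx=%ld val=%ld\n", a->obj, a->index, a->value);
}

/* Devirtualized shape: the concrete function is known when the code is
 * generated, so the call is direct, but its wider signature means more
 * arguments to set up at every call site. */
static void bind_pos_direct(void *tc, void *obj, long index, long value) {
    (void)tc;
    printf("direct:  obj=%p idx=%ld val=%ld\n", obj, index, value);
}

int main(void) {
    long       storage[4] = { 0 };
    GenericOp  op = bind_pos_generic;          /* resolved at run time        */
    PackedArgs a  = { NULL, storage, 2, 42 };
    op(&a);                                    /* indirect call, one argument */
    bind_pos_direct(NULL, storage, 2, 42);     /* direct call, four arguments */
    return 0;
}
```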