#moarvm on 16 October 2019 - Raku Programming Language Log

github.com/moarvm/moarvm \| IRC logs at colabti.org/irclogger/irclogger_logs/moarvm Set by AlexDaniel on 12 June 2018.
00:19 squashable6 left 00:22 squashable6 joined 00:45 ggoebel joined 00:53 lucasb left 03:14 ggoebel left 05:05 robertle left 06:49 camelia left 06:50 camelia joined 06:53 domidumont joined 07:14 discord6 left, discord6 joined 07:15 sena_kun joined 07:18 releasable6 left, releasable6 joined 07:20 sivoais_ joined, sivoais left 07:26 hankache joined 07:53 hankache left 08:42 sena_kun left 09:24 domidumont left 11:42 shareable6 joined 12:11 domidumont joined 12:20 ggoebel joined 13:50 lucasb joined 15:01 sena_kun joined
dogbert17	.seen nine	15:05	Copy link Message link Add to gist Remove
tellable6	dogbert17, I saw nine 2019-10-16T15:04:38Z in #perl6: <nine> In my experience, getting such feedback is a good time to evaluate my own behaviour, regardless of whether I feel that it should be directed at me or not		Copy link Message link Add to gist Remove
15:23 robertle joined 15:53 hankache joined
nine	I'm here ;)	16:05	Copy link Message link Add to gist Remove
timotimo	.o( MVM_op_get_mark could get optimized )	16:34	Copy link Message link Add to gist Remove
	8,483,131 self time before, 4,759,885 after, oh my	16:35	Copy link Message link Add to gist Remove
jnthn	That speeds up bytecode validation?	16:38	Copy link Message link Add to gist Remove
timotimo	little bit		Copy link Message link Add to gist Remove
	is there a tool that gives you C code for a lookup when given a lookup table?		Copy link Message link Add to gist Remove
	is "perfect hashing" the right keyword here?	16:39	Copy link Message link Add to gist Remove
dogbert17	nine: are you still there :)	16:42	Copy link Message link Add to gist Remove
timotimo	OK, so callgrind gives me the distribution of returns from MVM_op_get_mark		Copy link Message link Add to gist Remove
nine	dogbert17: now you're making me really curious why you're asking for me ;)	16:43	Copy link Message link Add to gist Remove
timotimo	179k -a (op >= 135 && op < 140), 4k op == 157, 1.5k ".s" (i.e. spesh ops), 76k op == 23, 71k op == 34 ":j", 71k op >= 51 && op < 56 ".r", 69k op == 127 "+a" and 303k op >= 128 && op <= 135 "*a", 169k " " (everything else)	16:44	Copy link Message link Add to gist Remove
dogbert17	I just wanted to tell you that I've drawn a blank wrt gist.github.com/dogbert17/26d3e358...e163246bc7		Copy link Message link Add to gist Remove
timotimo	at the moment i have it split between >= 135 and < 135	16:45	Copy link Message link Add to gist Remove
dogbert17	I couldn't really understand your comment from yesterday: "read_setup looks fishy. Looks to me like MVM_io_eventloop_add_active_work can allocate. Would need to MVMROOT t there"		Copy link Message link Add to gist Remove
timotimo	that doesn't seem optimal, the <135 case seems overrepresented		Copy link Message link Add to gist Remove
dogbert17	it looks ok to me but I must have missed something		Copy link Message link Add to gist Remove
nine	dogbert17: ah, sorry, my comment wasn't entirely correct. There is an issue, but it's not about t as that's not yet initialized.	16:47	Copy link Message link Add to gist Remove
	This can allocate: github.com/MoarVM/MoarVM/blob/mast...udp.c#L173		Copy link Message link Add to gist Remove
timotimo	4,744,146 to 4,873,498, not an improvement		Copy link Message link Add to gist Remove
nine	So *async_task is the thing that may be moved there and that needs MVMROOTing		Copy link Message link Add to gist Remove
dogbert17	aha		Copy link Message link Add to gist Remove
nine	Otherwise t may get initialized with an out of date pointer	16:48	Copy link Message link Add to gist Remove
dogbert17	I could definitely try that		Copy link Message link Add to gist Remove
nine	It may not cause the error you see but it's definitely a bug	16:49	Copy link Message link Add to gist Remove
timotimo	m: say 4759885 * 100 / 685470196		Copy link Message link Add to gist Remove Run code
camelia	0.6943970763	16:50	Copy link Message link Add to gist Remove
timotimo	m: say8483131 * 100 / 685470196		Copy link Message link Add to gist Remove Run code
camelia	===SORRY!=== Error while compiling <tmp> Two terms in a row at <tmp>:1 ------> say8483131 *⏏ 100 / 685470196 expecting any of: infix infix stopper postfix statement end state…		Copy link Message link Add to gist Remove
timotimo	m: say 8483131 * 100 / 685470196		Copy link Message link Add to gist Remove Run code
camelia	1.2375637992		Copy link Message link Add to gist Remove
timotimo	that was earliest version vs currently "best" version		Copy link Message link Add to gist Remove
	maybe i should bisect the range again		Copy link Message link Add to gist Remove
dogbert17	nine: fixed it locally but, as you suspected, it didn't fix the problem	16:51	Copy link Message link Add to gist Remove
nine	dogbert17: so do you know where t->body.schedulee gets initialized in the failing example?	16:58	Copy link Message link Add to gist Remove
timotimo	bleehhh		Copy link Message link Add to gist Remove
	MVMROOT2 is the line, callgrind reports "26 calls to repr_alloc_init, 7375 call to repr_get_str, 52 calls to MVM_fixed_size_alloc, 35561 calls to MVM_repr_elems, 135800 calls to MVM_coerce_simple_intify, 35535 calls to MVM_fixed_size_alloc, 209226 calls to MVM_repr_at_pos_o"	16:59	Copy link Message link Add to gist Remove
	gee, thanks		Copy link Message link Add to gist Remove
dogbert17	I tried to figure that out by using breakpoint in gdb, all I've seen so far is that github.com/MoarVM/MoarVM/blob/mast...udp.c#L537 is called several times an the third time it tends to fail on line 573	17:00	Copy link Message link Add to gist Remove
timotimo	that one "line" is doing a lot of heavy lifting obviously		Copy link Message link Add to gist Remove
nine	dogbert17: I'd probably just read te source to find that out. Someone must cause do_setup_setup to be called and that's probably where t gets set up	17:03	Copy link Message link Add to gist Remove
dogbert17	I guess I'll continue my hunt wut you're welcome to join :)	17:06	Copy link Message link Add to gist Remove
	*but		Copy link Message link Add to gist Remove
	nine: this line is called: github.com/MoarVM/MoarVM/blob/mast...udp.c#L660	17:08	Copy link Message link Add to gist Remove
nine	dogbert17: that looks quite ok. It's even a bit too paranoid. That MVMROOT5 there could just be MVMROOT(tc, task, ... as the other variables are not used anymore.	17:17	Copy link Message link Add to gist Remove
	dogbert17: what if an outdated pointer is already passed to MVM_io_socket_udp_async? You could add an assertion there		Copy link Message link Add to gist Remove
dogbert17	do you mean something like this? github.com/MoarVM/MoarVM/blob/mast...list.h#L50	17:23	Copy link Message link Add to gist Remove
nine	there's an MVM_GC_ASSERT_NOT_FROMSPACE or such	17:24	Copy link Message link Add to gist Remove
dogbert17	aha, will look for it		Copy link Message link Add to gist Remove
	found it	17:25	Copy link Message link Add to gist Remove
nine	You can sprinkle that around in the code to narrow down the place where things go wrong	17:26	Copy link Message link Add to gist Remove
timotimo	MVM_serialization_read_ref at 197,147,665 incl / 35,392,385 self before, 192,316,491 incl / 40,777,091 self after	17:30	Copy link Message link Add to gist Remove
	i devirtualized integer reading a little bit	17:31	Copy link Message link Add to gist Remove
	maybe MVM_repr_get_by_id isn't better than grabbing the REPR out of the object	17:32	Copy link Message link Add to gist Remove
17:33 domidumont left
timotimo	190,765,710 incl / 40,777,091 self afterwards	17:33	Copy link Message link Add to gist Remove
dogbert17	so far I can see that 'task', i.e. ssi->async_task, is busted here: github.com/MoarVM/MoarVM/blob/mast...udp.c#L540		Copy link Message link Add to gist Remove
nine	dogbert17: great! Is uv__finish_close in that backtrace as well?	17:34	Copy link Message link Add to gist Remove
dogbert17	#0 MVM_panic (exitCode=1, messageFormat=0x7ffff76323c8 "Collectable %p in fromspace accessed") at src/core/exceptions.c:832		Copy link Message link Add to gist Remove
	#1 0x00007ffff74a4138 in do_setup_setup (handle=0x7fffe4002630) at src/io/asyncsocketudp.c:543		Copy link Message link Add to gist Remove
	#2 0x00007ffff760ed25 in uv__finish_close (handle=0x7fffe4002630) at 3rdparty/libuv/src/unix/core.c:286	17:35	Copy link Message link Add to gist Remove
	I put the assert on line 543		Copy link Message link Add to gist Remove
nine	dogbert17: hey...wait a minute. Why does the setup_op_table not have a gc_mark entry?	17:41	Copy link Message link Add to gist Remove
dogbert17	perhaps it should	17:43	Copy link Message link Add to gist Remove
nine	I'm pretty sure that if during the life time of that task a GC may get triggered (like it can) the task needs a gc_mark function	17:44	Copy link Message link Add to gist Remove
17:45 hankache left, hankache joined
dogbert17	and this? github.com/MoarVM/MoarVM/blob/mast...ket.c#L727	17:45	Copy link Message link Add to gist Remove
nine	I don't see why not	17:48	Copy link Message link Add to gist Remove
	Oh, now I do. There's a generic gc_mark in MVMAsyncTask	17:49	Copy link Message link Add to gist Remove
	That takes care of queue, schedulee, cancel_notify_queue and cancel_notify_schedulee		Copy link Message link Add to gist Remove
timotimo	hrm. kcachegrind likes to read the source files over and over again	17:51	Copy link Message link Add to gist Remove
	or maybe i'm Doing It Wrong		Copy link Message link Add to gist Remove
dogbert17	but setup_op_table still needs a mark function ?	17:52	Copy link Message link Add to gist Remove
nine	dogbert17: ooh....only now I realized. You already mentioned ssi->async_task being busted. That's not t->body.schedulee but t itself. And that's kept in this SocketSetupInfo structure. But who marks the async_task pointer in that structure during GC?	17:53	Copy link Message link Add to gist Remove
dogbert17	perhaps no one	17:54	Copy link Message link Add to gist Remove
nine	That would be setup_gc_mark's job		Copy link Message link Add to gist Remove
	So now we know why we need such a function in asyncsocketudp but not in asyncsocket		Copy link Message link Add to gist Remove
dogbert17	indeed	17:57	Copy link Message link Add to gist Remove
timotimo	hurm. REPR(obj)->ass_funcs.bind_key(...) is faster than MVMHash_bind_key even though the latter doesn't have to do a function pointer deref	17:59	Copy link Message link Add to gist Remove
nine	timotimo: what the?	18:00	Copy link Message link Add to gist Remove
timotimo	the MVMHash_bind_key adds 100,000 estimated cycles		Copy link Message link Add to gist Remove
	30753 calls via the ass_funcs gives me 13,168,926 estimated cycles, and via MVMHash_bind_key it's 13,347,851	18:01	Copy link Message link Add to gist Remove
	maybe it's important that the reprops struct is const?		Copy link Message link Add to gist Remove
nine	timotimo: why is there even a bind_key function in MVMHash.c? Could just as well put that code into MVMHash_bind_key directly and use that in the repr_ops	18:03	Copy link Message link Add to gist Remove
timotimo	yeah		Copy link Message link Add to gist Remove
	let me do that	18:04	Copy link Message link Add to gist Remove
	13,192,210 for MVMHash_bind_key now, i should probably also double-check how stable the numbers are	18:05	Copy link Message link Add to gist Remove
nine	yeah, since it's only estimations	18:07	Copy link Message link Add to gist Remove
timotimo	Ir events are easily going up and down 100k out of the 496 mill	18:08	Copy link Message link Add to gist Remove
18:09 hankache left
timotimo	so comparing current state with master:	18:10	Copy link Message link Add to gist Remove
	m: say 496,122,671 * 100 / 507,117,300 # total cycle estimation cost of a perl6 -e 'say 0.1 + 0.2 - 0.3'	18:11	Copy link Message link Add to gist Remove Run code
camelia	496122132.34714117300		Copy link Message link Add to gist Remove
timotimo	haha oops		Copy link Message link Add to gist Remove
	m: say 496_122_671 * 100 / 507_117_300 # total cycle estimation cost of a perl6 -e 'say 0.1 + 0.2 - 0.3'		Copy link Message link Add to gist Remove Run code
camelia	97.83193573		Copy link Message link Add to gist Remove
timotimo	m: say 189_796_594 * 100 / 197_356_704 # inclusive cycles of MVM_serialization_read_ref	18:12	Copy link Message link Add to gist Remove Run code
camelia	96.169316853		Copy link Message link Add to gist Remove
timotimo	m: say 72_546_296 * 100 / 76_316_992 # inclusive cycles of MVM_validate_static_frame	18:13	Copy link Message link Add to gist Remove Run code
camelia	95.05916585		Copy link Message link Add to gist Remove
18:13 MasterDuke joined
timotimo	m: say 151_149_442 * 100 / 157_972_415 # inclusive cycles of deserialize	18:14	Copy link Message link Add to gist Remove Run code
camelia	95.680908594		Copy link Message link Add to gist Remove
timotimo	of course deserialize includes read_ref		Copy link Message link Add to gist Remove
dogbert17	nine: soemthing like 'static void setup_gc_mark(MVMThreadContext tc, void data, MVMGCWorklist worklist) { SocketSetupInfo ssi = (SocketSetupInfo *)data; MVM_gc_worklist_add(tc, worklist, &ssi->async_task); }'	18:17	Copy link Message link Add to gist Remove
timotimo	only even calling into MVM_intcache_get if the number falls into the right range is also a good idea	18:20	Copy link Message link Add to gist Remove
	from 356k calls to 169k calls	18:21	Copy link Message link Add to gist Remove
	but only from 2_961_990 to 2_028_775 cycles (because most of the extra calls returned NULL very early)		Copy link Message link Add to gist Remove
nine	dogbert17: makes sense, yes	18:23	Copy link Message link Add to gist Remove
dogbert17	and it actually seems to work	18:24	Copy link Message link Add to gist Remove
nine	yeah! \o/		Copy link Message link Add to gist Remove
	dogbert17: nice work!		Copy link Message link Add to gist Remove
dogbert17	nine: do we need to MVM_gc_worklist_add(tc, worklist, <something else>);	18:25	Copy link Message link Add to gist Remove
	nine++ for excellent help		Copy link Message link Add to gist Remove
	should we attack t/spec/S17-procasync/encoding.t now? MasterDuke mentioned something about a cast	18:27	Copy link Message link Add to gist Remove
	here's the gist of the matter: gist.github.com/dogbert17/e4cb7b4d...73a49c7727	18:28	Copy link Message link Add to gist Remove
MasterDuke	is there a safe way to cast `MVMGrapheme32 ` into `char `?		Copy link Message link Add to gist Remove
	dogbert17 waits with baited breath for an answer :)	18:29	Copy link Message link Add to gist Remove
	dogbert17 sigh should of course be bated	18:30	Copy link Message link Add to gist Remove
timotimo	what is your actual use case?		Copy link Message link Add to gist Remove
	if you just cast, you'll get quadruple-spaced rubbish text if you just print it		Copy link Message link Add to gist Remove
MasterDuke	heh, wondered if i should have said something, but you caught it		Copy link Message link Add to gist Remove
	github.com/MoarVM/MoarVM/blob/mast...#L520-L537		Copy link Message link Add to gist Remove
timotimo	if you just MVM_free the result of casting, it doesn't matter at all	18:31	Copy link Message link Add to gist Remove
	which is what throw_adhoc_free does with it		Copy link Message link Add to gist Remove
	m: say 494_254_583 * 100 / 507_117_300	18:34	Copy link Message link Add to gist Remove Run code
camelia	97.46356178		Copy link Message link Add to gist Remove
timotimo	i mean, it ain't an order of magnitude, but i'll take it		Copy link Message link Add to gist Remove
	should also look at whether the cycle count is reduced with my other branch that stores a bit of optional info in the discriminator number	18:35	Copy link Message link Add to gist Remove
	serialization_read_int is the function on second place after MVM_serialization_read_ref (in self cycle counts)	18:36	Copy link Message link Add to gist Remove
	but it's also called 938k times compared to read_ref's 714k times		Copy link Message link Add to gist Remove
	read_ref calls it 539k times, deserialize_stable calls it 181.8k times and deserialize calls it another 85k times	18:37	Copy link Message link Add to gist Remove
	of course the only way to gain a real amount of performance is to have to deserialize less during startup	18:46	Copy link Message link Add to gist Remove
Geth	MoarVM: dogbert17++ created pull request #1195: Add missing gc_mark entry to setup_op_table	18:49	Copy link Message link Add to gist Remove
18:49 reportable6 joined
MasterDuke	well, there's something about that code that's wrong. if i revert github.com/MoarVM/MoarVM/blame/mas...#L450-L468 back to before my change (i.e., to just `MVM_free(buffer); MVM_exception_throw_adhoc(tc, "Malformed UTF-8"); break;` valgrind doesn't complain anymore	18:50	Copy link Message link Add to gist Remove
nine	dogbert17, MasterDuke: pos is the index into char bytes. Why is it used to index into buffer? There's "count" for that	18:51	Copy link Message link Add to gist Remove
	buffer will often by smaller than bytes	18:52	Copy link Message link Add to gist Remove
MasterDuke	nine: are my indices correct (aside from using the wrong variable)?	18:54	Copy link Message link Add to gist Remove
	because just changing pos to count still has the exact same behavior	18:55	Copy link Message link Add to gist Remove
nine	may be off by one? Btw. why are you reading buffer in the first place and not bytes?	18:56	Copy link Message link Add to gist Remove
	Buffer will contain only the stuff that was correct if I understand this correctly		Copy link Message link Add to gist Remove
MasterDuke	i very well could be wrong, i originally wrote that quite a while ago	18:57	Copy link Message link Add to gist Remove
nine	I get totally different valgrind errors when I replace pos with count	18:59	Copy link Message link Add to gist Remove
MasterDuke	hm. maybe i need to do a make clean and start trying these changes over	19:00	Copy link Message link Add to gist Remove
nine	MasterDuke: also you shouldn't check bufsize >= 3 but count >= 3	19:09	Copy link Message link Add to gist Remove
	Otherwise count - 2 may well be negative if there's an error right at the start of the stream		Copy link Message link Add to gist Remove
MasterDuke	but if i really should be getting stuff out of bytes then should it be pos >= 3?	19:10	Copy link Message link Add to gist Remove
nine	think so	19:11	Copy link Message link Add to gist Remove
MasterDuke	i.e., keep using pos (and use its size in the if/elsif), but index into bytes		Copy link Message link Add to gist Remove
nine	also with pos you'll definitely have to subtract 1, because it indiscriminately incs it in decode_utf8_byte(&state, &codepoint, bytes[pos++])		Copy link Message link Add to gist Remove
MasterDuke	or just change the conditionals to 4, 3, 2?	19:12	Copy link Message link Add to gist Remove
	oh, you mean in the indexing...	19:13	Copy link Message link Add to gist Remove
	nine++, valgrind doesn't complain anymore	19:18	Copy link Message link Add to gist Remove
	guess i should check that those are the right/useful things to be printing though...	19:19	Copy link Message link Add to gist Remove
nine	one step at a time :)	19:20	Copy link Message link Add to gist Remove
Geth	MoarVM: db9b770f60 \| (Jan-Olof Hendig)++ \| src/io/asyncsocketudp.c Add missing gc_mark entry to setup_op_table Given that a gc might happen during the lifetime of the async connect task we need to add a gc_mark function to its ops table. This fixes an intermittent fromspace error which could otherwise occur. nine++ for helping out with this	19:22	Copy link Message link Add to gist Remove
	MoarVM: 37f1f0beff \| niner++ (committed using GitHub Web editor) \| src/io/asyncsocketudp.c Merge pull request #1195 from dogbert17/add-missing-gc-marking Add missing gc_mark entry to setup_op_table		Copy link Message link Add to gist Remove
19:25 sena_kun left
timotimo	hm, can we use the "no write barrier" versions of bindings in deserialize.c?	19:30	Copy link Message link Add to gist Remove
nine	timotimo: you mean because everything's gonna be in gen2 anyway?	19:35	Copy link Message link Add to gist Remove
timotimo	are they?	19:37	Copy link Message link Add to gist Remove
	hm, yeah, gc_allocate_object in the call graph goes directly to gen2_allocate_zeroed	19:38	Copy link Message link Add to gist Remove
	allocate_nursery is only called 24k times in total in my profile		Copy link Message link Add to gist Remove
	actually, here's a gc_allocate_zeroed that has a caller in MVM_serialization_read_ref / work_loop / demand_object	19:39	Copy link Message link Add to gist Remove
	yeah, can't rely on that	19:46	Copy link Message link Add to gist Remove
	if we know in one function that every allocation will be in gen2, then maybe we can do all allocations there directly and not hit write barriers maybe?	19:47	Copy link Message link Add to gist Remove
	but mostly we're doing read_ref, which could give us an object that had been created outside of our little bubble		Copy link Message link Add to gist Remove
dogbert17	libuv 1.33 is out github.com/libuv/libuv/blob/v1.x/ChangeLog	20:30	Copy link Message link Add to gist Remove
MasterDuke	libtommath 1.2.0 is going to be released soon. it won't have my speedups for mp_to_radix, but hopefully those should make it into the next release (which will most likely be 2.0.0)	20:43	Copy link Message link Add to gist Remove
timotimo	cool!	20:51	Copy link Message link Add to gist Remove
	looking forward to that		Copy link Message link Add to gist Remove
MasterDuke	arg. what's some bad utf-8 i can test with?	20:53	Copy link Message link Add to gist Remove
timotimo	i'd just create a Buf from random ints between 0 and 255		Copy link Message link Add to gist Remove
	maybe try 255 over and over	20:54	Copy link Message link Add to gist Remove
20:54 ggoebel left
timotimo	m: Buf.new(255, 255, 255, 255).decode("utf8").say	20:54	Copy link Message link Add to gist Remove Run code
camelia	Malformed UTF-8 at line 1 col 1 in block <unit> at <tmp> line 1		Copy link Message link Add to gist Remove
MasterDuke	ugh, that hits a different error path	20:55	Copy link Message link Add to gist Remove
20:56 moon_child left
MasterDuke	oh, but maybe i could improve that error now i have a better idea of what to do...	20:56	Copy link Message link Add to gist Remove
Geth	MoarVM/mildly-faster-startup: 7 commits pushed by (Timo Paulssen)++ - P6int: make set_int and get_int globally available - remove tiny indirection from MVMHash_at_key/bind_key - deserialization: rely on MVMHash being the known repr - deserialization: use P6int repr being known in read_ref - introduce MVMP6str_get_str and MVMP6str_set_str - deserialization: rely on P6str REPR being known - speed up MVM_op_get_mark by splitting it down the middle	20:58	Copy link Message link Add to gist Remove
timotimo	there are different kinds of broken utf8		Copy link Message link Add to gist Remove
21:01 moon_child joined
MasterDuke	oh! since i'm no longer reading from `buffer`, i can just reinstate the `MVM_free(buffer)`. but i'm guessing freeing `bytes` actually is a bad idea	21:02	Copy link Message link Add to gist Remove
21:23 ggoebel joined
Geth	MoarVM/mildly-faster-startup: 4 commits pushed by (Timo Paulssen)++ - introduce MVM_VMArray_bind_pos - deserialization: rely on known REPR in read_array_var/int/str - introduce MVM_VMArray_push - deserialization: rely on knowledge of owned_objects REPR	21:25	Copy link Message link Add to gist Remove
timotimo	^- that very last one is only worth like 2k out of the 495mil	21:26	Copy link Message link Add to gist Remove
	but ... shrug		Copy link Message link Add to gist Remove
	read_ref is 2.5mil above read_int in terms of self, but i've been shoveling cycles from inclusive to self so it's not really going to shift over	21:28	Copy link Message link Add to gist Remove
MasterDuke	what do all the commits do for startup time?		Copy link Message link Add to gist Remove
timotimo	all of them together?		Copy link Message link Add to gist Remove
MasterDuke	yeah		Copy link Message link Add to gist Remove
timotimo	probably almost nothing		Copy link Message link Add to gist Remove
	my "time" only goes two decimal places		Copy link Message link Add to gist Remove
	and the result is noisy		Copy link Message link Add to gist Remove
MasterDuke	you don't have /usr/bin/time?		Copy link Message link Add to gist Remove
timotimo	so all i have is the cycle estimates from callgrind		Copy link Message link Add to gist Remove
	that seems to be the same thing	21:29	Copy link Message link Add to gist Remove
	that is GNU Time		Copy link Message link Add to gist Remove
MasterDuke	oh yeah, it's plain time that has 3 decimal places	21:30	Copy link Message link Add to gist Remove
timotimo	that'd probably be bash time or something?		Copy link Message link Add to gist Remove
MasterDuke	yeah		Copy link Message link Add to gist Remove
	m: Buf.new(100, 101, 255, 254, 253, 252, 251).decode("utf8").say		Copy link Message link Add to gist Remove Run code
camelia	Malformed UTF-8 at line 1 col 3 in block <unit> at <tmp> line 1		Copy link Message link Add to gist Remove
MasterDuke	is "Malformed UTF-8 near bytes fffffffe fffffffd at line 1 col 3" any better?	21:31	Copy link Message link Add to gist Remove
timotimo	m: say .sum / .elems given <0.126 0.139 0.124 0.125 0.125 0.124 0.123 0.131>	21:32	Copy link Message link Add to gist Remove Run code
camelia	0.127125		Copy link Message link Add to gist Remove
21:32 ggoebel left
timotimo	m: say .sum / .elems given <0.143 0.152 0.139 0.138 0.139 0.139 0.142 0.150>	21:33	Copy link Message link Add to gist Remove Run code
camelia	0.14275		Copy link Message link Add to gist Remove
timotimo	those were the "after" values for real, and user		Copy link Message link Add to gist Remove
	m: say .sum / .elems given <126 125 130 123 123 123 127 128>	21:34	Copy link Message link Add to gist Remove Run code
camelia	125.625		Copy link Message link Add to gist Remove
timotimo	m: say .sum / .elems given <144 144 143 142 143 139 144 140>		Copy link Message link Add to gist Remove Run code
camelia	142.375		Copy link Message link Add to gist Remove
timotimo	those are the before values (but i skipped the 0. because typing)		Copy link Message link Add to gist Remove
	so i guess it got a little slower actually?		Copy link Message link Add to gist Remove
21:38 patrickb joined
timotimo	maybe it's the 139/152 "outlier" in the after times	21:38	Copy link Message link Add to gist Remove
	m: say .sum / .elems given <0.126 0.124 0.125 0.125 0.124 0.123 0.131>		Copy link Message link Add to gist Remove Run code
camelia	0.125429		Copy link Message link Add to gist Remove
timotimo	m: say .sum / .elems given <0.143 0.139 0.138 0.139 0.139 0.142 0.150>		Copy link Message link Add to gist Remove Run code
camelia	0.141429		Copy link Message link Add to gist Remove
timotimo	that'd make it a tiny tiny bit faster		Copy link Message link Add to gist Remove
	we should look into getting that branch ready for prime-time that puts perl6.moarvm into the binary to be loaded by the OS immediately on startup	21:39	Copy link Message link Add to gist Remove
	i don't remember how much it was, but i think it had a significantly positive impact on startup time		Copy link Message link Add to gist Remove
MasterDuke	all/any improvement to start up time is good	21:40	Copy link Message link Add to gist Remove
	how about "Malformed UTF-8 near bytes fe fd at line 1 col 3"?		Copy link Message link Add to gist Remove
timotimo	cool	21:42	Copy link Message link Add to gist Remove
	m: say .sum / .elems given <506698217 506806723 507023689 506843017>	21:48	Copy link Message link Add to gist Remove Run code
camelia	506842911.5		Copy link Message link Add to gist Remove
timotimo	m: say .sum / .elems given <493795874 493827445 493842729 493967884>	21:49	Copy link Message link Add to gist Remove Run code
camelia	493858483		Copy link Message link Add to gist Remove
timotimo	m: say 493858483 * 100 / 506842911.5		Copy link Message link Add to gist Remove Run code
camelia	97.43817498373		Copy link Message link Add to gist Remove
timotimo	m: say 100 - (493858483 * 100 / 506842911.5)		Copy link Message link Add to gist Remove Run code
camelia	2.56182501627		Copy link Message link Add to gist Remove
timotimo	juuuust above 2.5% fewer Ir events measured by callgrind	21:50	Copy link Message link Add to gist Remove
	which hopefully have a correlation to real timing		Copy link Message link Add to gist Remove
MasterDuke	cool		Copy link Message link Add to gist Remove
timotimo	(this is with spesh turned off btw, because that's extra noisy)		Copy link Message link Add to gist Remove
	but yeah, what we want is 50% faster or 95% faster or 99.5% faster	21:51	Copy link Message link Add to gist Remove
MasterDuke	would SPESH_BLOCKING be enough less noisy to be useful?		Copy link Message link Add to gist Remove
timotimo	yeah, i should try that		Copy link Message link Add to gist Remove
	i was too lazy		Copy link Message link Add to gist Remove
	yeah that doesn't look so bad	21:53	Copy link Message link Add to gist Remove
	rebuilding master now to compare		Copy link Message link Add to gist Remove
	m: say .sum / .elems given <830933453 830956253 830991918 830882743 830904953>	21:54	Copy link Message link Add to gist Remove Run code
camelia	830933864		Copy link Message link Add to gist Remove
timotimo	m: say .sum / .elems given <843944655 843913123 844031628 543946032 843832973>	21:56	Copy link Message link Add to gist Remove Run code
camelia	783933682.2		Copy link Message link Add to gist Remove
timotimo	m: say 830933864 * 100 / 83933682.2		Copy link Message link Add to gist Remove Run code
camelia	989.9885745749		Copy link Message link Add to gist Remove
timotimo	oops		Copy link Message link Add to gist Remove
	i put a wrong number in there	21:57	Copy link Message link Add to gist Remove
	m: say .sum / .elems given <843944655 843913123 844031628 843832973 843832973>		Copy link Message link Add to gist Remove Run code
camelia	843911070.4		Copy link Message link Add to gist Remove
timotimo	m: say 100 * 843911070.4 / 830933864		Copy link Message link Add to gist Remove Run code
camelia	101.561761647		Copy link Message link Add to gist Remove
timotimo	m: say 100 * 843911070.4 / 830933864 - 100	21:58	Copy link Message link Add to gist Remove Run code
camelia	1.561761647		Copy link Message link Add to gist Remove
timotimo	not surprising that the percentage difference is lower, because none of the things i changed should apply to the spesh thread		Copy link Message link Add to gist Remove
	one difference between with spesh and without spesh is that with spesh has _int_malloc at the very top for Self time	22:01	Copy link Message link Add to gist Remove
Geth	MoarVM: MasterDuke17++ created pull request #1196: Show correct values in encoding errors		Copy link Message link Add to gist Remove
22:34 patrickb left 22:42 sivoais_ left, sivoais joined 23:06 MasterDuke left

Please report any issues / comments / feature requests as an issue on App::Raku::Log.

Thank you!