#moarvm on 10 September 2015 - Raku Programming Language Log

01:47 ilbot3 joined 02:26 tokuhiro_ joined 03:27 tokuhiro_ joined 03:39 tokuhiro_ joined 07:03 Ven joined 07:59 Ven joined, zakharyas joined
JimmyZ	A very nice book 'Static Single Assignment Book': ssabook.gforge.inria.fr/latest/book.pdf # almost complete, project address gforge.inria.fr/scm/?group_id=1950	08:26	Copy link Message link Add to gist Remove
08:48 FROGGS joined 08:50 lizmat joined 09:09 FROGGS joined
jnthn	JimmyZ++ # nice link indeed!	09:42	Copy link Message link Add to gist Remove
JimmyZ	scm.gforge.inria.fr/anonscm/svn/ss...torial.pdf # PPT for some more info :)	09:45	Copy link Message link Add to gist Remove
jnthn	m: say "SSA".flip # :P	09:48	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar c54773: OUTPUT«ASS␤»		Copy link Message link Add to gist Remove
timotimo	i wonder how many techniques we cannot reliably implement because older versions are not available in our ssa implementation	09:49	Copy link Message link Add to gist Remove
	of course we can always allocate a temporary register and set its value right after the desired version gets written	09:50	Copy link Message link Add to gist Remove
jnthn	"older versions"?		Copy link Message link Add to gist Remove
	Oh, I think know the issue you mean	09:51	Copy link Message link Add to gist Remove
	It's a trade-off.		Copy link Message link Add to gist Remove
	If you make the other one you get more costly/difficult deopt		Copy link Message link Add to gist Remove
timotimo	yes	09:52	Copy link Message link Add to gist Remove
	i remember that		Copy link Message link Add to gist Remove
	hum, now i remember i got nowhere with my deopt bridges thing yet	09:53	Copy link Message link Add to gist Remove
	not getting paid enough for complicated things :)	09:54	Copy link Message link Add to gist Remove
	not actually convinced i really can deal with complicated things that much better when I'm getting paid...	10:12	Copy link Message link Add to gist Remove
10:15 Ven joined
timotimo	i think i have the impostor syndrome	10:18	Copy link Message link Add to gist Remove
	but still better than impastor or inpasta		Copy link Message link Add to gist Remove
arnsholt	Is that where you make a lot of copy-pasta?	10:26	Copy link Message link Add to gist Remove
timotimo	mhhh pasta	10:30	Copy link Message link Add to gist Remove
	jnthn: were you able to find out why MapIterCommon doesn't have its new method spesh'd?	10:47	Copy link Message link Add to gist Remove
	if not, do you want me to litter the code with debug statements and figure it out?		Copy link Message link Add to gist Remove
jnthn	timotimo: No, if you could look into that it'd be great		Copy link Message link Add to gist Remove
	Because it's the kind of method that should spesh really well		Copy link Message link Add to gist Remove
timotimo	sure, did you use a specific benchmark for it?	10:48	Copy link Message link Add to gist Remove
jnthn	'cus it's jsut a bunch of binds to attributes		Copy link Message link Add to gist Remove
	for ^1000 { for ^1000 { } }		Copy link Message link Add to gist Remove
	Well, that may hit the for -> while opt maybe		Copy link Message link Add to gist Remove
	If that still works		Copy link Message link Add to gist Remove
timotimo	i recently fixed it	10:49	Copy link Message link Add to gist Remove
	it used to look for an &infix:<,> in th QAST, which was reoved duringGLR		Copy link Message link Add to gist Remove
jnthn	OK, well, just my @a = ^1000; for @a { for @a { } }		Copy link Message link Add to gist Remove
timotimo	rebuilding rakudo now	10:50	Copy link Message link Add to gist Remove
	interesting	10:55	Copy link Message link Add to gist Remove
	in the profile i'm looking at it gets called 1001		Copy link Message link Add to gist Remove
	1001 times, about 4/5th of those calls were even jitted		Copy link Message link Add to gist Remove
jnthn	Probably something initializ-y		Copy link Message link Add to gist Remove
	Oh?		Copy link Message link Add to gist Remove
timotimo	how did you reach the conclusion it doesn't get speshed?		Copy link Message link Add to gist Remove
	that piece of code initializes a crapton of IntLexRef	11:00	Copy link Message link Add to gist Remove
	3005001		Copy link Message link Add to gist Remove
	2002000 of those in sink-all and 1002001 in pull-one		Copy link Message link Add to gist Remove
	same with a :=, but i suspect that can be eased by implementing push-exactly or something in range's iterator	11:02	Copy link Message link Add to gist Remove
jnthn	Wow	11:03	Copy link Message link Add to gist Remove
	I need to look at the lex ref issues there		Copy link Message link Add to gist Remove
	Maybe that's what I'll do this evening		Copy link Message link Add to gist Remove
	I really need to do several hours on a $other-job today		Copy link Message link Add to gist Remove
timotimo	ah, sure		Copy link Message link Add to gist Remove
jnthn	But good to know		Copy link Message link Add to gist Remove
timotimo	there's a infix:<<> in the code you gave that gets only speshed, not jitted		Copy link Message link Add to gist Remove
	1002001 calls, 7.04% (155.9ms)	11:04	Copy link Message link Add to gist Remove
jnthn	How did I conclude it wasn't? Because in the Text::CSV profiler output it isn't being		Copy link Message link Add to gist Remove
timotimo	(exclusive time)		Copy link Message link Add to gist Remove
jnthn	So we'll need to look deeper :(		Copy link Message link Add to gist Remove
timotimo	i'll grab Text::CSV onto my laptop as well		Copy link Message link Add to gist Remove
	all the flaky mobile connections :\|	11:05	Copy link Message link Add to gist Remove
jnthn	if you liked it shoulda put 4G on it	11:06	Copy link Message link Add to gist Remove
timotimo	occasionally i do get 4G		Copy link Message link Add to gist Remove
	how do you invoke the benchmark?	11:07	Copy link Message link Add to gist Remove
jnthn	I created a file with this 1000 times:	11:08	Copy link Message link Add to gist Remove
	hello,","," ",world,"!"		Copy link Message link Add to gist Remove
	And then		Copy link Message link Add to gist Remove
	cat test-small.csv \| perl6-m -Ilib --profile test-t.pl		Copy link Message link Add to gist Remove
	You'll need to grab Slang::Tuxic and File::Temp and File::Directory::Tree also		Copy link Message link Add to gist Remove
timotimo	just noticed that	11:10	Copy link Message link Add to gist Remove
	rebootstrapping panda right now	11:11	Copy link Message link Add to gist Remove
	um ... even with test-t.pl i get almost 99% jitted new	11:15	Copy link Message link Add to gist Remove
	src/gen/m-CORE.setting:2696		Copy link Message link Add to gist Remove
	perhaps more worrying is sink-all of sequential map being 100% interpreted and 4.66% exclusive time	11:16	Copy link Message link Add to gist Remove
	and 520351 BOOTCode being allocated inside BUILDALL's while loop	11:17	Copy link Message link Add to gist Remove
	there's only 5010 calls to BUILDALL according to the routines tab	11:18	Copy link Message link Add to gist Remove
	hehe.	11:25	Copy link Message link Add to gist Remove
	List's iterator method has a class :: does Iterator in it		Copy link Message link Add to gist Remove
	that generates code to take a whole bunch of closures		Copy link Message link Add to gist Remove
	it gets called 36013 times		Copy link Message link Add to gist Remove
	allocates 180065 BOTOCode in total	11:26	Copy link Message link Add to gist Remove
	m: say 180065 / 36013	11:30	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 7c9911: OUTPUT«5␤»		Copy link Message link Add to gist Remove
timotimo	that's how many methods that class has :P		Copy link Message link Add to gist Remove
	i've just moved it out of the method and i'll see if it makes a big difference		Copy link Message link Add to gist Remove
	yeah, though i couldn't report it due to flaky network >_>	11:38	Copy link Message link Add to gist Remove
	55 instead of 59 gc runs	11:40	Copy link Message link Add to gist Remove
	we may want to move some more classes for iterators out of the methods that use them, to prevent taking closures fro all the methods		Copy link Message link Add to gist Remove
jnthn	Wait, why are they taking closrues?!		Copy link Message link Add to gist Remove
	If they are something's up with our code-gen	11:41	Copy link Message link Add to gist Remove
timotimo	they shouldn't be?		Copy link Message link Add to gist Remove
jnthn	The only time a method should take a closure is if it's an l-value		Copy link Message link Add to gist Remove
	uh damn		Copy link Message link Add to gist Remove
	r-value		Copy link Message link Add to gist Remove
	In a class body it's always in sink context	11:42	Copy link Message link Add to gist Remove
timotimo	how else would we have classes defined in inner scopes be able to refer to closed-over values?		Copy link Message link Add to gist Remove
jnthn	Classes aren't closures		Copy link Message link Add to gist Remove
timotimo	ah!		Copy link Message link Add to gist Remove
	well, then you can fix it :)		Copy link Message link Add to gist Remove
jnthn	Thus why we've had and fixed various bugs where people did refer to lexicals :)		Copy link Message link Add to gist Remove
	OK, I'll put it on my todo list along with looking at the code-gen issues that make too many lexicalrefs	11:43	Copy link Message link Add to gist Remove
	Hm, if we can fix these two then we would get GC runs down a lot		Copy link Message link Add to gist Remove
timotimo	likely (and hopefully)	11:44	Copy link Message link Add to gist Remove
jnthn	And so improve performance a whole lot		Copy link Message link Add to gist Remove
nwc10	other bloggage: morepypy.blogspot.co.at/2015/09/pyp...ments.html	11:47	Copy link Message link Add to gist Remove
timotimo	our GC isn't the fastest	11:48	Copy link Message link Add to gist Remove
	we still have those bunchtons of gen2 roots that are irking me a bit		Copy link Message link Add to gist Remove
	i'd love common gc run times to dwindle below 2ms :\|		Copy link Message link Add to gist Remove
	actually, if we want to ever be able to do 60fps game development or something, 2ms is still more than a single video frame	11:52	Copy link Message link Add to gist Remove
11:55 Ven joined
timotimo	maybe incremental GC would be a thing to consider at some point? i have no idea what requirements that adds to the rest of the VM and if we can get there easily enough	11:59	Copy link Message link Add to gist Remove
jnthn	On my box I see GC times of 3ms-4ms in various cases	12:02	Copy link Message link Add to gist Remove
nwc10	for now, I suspect that we get bigger net wins by doing other stuff, relying on KISS and the tail end of Moore's Law.	12:03	Copy link Message link Add to gist Remove
	but that's just an opinion.		Copy link Message link Add to gist Remove
jnthn	m: say 1/60	12:05	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 7c9911: OUTPUT«0.016667␤»		Copy link Message link Add to gist Remove
jnthn	That's a lot more than 0.002 :P		Copy link Message link Add to gist Remove
nwc10	jnthn: even if you're now in SECAM territory, surely as you're still in Europe, the correct fraction is 1/50 :-)		Copy link Message link Add to gist Remove
timotimo	wow, i want some of these gc times	12:07	Copy link Message link Add to gist Remove
	oh, i thought millisecond meants 1/100 second, haha, that's fail		Copy link Message link Add to gist Remove
	but still, i hardly ever get gc times as good as jnthn's getting :(	12:10	Copy link Message link Add to gist Remove
	or is that really just "in some cases"?		Copy link Message link Add to gist Remove
jnthn	I had 3.x ms average GC for the for lines('file'.IO) { }	12:11	Copy link Message link Add to gist Remove
	6-7ms is common in apps that are retaining more stuff		Copy link Message link Add to gist Remove
timotimo	let's see.	12:12	Copy link Message link Add to gist Remove
	ah, for that test-small.csv isn't big enough by far :)		Copy link Message link Add to gist Remove
	7 to 8 ms in that	12:13	Copy link Message link Add to gist Remove
	can our machines' performance differ this drastically?		Copy link Message link Add to gist Remove
	i'll teach the jit about continuationreset, that'll make pull-one jittable, which is at 15% exclusive time in the for-lines benchmark	12:18	Copy link Message link Add to gist Remove
jnthn	Wait, are you on latest?	12:19	Copy link Message link Add to gist Remove
	I made for 'foo'.IO.lines { } not use continuations	12:20	Copy link Message link Add to gist Remove
timotimo	oh!		Copy link Message link Add to gist Remove
jnthn	But sure, do that anyway :)		Copy link Message link Add to gist Remove
timotimo	shall i still go ahead?		Copy link Message link Add to gist Remove
jnthn	Because it'll make every gather/take thing faster :)		Copy link Message link Add to gist Remove
timotimo	right. first i'll have to get off this train, though		Copy link Message link Add to gist Remove
	damn, and my fav song of this album just came on :(		Copy link Message link Add to gist Remove
12:47 JimmyZ left, JimmyZ joined
timotimo	if something in interp.c sets the cur_op before calling the C function in question, i'll mark it :invokish in the oplist, so that the jit doesn't explode, right?	12:50	Copy link Message link Add to gist Remove
FROGGS	sounds reasonable		Copy link Message link Add to gist Remove
timotimo	though in this case it's not because it invokes stuff, but because it records the cur_op into the continuation's address	12:51	Copy link Message link Add to gist Remove
FROGGS	:throwish seems to have a similar effect	12:52	Copy link Message link Add to gist Remove
timotimo	BBL	12:59	Copy link Message link Add to gist Remove
13:18 virtualsue joined 13:37 brrt joined
brrt	\o	13:37	Copy link Message link Add to gist Remove
13:37 Ven joined
FROGGS	hi brrt	13:41	Copy link Message link Add to gist Remove
brrt	hi FROGGS	13:42	Copy link Message link Add to gist Remove
13:49 virtualsue left 14:33 tokuhiro_ joined 14:48 Ven joined
hoelzro	jnthn: regarding that string heap optimization, is the optimization that MoarVM SCs no longer have their own string heaps, and just expect the code to refer to the string heap in the bytecode itself?	15:21	Copy link Message link Add to gist Remove
	I really want to fix the nqp-js problem, and I think the only way to do that is to truly understand that optimization		Copy link Message link Add to gist Remove
jnthn	hoelzro: It's exactly that, yes	15:22	Copy link Message link Add to gist Remove
	hoelzro: Actually all we used to do was just build a string array		Copy link Message link Add to gist Remove
15:22 Ven joined
jnthn	So there was a huge push arr, "foo" sequence	15:22	Copy link Message link Add to gist Remove
	And the change was just to get SCs to use identical indexes to the string heap of the bytecode file itself	15:23	Copy link Message link Add to gist Remove
	So we could save that		Copy link Message link Add to gist Remove
	Which saved a bunch of work at startup		Copy link Message link Add to gist Remove
hoelzro	so the dependency string heap reference will almost definitely have a different index after the optimization, right? since it's referring to all strings in the compunit?	15:25	Copy link Message link Add to gist Remove
	or is that wrong? a compunit with a single SC would have essentially the same string heap as the SC itself, maybe?		Copy link Message link Add to gist Remove
	hoelzro looks as this as a good thing, because he never really understood the serialization stuff before	15:26	Copy link Message link Add to gist Remove
jnthn	iirc, the serializer pushes the unique strings into a list		Copy link Message link Add to gist Remove
	And then keeps that list somewhere internal in the VM	15:27	Copy link Message link Add to gist Remove
	Oh, on the current CompUnit I think		Copy link Message link Add to gist Remove
hoelzro	is there a way to get --dump to dump things like the string heap, or lower level info on the SCs in the compunit?		Copy link Message link Add to gist Remove
jnthn	And then uses it when it does the MAST -> bytecode		Copy link Message link Add to gist Remove
	Not that I'm aware of		Copy link Message link Add to gist Remove
16:04 FROGGS joined 17:14 Ven joined 18:30 arnsholt joined
timotimo	i'm puzzled	18:45	Copy link Message link Add to gist Remove
	as soon as the jit kicks in on "my num @values; loop { @values.push: 0.0e1 }" it complains "expected num register!"	18:46	Copy link Message link Add to gist Remove
18:46 vendethiel joined
timotimo	but the jit code that's responsible for what gets emitted there should really put MVM_reg_num64 into the slot that decides what happens	18:47	Copy link Message link Add to gist Remove
19:16 brrt joined 19:26 tokuhiro_ joined 19:38 Peter_R joined 20:07 Ven joined 21:03 brrt joined
brrt	holy mother of irregular instruction encoding	21:07	Copy link Message link Add to gist Remove
	timotimo: is that the old JIT? and what line says that?	21:08	Copy link Message link Add to gist Remove
	apparantly, kids, if and only if the register number of an indexed register is 4, then we need a second modrm byte, or something	21:10	Copy link Message link Add to gist Remove
timotimo	m)	21:11	Copy link Message link Add to gist Remove
	brrt: what do you mean "what line says that"?	21:12	Copy link Message link Add to gist Remove
brrt	what line says 'expteced num register!'		Copy link Message link Add to gist Remove
timotimo	ah		Copy link Message link Add to gist Remove
	that's from push		Copy link Message link Add to gist Remove
brrt	i expect we use push_n for that?	21:14	Copy link Message link Add to gist Remove
timotimo	it's as if this line was wrong:		Copy link Message link Add to gist Remove
	m)+ (op == MVM_OP_push_n \|\| op == MVM_OP_unshift_n) ? MVM_reg_num64 :		Copy link Message link Add to gist Remove
	(without the facepalm smiley in front)		Copy link Message link Add to gist Remove
	jitlog says it's been devirtualized		Copy link Message link Add to gist Remove
	and speshlog says it's actually a push_n		Copy link Message link Add to gist Remove
brrt	hmmm	21:15	Copy link Message link Add to gist Remove
timotimo	oh, my str @foo is NYI?		Copy link Message link Add to gist Remove
	perhaps a GLR thing?		Copy link Message link Add to gist Remove
brrt	i think .. i dunno		Copy link Message link Add to gist Remove
timotimo	it was also NYI pre-glr	21:17	Copy link Message link Add to gist Remove
brrt	the bytecode generation issue is an irregularity in the encoding of rbp	21:24	Copy link Message link Add to gist Remove
timotimo	x86 is hard		Copy link Message link Add to gist Remove
FROGGS	rbp?		Copy link Message link Add to gist Remove
timotimo	base pointer?		Copy link Message link Add to gist Remove
brrt	yes.... and also r12, since that looks just like rbp from the perspective of x86	21:26	Copy link Message link Add to gist Remove
	x86 is really, really hard		Copy link Message link Add to gist Remove
21:28 tokuhiro_ joined
brrt	actuayll, it's rsp, not rbp	21:30	Copy link Message link Add to gist Remove
	anyway...	21:31	Copy link Message link Add to gist Remove
	it looks like something i can crack		Copy link Message link Add to gist Remove
FROGGS	++brrt		Copy link Message link Add to gist Remove
brrt	keep the ++'s for when the commit comes :-P		Copy link Message link Add to gist Remove
FROGGS	sure :o)		Copy link Message link Add to gist Remove
	always got some in my pocket	21:32	Copy link Message link Add to gist Remove
timotimo	sounds like you know what way to go and all that might stand in your way is missing infrastructure for keeping the information that modrm needs to become bigger ... or something		Copy link Message link Add to gist Remove
brrt	hmm yeah, i guess	21:34	Copy link Message link Add to gist Remove
timotimo	does it seem like that's the last problem on your way?	21:35	Copy link Message link Add to gist Remove
brrt	last instruction encoding problem i'm aware of, yes		Copy link Message link Add to gist Remove
timotimo	awesome :)		Copy link Message link Add to gist Remove
brrt	there is a cheap-but-ugly workaround. it means giving up r12		Copy link Message link Add to gist Remove
	and not use it at all	21:36	Copy link Message link Add to gist Remove
timotimo	sure		Copy link Message link Add to gist Remove
	then you'll see if everything else works but that :)		Copy link Message link Add to gist Remove
brrt	that'd work. but it'd only be a matter of time before somebody would choose to push dynasm over the limit again		Copy link Message link Add to gist Remove
	possibly me		Copy link Message link Add to gist Remove
timotimo	giving up just a single register doesn't sound terrible		Copy link Message link Add to gist Remove
brrt	well, it's also all stack relative stuff	21:37	Copy link Message link Add to gist Remove
	r12 and rsp		Copy link Message link Add to gist Remove
timotimo	oh		Copy link Message link Add to gist Remove
	that's more interesting, then		Copy link Message link Add to gist Remove
brrt	why they are irregular, i don't know		Copy link Message link Add to gist Remove
timotimo	otherwise i'd have said "if fixing this takes too long, skipping it will get us to a working code gen faster"		Copy link Message link Add to gist Remove
brrt	well, i'm going to think about it more	21:38	Copy link Message link Add to gist Remove
	fairly sure this can be fixed		Copy link Message link Add to gist Remove
timotimo	mhm		Copy link Message link Add to gist Remove
brrt	we have the 'meaning' of the vreg at runtime		Copy link Message link Add to gist Remove
	so we can actually add/rewrite the bytes		Copy link Message link Add to gist Remove
	but it's tricky	21:39	Copy link Message link Add to gist Remove
	(the good bit is, it was already broken before i started ^^)		Copy link Message link Add to gist Remove
timotimo	yeah :)		Copy link Message link Add to gist Remove
	"we" being "inside the dynasm internals", right?		Copy link Message link Add to gist Remove
brrt	yes	21:41	Copy link Message link Add to gist Remove
	the runtime		Copy link Message link Add to gist Remove
	but it requires i study the entire bytes-meaning table	21:42	Copy link Message link Add to gist Remove
timotimo	urgh		Copy link Message link Add to gist Remove
	you'll has a bachelor of ft'aghn after that		Copy link Message link Add to gist Remove
brrt	wiki.osdev.org/X86-64_Instruction_E...addressing	21:43	Copy link Message link Add to gist Remove
	and this beaty here: wiki.osdev.org/X86-64_Instruction_E...dressing_2		Copy link Message link Add to gist Remove
timotimo	oh, that's not gigantic		Copy link Message link Add to gist Remove
	it's just a lot	21:44	Copy link Message link Add to gist Remove
brrt	yeah, it's managable, it's just highly irregular		Copy link Message link Add to gist Remove
21:45 kjs_ joined
timotimo	turn off your patern recognition brain parts and it'll feel better, eh? :D	21:45	Copy link Message link Add to gist Remove
	you won't even notice there's no regularity to it!		Copy link Message link Add to gist Remove
brrt	right :-)		Copy link Message link Add to gist Remove
	i'm going to sleep		Copy link Message link Add to gist Remove
	see you tomorrow!		Copy link Message link Add to gist Remove
timotimo	good night brrt!	21:46	Copy link Message link Add to gist Remove
22:29 tokuhiro_ joined 22:53 kjs_ joined 23:40 tokuhiro_ joined

Please report any issues / comments / feature requests as an issue on App::Raku::Log.

Thank you!