00:05
dalek joined
02:49
dalek joined
nine | afl-fuzzy++ | 05:38 | |
timotimo++ :) | 05:39 | ||
05:41
travis-ci joined
travis-ci | MoarVM build errored. Timo Paulssen 'uncuddle an else' | 05:41 | |
travis-ci.org/MoarVM/MoarVM/builds/154295591 github.com/MoarVM/MoarVM/compare/b...0666638314 | |||
05:41
travis-ci left
05:49
lizmat joined
06:49
domidumont joined
06:54
domidumont joined
07:47
zakharyas joined
08:13
TheLemonMan joined
TheLemonMan | timotimo, wrt your param checking code, I think the code would look neater if you put the outer if into the for; that way you can avoid writing two similar for loops | 08:42 | |
#145 can be closed as it's been fixed at the rakudo level and roast tests have been submitted | 08:43 | ||
also #175 seems to be fixed now | 09:08 | ||
09:26
lizmat_ joined,
dalek joined
timotimo | i thought the code would be clearer if i separate it out | 09:31 | |
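For concreteness, here is a hypothetical before/after of TheLemonMan's suggestion; the actual param-checking code isn't quoted in the log, so check_named(), check_pos(), params and num_params are made-up names used purely for illustration:

    void check_named(int param);         /* hypothetical helpers */
    void check_pos(int param);

    /* before: two nearly identical loops behind an outer if */
    void check_params_before(int named, int *params, int num_params) {
        if (named) {
            for (int i = 0; i < num_params; i++)
                check_named(params[i]);
        } else {
            for (int i = 0; i < num_params; i++)
                check_pos(params[i]);
        }
    }

    /* after: the if lives inside a single loop, so the loop is written once */
    void check_params_after(int named, int *params, int num_params) {
        for (int i = 0; i < num_params; i++) {
            if (named)
                check_named(params[i]);
            else
                check_pos(params[i]);
        }
    }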
dalek | arVM: bace47f | LemonBoy++ | src/profiler/heapsnapshot.c: snprintf returns an int, not a size_t. | 10:11 |
arVM: 5c7fc80 | lizmat++ | src/profiler/heapsnapshot.c: Merge pull request #396 from LemonBoy/tautological-compare snprintf returns an int, not a size_t. | |||
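The tautological-compare PR above targets a common C pitfall; a minimal sketch of the bug class (my illustration, not the actual heapsnapshot.c code) looks like this:

    #include <stdio.h>

    /* snprintf returns int, so stashing the result in a size_t makes the
     * error check below tautologically false: an unsigned value is never
     * negative, and the compiler warns about the always-false comparison */
    void describe_buggy(char *buf, size_t len) {
        size_t n = snprintf(buf, len, "value=%d", 42);
        if (n < 0)                       /* tautological: n is unsigned */
            fprintf(stderr, "can never get here\n");
    }

    void describe_fixed(char *buf, size_t len) {
        int n = snprintf(buf, len, "value=%d", 42);   /* keep the int return type */
        if (n < 0)                       /* a meaningful error check again */
            fprintf(stderr, "snprintf failed\n");
    }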
10:47
travis-ci joined
travis-ci | MoarVM build passed. lizmat 'Merge pull request #396 from LemonBoy/tautological-compare | 10:47 | |
travis-ci.org/MoarVM/MoarVM/builds/154407989 github.com/MoarVM/MoarVM/compare/4...7fc80bc3c1 | |||
10:47
travis-ci left
12:41
zakharyas joined
12:43
zakharyas1 joined
timotimo | so, you know how python can look for modules in a zipfile that's concatenated to the binary? | 14:31 | |
maybe we should take that route towards making moar-based "fat packs" | |||
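One way a moar-based "fat pack" launcher could locate a zip archive appended to its own executable is the trick Python's zipimport relies on: scan backwards from the end of the file for the zip End Of Central Directory signature. A rough sketch, where find_zip_eocd(), self_path and the 70000-byte search window are illustrative choices rather than anything from MoarVM:

    #include <stdio.h>
    #include <string.h>

    /* scan backwards for the ZIP End Of Central Directory signature
     * ("PK\x05\x06"); the EOCD sits within the last 22 + 65535 bytes,
     * so a ~70000-byte window from the end is enough */
    static long find_zip_eocd(const char *self_path) {
        FILE *f = fopen(self_path, "rb");
        if (!f)
            return -1;
        fseek(f, 0, SEEK_END);
        long size  = ftell(f);
        long start = size > 70000 ? size - 70000 : 0;
        long found = -1;
        for (long pos = size - 22; pos >= start; pos--) {
            unsigned char sig[4];
            fseek(f, pos, SEEK_SET);
            if (fread(sig, 1, 4, f) == 4 && memcmp(sig, "PK\x05\x06", 4) == 0) {
                found = pos;             /* offset of the EOCD record */
                break;
            }
        }
        fclose(f);
        return found;                    /* -1 if no embedded zip was found */
    }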
16:10
domidumont joined
16:46
brrt joined
brrt | timotimo: i agree, that is probably the way to go | 16:47 | |
you'd still need a reasonably clever wrapper executable | 16:48 | ||
and you'd need a statically linked moar (i'd think), or at least, static to our internal dependencies | 16:49 | ||
lwn.net/Articles/691070/ | 16:50 | ||
dude says CPython guys will start making and/or using a JIT | |||
personally, i think CPython has painted itself into so many corners on that front that I think it unlikely | |||
also, the claim is that pypy and friends can't deal well with ffi calls | 16:52 | ||
it's a bit similar to our situation with nativecalls | |||
but... we have some hope of jitting these fast in the future | 16:53 | ||
TheLemonMan | that's a start! | ||
brrt | TheLemonMan: if you care, i can explain you the current state of that :-) | 16:54 | |
TheLemonMan | brrt, I'm all ears :) | ||
brrt | basically, i've written the existing JIT, basically as a bolt-on to the spesh framework, about two years ago | ||
sorry, basically twice | 16:55 | ||
now the existing JIT is very simplistic: it literally translates the spesh graph (an annotated representation of the bytecode) to machine code using a little library called DynASM | 16:56 | ||
and the thing about that is: there is a literal mapping between moarvm opcodes and JIT output | |||
to such an extent that i can read the generated machine code back into the bytecode | 16:57 | ||
that's handy for debugging, i've got to say that | |||
but it is a lousy way to generate code, and furthermore, it means that we have no easy ways to do 'lowering transformations' | |||
so if we have an array object, and we know it is an array object, and that has an integer index, then it is still kind of tricky to translate that into efficient bytecode | 16:58 | ||
stop me if this stops making sense | 16:59 | ||
so, what i've done, is split the JIT into two stages | |||
the first stage translates the moarvm bytecode into a low-level DAG structure called 'expression trees' | |||
this is a lowering that transforms from MoarVMs memory-to-memory model, to a register-to-register model | 17:00 | ||
all this while the existing JIT is still in place, mind you | 17:01 | ||
and the second stage (called 'tiling') transforms those trees (or DAGs) to a linear list of operations | |||
machine code operations | |||
well, function pointers to routines that output machine code, but you get the idea | 17:02 | ||
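A highly simplified, hypothetical sketch of the two-stage shape described here (these are not the real MoarVM JIT structures; ExprNode, TileEmit and TileListEntry are invented names): bytecode is lowered to small expression trees, and tiling turns matched tree fragments into a linear list of code-emitting function pointers:

    typedef enum { EXPR_CONST, EXPR_LOAD, EXPR_ADD, EXPR_STORE } ExprOp;

    typedef struct ExprNode {
        ExprOp op;
        struct ExprNode *child[2];   /* operands, register-to-register style */
        long value;                  /* only used by EXPR_CONST */
    } ExprNode;

    /* a "tile" is a routine that knows how to emit machine code for one
     * matched fragment of the tree */
    typedef void (*TileEmit)(void *emitter_state, ExprNode *matched);

    /* tiling output: a linear list of (emit routine, covered node) pairs,
     * which is also the order a register allocator wants to walk */
    typedef struct {
        TileEmit  emit;
        ExprNode *node;
    } TileListEntry;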
had to hack DynASM to do that, because official DynASM doesn't play nice with extended registers in x64 | |||
so once i'd done that, i thought i'd just walk that list, invoke its generating functions, and call it a day, but it turns out that register allocation is both essential and tricky | 17:03 | ||
and that it really wants proper linear order and multiple passes and lookahead, and these things weren't there when i had to stop working on that fulltime | 17:04 | ||
(because i was in my final year of studying) | |||
anyway, i've more or less continued refactoring bits of the new JIT to get that nice linear order and a proper structure for the register allocator | 17:05 | ||
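The "proper linear order, multiple passes and lookahead" point is essentially what a linear-scan register allocator needs; a generic miniature version for flavour (my own sketch, not the MoarVM allocator; NUM_REGS and the Interval struct are invented for illustration):

    #define NUM_REGS 4

    typedef struct { int start, end, reg, expired; } Interval;

    /* iv[] must already be sorted by start position (the "linear order");
     * reg == -1 afterwards means the value has to be spilled */
    static void linear_scan(Interval *iv, int n) {
        int free_regs[NUM_REGS];
        int num_free = NUM_REGS;
        for (int r = 0; r < NUM_REGS; r++)
            free_regs[r] = r;
        for (int i = 0; i < n; i++) {
            /* expire intervals that ended before this one starts,
             * handing their registers back to the free pool */
            for (int j = 0; j < i; j++) {
                if (!iv[j].expired && iv[j].reg >= 0 && iv[j].end <= iv[i].start) {
                    free_regs[num_free++] = iv[j].reg;
                    iv[j].expired = 1;
                }
            }
            iv[i].reg = num_free > 0 ? free_regs[--num_free] : -1;  /* -1 = spill */
            iv[i].expired = 0;
        }
    }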
once that is done, it's basically good to be merged, and then the real fun starts | |||
because what we can do is take the specialized types and have them insert their own bit of expression code for opcodes | 17:06 | ||
so nativecall can be lowered at JIT time to any old function call | |||
and array indexes can be lowered to a single instruction | |||
types permitting, of course | |||
TheLemonMan | that sounds like an awesome plan! | 17:08 | |
brrt | other nice bits include: to port the JIT to another arch, for example ARM, we only need to port the tiler and register allocator (or have a data-driven register allocator, which is the actual plan) | 17:09 | |
and the tiles | |||
not the tiler | |||
tiler remains the same | |||
it is quite a nice plan if i do say so myself :-) | |||
(and we can do transforming modifications on the 'expression tree' like ... whatsitcalled | 17:10 | ||
when you remove duplicate computations of the same thing | |||
timotimo | brrt: what, pypy can't deal well with ffi calls? is that a joke? | 17:12 | |
brrt | well, that's not my claim | ||
timotimo | pypy already jits ffi calls into the same code a C compiler would | ||
where did that person get their info? | |||
brrt | really? | ||
well, then pypy is ahead of us | |||
(as i would expect, really) | |||
i dunno | |||
he's a scientist in a eh... UCB | 17:13 | ||
focussed on the 'big picture' | |||
timotimo | pypy is already ahead of pretty much everyone forever :) | 17:14 | |
brrt | anyhow, i think the issue with pypy and numpy is, or used to be, that they had their own implementation of numpy called numpypy | 17:15 | |
timotimo | i think you were looking for "common expression extraction" or something? | ||
that's right | |||
brrt | and their vectorization just isn't as good as the C and FORTRAN compiler's | ||
timotimo | i contributed a tiny bit of code to numpypy | ||
brrt | ah, yes, that's the one | ||
common subexpression elimination | |||
timotimo | yes! | ||
brrt | CSE is cool. although not a definite optimization, because it increases register pressure | 17:16 | |
can be done with a bottom-up traversal and a hash table | |||
i'll leave the rest to you :-P | 17:17 | ||
'an exercise for the reader' | |||
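The "bottom-up traversal and a hash table" recipe, in miniature (a generic value-numbering-style sketch, not MoarVM's actual pass; Node, buckets and cse_lookup_or_insert() are illustrative names):

    #include <stdint.h>

    #define TABLE_SIZE 1024

    typedef struct Node {
        int op;
        struct Node *left, *right;
        struct Node *next;               /* hash-bucket chain */
    } Node;

    static Node *buckets[TABLE_SIZE];

    static unsigned hash_node(int op, Node *l, Node *r) {
        return ((unsigned)op * 31u
                + (unsigned)(uintptr_t)l * 17u
                + (unsigned)(uintptr_t)r) % TABLE_SIZE;
    }

    /* called bottom-up, after a node's children have already been
     * deduplicated, so pointer equality of children is enough */
    static Node *cse_lookup_or_insert(Node *n) {
        unsigned h = hash_node(n->op, n->left, n->right);
        for (Node *c = buckets[h]; c; c = c->next)
            if (c->op == n->op && c->left == n->left && c->right == n->right)
                return c;                /* common subexpression: reuse it */
        n->next = buckets[h];
        buckets[h] = n;
        return n;                        /* first occurrence: remember it */
    }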
the literal LWN quote is: "The first consequence that Smith described is that, for libraries like NumPy, there is a "catch-22". If it needs to be fast for CPython, it has to be written in C, but if it needs to be fast for a JIT, you cannot use C. He showed a simple mysum() function that totaled up the elements in an iterable. If it is passed a Python object like list(range(N)), the JIT knows what it is and can do lots of optimizations. But if it is passed a | 17:21 | |
NumPy array, which is "opaque C stuff", the JIT doesn't understand it, so it will have trouble even achieving the performance of a non-NumPy version on a JIT-less CPython." | |||
that is pretty much certainly not true, but whatever | 17:22 | ||
timotimo | do you know of the CSE band? :) | ||
brrt | i do not know | 17:23 | |
timotimo | i bet if you use CSE for your band, you'll not be making catchy songs | 17:24 | |
brrt | hahahahaha | ||
17:24
domidumont1 joined
brrt | well, perhaps better than the deflate band | 17:24 | |
timotimo | hah | 17:26 | |
brrt | anyway, the latest latest bit is that i've finally started working on the register allocator, that i finally have a good theoretical basis for doing that, and that, in other words, i'm an idiot who underestimated the scope of the project | 17:32 | |
17:35
cygx joined
cygx | timotimo: perhaps they meant that pypy cannot optimize across the FFI boundary | 17:35 | |
timotimo | that'd be true, then | 17:36 | |
but neither can a C compiler, really | |||
cygx | Graal/Truffle (basically the same thing as PyPy with partial evaluation instead of tracing, cf stefan-marr.de/papers/oopsla-marr-d...valuation/ ) | ||
where did the rest of that sentence go? | 17:37 | ||
Graal/Truffle can do such optimizations by implementing a C as well as LLVM bitcode interpreter | |||
timotimo | then you'll have to keep the source around for things you want to ffi? | 17:38 | |
cygx | if you want to do inlining/partial evaluation/... across the FFI boundary, yes | 17:39 | |
timotimo | if so, i'm unwilling to keep calling that FFI :) | ||
cygx | (in practice, probably LLVM bitcode, not C source code) | 17:40 | |
17:44
brrt joined
brrt | why not bytecode interpretation | 17:45 | |
it isn't that hard, it's not like it's a friggin huge interface | |||
and they have the manpower to do it | |||
however, one has to ask oneself whether any of that is the point | |||
timotimo | you mean interpreting the actual x86 machine code? | 17:48 | |
brrt | aye | ||
why not | |||
timotimo | right ... | 17:49 | |
maybe we'll end up being the first ones to be crazy enough? we'll implement it for moarvm :P | |||
brrt | if you're so adamant to be faster than calling c | ||
hahaha | |||
we have a reputation for crazy to uphold | |||
... i'm done with internet technology news, though | 17:52 | ||
the frequency of times that i read something that begins interesting and ends with 'oh god, not this again' is too damn high | |||
timotimo | yes, oh lord | ||
brrt | oh lord? | ||
timotimo | tech news sites | 17:53 | |
jnthn | I don't think that was meant as an honorific :P | ||
brrt | no, neither did i | ||
jnthn | Though I like how it can be parsed that way :) | ||
brrt | but maybe i was coming across as eh.. oh well | ||
jnthn has his weekly day of Moar / Perl 6 hackery tomorrow | |||
brrt | \o/ | ||
what's on the menu this time | |||
jnthn | Same as the last few weeks, trying to robustify concurrency related things. :) | 17:54 | |
timotimo | here's an amusing one | ||
Program received signal SIGFPE, Arithmetic exception. | |||
0x00007ffff771e9e5 in deserialize_repr_data (tc=0x6047c0, st=0x609030, reader=<optimized out>) | |||
at src/6model/reprs/P6opaque.c:1063 | |||
brrt | amuse us | ||
timotimo | 1063 if (cur_offset % spec->align) { | ||
brrt | how | ||
timotimo | the fuzzer found that :) | ||
:D | |||
brrt | and spec->align can be zero how | 17:55 | |
timotimo | yes, how indeed. | ||
$2 = {inlineable = 0, bits = 0, align = 0, boxed_primitive = 0, can_box = 0, is_unsigned = 0 '\000'} | |||
apparently: "uninitialized" | |||
oh, apparently the fuzzer went ahead and just set the align value to 0 in the serialized blob | |||
and that's how it asplodes | |||
just a case of us not putting a check against 0 there to fail with "that's obviously BS." | 17:56 | ||
mighty AFL finds every single flaw, and then some. | |||
brrt | hmmm | 17:59 | |
yeah, arguably that is a flaw and wants a check | 18:00 | ||
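The check that fix amounts to (commit 9267528, pushed later in the log) is roughly the following; the snippet is reconstructed from the backtrace rather than copied from the repo, and the padding line and the adhoc-exception call are illustrative, not verbatim MoarVM code:

    /* reject a zero alignment read from the serialized blob up front, so
     * `cur_offset % spec->align` can no longer SIGFPE (modulo by zero) */
    if (spec->align == 0)
        MVM_exception_throw_adhoc(tc,
            "P6opaque: storage spec declares an alignment of 0");
    if (cur_offset % spec->align)                   /* safe now: align != 0 */
        cur_offset += spec->align - cur_offset % spec->align;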
18:37
utat joined
18:39
domidumont joined
jnthn | Phew, I think I might have finished writing up my grant report... | 18:46 | |
timotimo | the kind that goes on your blog? | 18:47 | |
or is there a separate kind you send off to TPF directly? | |||
#3 0x00007ffff75aef6c in MVM_interp_run (tc=tc@entry=0x6047c0, initial_invoke=0x0, invoke_data=0x1) | 18:48 | ||
how does that end up with initial_invoke being null? | |||
when called from MVM_vm_run_file | |||
[Coke] | I'm pretty sure most reports to the tpf on most grants are publicly blogged | ||
timotimo | but it then goes on to call toplevel_initial_invoke anyway | 18:49 | |
anyway, it's going on to frame_force_to_heap a null pointer, but i wonder where the earliest point is that we can spot this wrongness | 18:51 | ||
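One possible "earliest point to spot this wrongness" would be a guard right at the interpreter entry; purely illustrative, not an actual fix from the log:

    /* illustrative only: fail loudly on entry instead of letting
     * frame_force_to_heap chase a null frame much later */
    if (!initial_invoke)
        MVM_panic(1, "MVM_interp_run called with a NULL initial_invoke");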
jnthn | timotimo: The TPF kind that hopefully leads to payment :) | 18:54 | |
Yes, it will appear publicly | |||
timotimo | :D | 18:56 | |
jnthn | dinner; bbl | 19:00 | |
dalek | arVM: 9267528 | timotimo++ | src/6model/reprs/P6opaque.c: don't allow zero alignment in p6opaque storage spec | 19:25 |
arVM: edd5839 | timotimo++ | src/core/bytecode.c: index check lexicals when reading static flags | |||
JimmyZ | github.com/MoarVM/MoarVM/issues/234 # it would be nice if someone can fix it :) | 19:55 | |
SEGV bug | |||
jnthn | Hm, does it for me too | 19:57 | |
Will have a look in the morning :) | 19:59 | ||
JimmyZ | thanks | 20:00 | |
20:14
Ven joined
21:00
TheLemonMan joined
TheLemonMan | JimmyZ, wrt #234 I think it's due to a poor interaction between libuv and the exception handler: when an exception is raised you're basically longjmp'ing from a libuv callback to the opcode dispatch loop | 21:04 | |
eg: ptpb.pw/UOuH | 21:05 | ||
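A self-contained toy (not MoarVM or libuv code) showing that failure mode: longjmp'ing out of an event-loop callback back to the dispatch loop skips whatever bookkeeping the loop still had to do after the callback returned:

    #include <setjmp.h>
    #include <stdio.h>

    static jmp_buf dispatch_loop;

    /* stands in for a libuv callback */
    static void io_callback(void) {
        longjmp(dispatch_loop, 1);       /* "throw" straight past the loop */
    }

    /* stands in for uv_run(): the cleanup after the callback never runs */
    static void fake_uv_run(void) {
        io_callback();
        puts("per-handle cleanup");      /* skipped */
    }

    int main(void) {
        if (setjmp(dispatch_loop) == 0)
            fake_uv_run();
        else
            puts("back in the opcode dispatch loop, loop state now stale");
        return 0;
    }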
21:26
lizmat joined
timotimo | i suppose i can have a local patch to make that "crash" "not a crash" but a fail, so that i can get more "real" crashes when i run afl the next time | 23:17 | |
referring to the initial invoke thing where the frame is 0x0 | |||
23:27
TimToady joined