|
Parrot 6.6.0 "Parrothead" | parrot.org/ | Log: irclog.perlgeek.de/parrot | #parrotsketch meeting Tuesday 19:30 UTC Set by moderator on 16 July 2014. |
|||
|
01:26
FROGGS_ joined
05:39
bighugedog joined
07:04
cooper joined
08:46
basiliscos joined
09:45
Timbus joined
10:04
bighugedog joined
13:22
rurban1 joined
13:58
bluescreen joined
16:48
Chirag joined
17:29
autark joined
17:36
rurban1 joined
17:56
Hunger joined
20:52
robertle joined
|
|||
| robertle | hi folks, I am currently trying to learn more about VMs, and have of course looked at parrot as well as lua as examples. I have written a very simple stack-based VM, and would like to try a register based one next as an exercise. | 20:54 | |
| now there is one thing I don't quite understand, and would be extremely happy if I could get some input on | |||
| a) if you e.g. compare lua with parrot, then you will notice that in lua the registers are type-tagged, so you can store any type in any register. in parrot, the basic registers are typed. I can see the case for typed registers: you do not need to go through different code paths depending on the content of a register, you know from the opcode what's in them. so less branching. at the same time, you need more space for the | 20:56 | ||
| I forgot what b) was about now :) | 20:57 | ||
| ah, obviously I am targeting a dynamically typed language as well, otherwise that would be stupid of course | 20:59 | ||
| rurban | imho lua's approach uses less space, so it's much faster on an intel cpu | 21:09 | |
| I also work on a lua-based vm: potion i.e. a lua with proper oo, traits | 21:10 | ||
| the branching is not so important if the paths are short. the cpu fetches both basic blocks | 21:11 | ||
| with int it's a simple shift, with strings a mask, with double a full qword, with other objects a mask | 21:13 | ||
| robertle | ah! I didn't know they can fetch both and then just skip some if required! | ||
| rurban | lua/potion: no type specific ops, most ops are method calls, which are cached | 21:14 | |
| robertle | I thought essentially every branch is a potential branch-miss pipeline stall unless the predictor can avoid that | ||
| rurban | no double arithmetic ops, all are method calls. arithmetic ops only for int, with a simple shift | ||
| it only stalls if it has to wait. e.g. on fpu or io or far away heap mem. | 21:15 | ||
| robertle | so type-tagged registers sound ok then! do you know what the reason for the fixed-type registers in parrot is? | 21:16 | |
| rurban | the advantages for parrot are CPS: exceptions, concurrency | ||
| robertle | hmm, I thought that a branch misprediction would stall it as well. need to read up on that | 21:17 | |
| how do these special registers help with concurrency or exceptions? | |||
| rurban | and fixed type regs are also pretty fast. you just need a bunch of ops. as in java. | ||
| the registers are stored in the continuation object | |||
| no stack switching and stack adjustments necessary | 21:18 | ||
| robertle | ok, but why does it make a difference for that whether they are of a fixed type or of a variable type? | ||
| rurban | in lua/potion you have to copy all stack values over manually on yield | ||
| not really | |||
| I prefer tagged types. but parrot went with fixed types | 21:19 | ||
| no substantial differences | |||
| fixed types allow full word ints, tagged only limited (as with a common lisp) | 21:20 | ||
| so you cannot store pointers in ints, and perl5 demands that | |||
| robertle | because you need space for the tag? | ||
| rurban | however, pointers are always aligned | ||
| yes | |||
| so lua uses the last 2 alignment bits for the type | |||
| you can only store proper ptrs then, not pointers to the middle of some buffer/string. | 21:21 | ||
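A minimal C sketch of the one-word tagging scheme described above, assuming word-sized values and at least 4-byte pointer alignment; all names are illustrative, not potion's or lua's actual API:

```c
#include <assert.h>
#include <stdint.h>

/* One-word tagged values: the low bit marks an immediate int (stored
 * shifted left by one), while properly aligned heap pointers keep
 * their low two bits clear for the tag. */
typedef uintptr_t value_t;

static value_t  tag_int(intptr_t i)  { return ((uintptr_t)i << 1) | 1; }
static intptr_t untag_int(value_t v) { return (intptr_t)v >> 1; }
static int      is_int(value_t v)    { return (int)(v & 1); }

static value_t tag_ptr(void *p) {
    /* requires proper alignment: a pointer into the middle of a
     * buffer/string would clobber the tag bits */
    assert(((uintptr_t)p & 3) == 0);
    return (value_t)(uintptr_t)p;
}
static void *untag_ptr(value_t v) { return (void *)(v & ~(uintptr_t)3); }
```

This also shows the trade-off mentioned above: a tagged int loses a bit versus the full-word ints that fixed-type registers allow.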
| robertle | ok, this is an interesting topic: assuming you want full-length contents and a tagged type, the main problem is that because of alignment etc, your tagged union gets quite big, right? | ||
| rurban | you cannot call it tagged then :) | ||
| it's a normal 2 word value struct then | |||
| lua/potion/common lisp is fast because it uses only one word | 21:22 | ||
| robertle | right, with the second being a union of possible types | ||
| rurban | perl/python/ruby are slow because their values are huge (3-10 words) | ||
| with perl having a particularly unfortunate layout | 21:23 | ||
| robertle | so one of the things I would like to try is to basically still use a full type for the register, no lower-order bits as tags, but *not* keep them together quite so much. | ||
| rurban | not keeping together? sounds bad | 21:24 | |
| robertle | so a 32-wide register file would be 32*word-sized register + 32*nibble to denote type of that word | ||
| or so | |||
| rurban | the type word pointing to a type vtable? (as in common lisp) | 21:25 | |
| robertle | right, the obvious problem would be cache effects, so you need to keep this small enough that it does not matter | ||
| rurban | you still need 1 bit or 2 for the GC | ||
| or a full word for the refcount | |||
| you can use the free bits in the type word for the gc. I guess ruby does it like this now | 21:26 | ||
| robertle | but that's part of the value, right? I mean you would only GC complex things like string, which have to be stored somewhere else and the "word" in the register would just point to it | ||
| rurban | yes, immediate values vs refs | 21:27 | |
| robertle | yeah, that sounds good. I was thinking a nibble per register for types, but I don't think I'll end up with 16 types... | ||
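The register file robertle proposes, full-word payload registers with the type tags kept separately, might look like this minimal sketch; the type list and layout are illustrative, not from any existing VM:

```c
#include <assert.h>
#include <stdint.h>

/* 32 full-word payload registers (no tag bits stolen) plus one 4-bit
 * type tag per register, packed two per byte, so all 32 tags fit in
 * 16 bytes and stay cache-friendly. */
enum { T_NIL, T_INT, T_FLOAT, T_BOOL, T_STRING /* ... */ };

typedef struct {
    uintptr_t reg[32];  /* full-word payloads */
    uint8_t   tag[16];  /* two 4-bit tags per byte */
} regfile_t;

static unsigned get_tag(const regfile_t *rf, unsigned r) {
    return (rf->tag[r >> 1] >> ((r & 1) * 4)) & 0xF;
}

static void set_tag(regfile_t *rf, unsigned r, unsigned t) {
    unsigned shift = (r & 1) * 4;
    rf->tag[r >> 1] = (uint8_t)((rf->tag[r >> 1] & ~(0xFu << shift))
                                | (t << shift));
}
```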
| rurban | 16 is pretty low. I have 25 so far | ||
| nope, 22 | |||
| robertle | so let's say in a primitive version you would have int, float, bool, nil, string, hash, vector, special | 21:28 | |
| rurban | github.com/perl11/p2/blob/p2/core/potion.h#L133 | ||
| robertle | first 4 being immediates | ||
| I'll check it out! | |||
| rurban | yes, 3-4 immediates plus a couple of basic types, the rest can fit into a generic container with known size and a vtable pointer | 21:29 | |
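A rough sketch of such a generic container: every non-immediate value starts with a vtable pointer (the type plus its methods) and a known size, so the GC and method cache can treat all objects uniformly. Field names here are guesses for illustration, not potion's actual header:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

typedef struct vtable vtable_t;

/* common header shared by all heap objects */
typedef struct obj {
    const vtable_t *vt;   /* per-type method table */
    uint32_t size;        /* allocation size in bytes, for the GC */
    /* type-specific payload follows */
} obj_t;

struct vtable {
    const char *name;
    /* method slots, looked up (and cached) at call sites */
    uintptr_t (*add)(obj_t *self, uintptr_t rhs);
};
```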
| fperrad also has a luajit based vm, with perl6 support in the works. | 21:30 | ||
| github.com/fperrad/tvmjit/tree/master/src | |||
| robertle | one thing that I need to work out, and that I would be very glad to get ideas on, is how to structure the instruction set around the dynamic types | ||
| rurban | the compiler knows about immediates, the rest is done via method calls | 21:31 | |
| robertle | initially I thought that many of the ops would behave differently based on the types in the registers, with many of the options raising an exception | ||
| rurban | but some arith ops need to check the types at run-time, for undef, bigints, doubles, strings and such | 21:32 | |
| robertle | so e.g. a "multiply" would allow int and float in the input regs and store an int or float in the target, raise an exception on all other types | ||
| right, that's what I mean! so the question is do I *always* do runtime type check/branch, or do I write an instruction set which only works if the correct types are in the registers, and make the compiler guarantee that | 21:33 | ||
| rurban | yes. see e.g. github.com/perl11/p2/blob/p2/core/vm.c for the simple lua-alike version (simpler than lua, in fact) | ||
| robertle | the second sounds like less runtime overhead, but you need more opcodes | ||
| e.g. you need a separate opcode for float mult and int mult | |||
| with e.g. a bytecode format like lua's, you don't get that many... | 21:34 | ||
| rurban | you can add a type inferencer (hm style) or gradual typing to restrict types to help the compiler | ||
| generally you have slower ops for the generic case | |||
| robertle | still: do I look at the type of the input registers when executing an opcode to determine what to do, or just at the opcode itself | 21:35 | |
| right, so an option would be to generally look at the input registers, which is slow but powerful, and have a few fast ops that get executed often. | |||
| int++ or so | |||
| rurban | with arith binops I look at both types, and if both are immed. ints, do it fast (one cpu op), otherwise delegate to a method call | 21:36 | |
| github.com/perl11/p2/blob/p2/core/vm.c#L355 | 21:37 | ||
| with parrot it's similar, we just have separate ops for int, double and objects | |||
| and the objects (integer, double, big, complex, ...) do their slow logic | |||
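The binop dispatch rurban describes, one CPU op when both operands are immediate ints, otherwise delegation, could look like this hedged sketch. It reuses low-bit int tagging, and the method-call fallback is stubbed out; none of this is potion's actual code:

```c
#include <assert.h>
#include <stdint.h>

typedef uintptr_t value_t;

#define TAG(i)    ((value_t)(((uintptr_t)(i) << 1) | 1))
#define UNTAG(v)  ((intptr_t)(v) >> 1)
#define IS_INT(v) ((v) & 1)

static value_t slow_add(value_t a, value_t b);  /* method-call path */

static value_t op_add(value_t a, value_t b) {
    if (IS_INT(a) && IS_INT(b))
        /* both low bits are 1, so (a - 1) + b yields the sum already
         * correctly tagged: the fast path really is a single add */
        return (a - 1) + b;
    return slow_add(a, b);  /* doubles, bigints, strings, objects... */
}

static value_t slow_add(value_t a, value_t b) {
    (void)a; (void)b;
    /* a real VM would look up "+" on a's vtable, ideally via a cache */
    return TAG(0);
}
```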
| robertle | right, when you say "I look at both types", you mean at opcode dispatch time? | 21:38 | |
| rurban | yes. the biggest trick is to be able to call methods fast, not so much the ops | ||
| opcode dispatch is either in a bytecode vm (big switch) or if jitted it's already in the instruction cache, linearly laid out | 21:39 | ||
| with parrot we look at 3-6 arg types I guess. if it's a multi method call | 21:40 | ||
| but the delegation is very slow then | |||
| it was much faster a few years back though, before multi dispatch support | 21:41 | ||
| robertle | what I was trying to do: get the next instruction (which looks like lua in structure), mask off the opcode, determine the source registers and copy the relevant nibbles over. so I end up with one word that contains both opcode and source types | 21:43 | |
| and then do a computed goto with that | |||
| obviously there would be plenty of cases that do the same, which is just another goto | 21:44 | ||
| if I see it right this is the "big computed goto" option, the other one is to just do it on the opcode, and then do something similar inside each op handler to switch on the operand types | 21:45 | ||
| not sure what's better, probably needs experimentation | |||
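For reference, a minimal per-opcode computed-goto loop (GNU C labels-as-values); a tiny stack machine is used here for brevity. The "big computed goto" variant discussed above would fold the operand-type nibbles into the table index instead, trading a much larger jump table for fewer type branches inside the handlers:

```c
#include <assert.h>
#include <stdint.h>

enum { OP_PUSH, OP_ADD, OP_HALT };

/* dispatch on the opcode alone via a label-address table */
static intptr_t run(const intptr_t *code) {
    static const void *dispatch[] = { &&op_push, &&op_add, &&op_halt };
    intptr_t stack[16], *sp = stack;

#define NEXT() goto *dispatch[*code++]
    NEXT();

op_push: *sp++ = *code++;        NEXT();   /* push inline operand */
op_add:  sp--; sp[-1] += sp[0];  NEXT();   /* pop two, push sum */
op_halt: return sp[-1];
#undef NEXT
}
```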
| I need to look at your p2 in more detail, but since it seems to execute lua bytecode, what's the core difference? | 21:48 | ||
| I mean, obviously there is going to be lots of other bits like GC and IO | |||
| or concurrency | |||
| but I mean in the actual heart of the VM, the dispatcher? | |||
| anyway, thanks a lot, I'll do some more playing and will be back. off to bed now... | 21:54 | ||
| rurban | robertle: yes, that's ertl's idea for fast vms: ops + args in one word | ||
| lua does it the same way | |||
| p2 is basically an improved lua, much simpler, pre-luajit | 21:55 | ||
| with proper oo system | |||
| www.jilp.org/vol5/v5paper12.pdf | 21:56 | ||
| robertle | eh, when you say "ops + args in one word", do you mean lua-style bytecode, or the "big computed goto" | 21:57 | |
| rurban | lua-style bytecode | 21:58 | |
| the big computed goto is the slow unjitted dispatch | |||
| the fast dispatch is the jit | |||
| See perl11.org/p2/ for the potion/p2 docs | 21:59 | ||
| robertle | right, but JIT aside the question is still whether you combine opcode and arg types into one big jump table, or do a computed goto based on opcode alone, and then something similar to sort out different behaviour based on the types | ||
| rurban | or better perl11.org/p2/html/ | ||
| robertle | ah, that explains a bit! I was confused looking at your code, expecting a 3-address machine like lua is, simply because the opcodes are so similar. but this is 2-address? | 22:00 | |
| rurban | no, it's the same 3 address machine | 22:01 | |
| robertle | dest = op src what | 22:02 | |
| what is an operand as well? | |||
| i mean, src and what are operands? | |||
| rurban | op + 2 operands, a bigger one and a smaller one | ||
| robertle | k | 22:03 | |
| argh, I *really* have to go to bed | |||
| thanks! | |||
| rurban | bye! | ||
| actually both operands use 12 bits: github.com/perl11/potion/blob/mast...odes.h#L20 | 22:04 | ||
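That layout, one 32-bit word holding an 8-bit opcode plus two 12-bit operands, can be packed and unpacked with plain shifts and masks. Field order and encoding here are an illustrative guess, not potion's exact bitfield layout:

```c
#include <assert.h>
#include <stdint.h>

typedef uint32_t insn_t;

/* pack opcode + two 12-bit operands into one instruction word */
static insn_t enc(unsigned op, unsigned a, unsigned b) {
    return (insn_t)(op & 0xFF)
         | ((insn_t)(a & 0xFFF) << 8)
         | ((insn_t)(b & 0xFFF) << 20);
}

static unsigned op_of(insn_t i) { return i & 0xFF; }
static unsigned a_of(insn_t i)  { return (i >> 8) & 0xFFF; }
static unsigned b_of(insn_t i)  { return (i >> 20) & 0xFFF; }
```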
|
23:29
dalek joined
|
|||