00:47 lizmat left
01:55 MasterDuke_ joined
01:57 MasterDuke left
01:59 MasterDuke_ is now known as MasterDuke
Kaiepi | how much work is it to add support for a new encoding? | 02:16 | |
utf32 or wchar strings, i mean | 02:17 | ||
03:05 MasterDuke left
03:18 MasterDuke joined
05:28 robertle left
05:29 domidumont joined
05:36 domidumont left
05:37 domidumont joined
05:58 Kaiepi left
05:59 lizmat joined
06:04 Kaiepi joined
samcv | Kaiepi: a new encoding or to add a new type of string? | 06:04 | |
i mean we have 32 bit strings that are stored as 32 bit signed integers | 06:05 | ||
but that's not utf32. what are you trying to achieve | 06:06 | ||
Kaiepi | i meant support for strings using wchar_t | ||
samcv | like on the command line? | ||
what is this for. i'm confused | 06:07 | ||
Kaiepi | i wanted to write bindings for editline, but it uses wchar_t or ascii depending on how it's compiled | ||
samcv | are we talking about nativecall? | 06:08 | |
Kaiepi | yeah | ||
samcv | ok. that makes this make a lot more sense | ||
Kaiepi: it seems like a compiler dependent type | 06:09 | ||
Kaiepi | it is | ||
samcv | i mean do we need a wchar_t* that is null terminated? | 06:10 | |
Kaiepi | i made a pullreq that exposes wchar_t as a type for moar, jvm, and the js runtime | ||
samcv | Kaiepi: so do we need null terminated wchar_t*'s? | 06:11 | |
is that what we'd have to pass through to the external libraries? | 06:12 | ||
or is it another convention? |||
Kaiepi | i think they might use WEOL instead, but i'll need to check | ||
06:12 lizmat left
Kaiepi | going off the docs for wide strings on my system they use wide null characters | 06:18 | |
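A quick illustration of the wide-string convention being discussed, in plain C with no MoarVM or NativeCall specifics: wide strings end in a wide NUL, and the width of wchar_t is compiler- and platform-dependent, which is why it cannot simply be treated as utf32 everywhere.

```c
/* Plain C illustration (no MoarVM/NativeCall specifics): wide strings end in a
 * wide NUL, L'\0', and the width of wchar_t is compiler/platform dependent
 * (commonly 4 bytes on Unix-like systems, 2 on Windows). */
#include <stdio.h>
#include <wchar.h>

int main(void) {
    const wchar_t *s = L"hello";   /* terminated by a wide NUL */
    printf("sizeof(wchar_t) = %zu, wcslen(s) = %zu\n",
           sizeof(wchar_t), wcslen(s));
    return 0;
}
```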
06:18 domidumont left
06:19 brrt joined
brrt | \o | 06:34 | |
06:35 robertle joined
nwc10 | o/ | 06:35 | |
brrt | what i've found is that it is really difficult to explain the original need for invokish and friends to a non-moarvm-engineer audience | 06:38 |
Kaiepi | i'm willing to try to understand | 06:45 | |
brrt | :-) | 06:51 | |
well, it is no longer as necessary, since invokish was removed | 06:56 | ||
but the tl;dr is that MoarVM uses explicit control flow throughout; this makes implementation of things like resumable exceptions possible (unlike, say, implicit control flow through recursing over the C stack), | 06:58 ||
but the problem for the JIT is that as far as the 'C' stack is concerned, JIT compiled code is running 'on top of' the interpreter frame | |||
i.e. [ start | interpreter | JIT -> (grow upwards) | 06:59 | ||
now if the JIT compiled code needs a service from the interpreter (like, invoking a new frame), it needs to store its current state and return to the interpreter, rather than call into a new interpreter frame | |||
which means that we need to explicitly manage our current position and state in the JIT as well as in the interpreter | 07:00 ||
my recent patchset was about finding a much cleverer way to do this than we had before | |||
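A minimal sketch of the idea brrt describes, with made-up names (run_status, jit_state) rather than MoarVM's real API: when JIT-compiled code needs a service from the interpreter, it saves where it was and returns, so the single interpreter frame at the bottom of the C stack stays in charge instead of being recursed into.

```c
/* Minimal sketch (not MoarVM's actual API) of "explicit control flow":
 * JIT-compiled code never calls back into the interpreter; it records a
 * resume point and returns, and the interpreter loop re-enters it later. */
#include <stdio.h>

typedef enum { RUN_DONE, RUN_NEED_INVOKE } run_status;

typedef struct {
    int resume_label;   /* where to pick the JIT code back up */
} jit_state;

/* stand-in for a JIT-compiled routine that is resumable at saved labels */
static run_status jitted_code(jit_state *st) {
    switch (st->resume_label) {
    case 0:
        printf("jit: doing some work\n");
        st->resume_label = 1;        /* remember where we were ... */
        return RUN_NEED_INVOKE;      /* ... and fall back to the interpreter */
    case 1:
        printf("jit: resumed after the invoke\n");
        return RUN_DONE;
    }
    return RUN_DONE;
}

int main(void) {
    jit_state st = { 0 };
    /* the interpreter loop: service the request, then re-enter the JIT code */
    while (jitted_code(&st) == RUN_NEED_INVOKE)
        printf("interp: setting up the new frame on the interpreter's terms\n");
    return 0;
}
```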
07:36 zakharyas joined
07:47 zakharyas left, zakharyas joined
08:17 lizmat joined
08:30 domidumont joined
09:01 zakharyas left
10:03 zakharyas joined
10:13 brrt left
10:38 brrt joined
10:54 Kaiepi left
10:59 zakharyas left
11:14 lizmat left
11:17 lizmat joined
12:23 brrt left
12:36 Kaiepi joined
12:53 brrt joined
13:02 zakharyas joined
14:13 zakharyas left, zakharyas joined
14:36 scovit joined
Geth | MoarVM/pluggable-spesh: 22 commits pushed by (Jonathan Worthington)++ review: github.com/MoarVM/MoarVM/compare/3...0e61206253 | 14:59 |
jnthn | Just a rebase | 15:00 | |
15:09 robertle left
15:14 raiph joined
15:29 brrt left
15:34 domidumont left
Geth | MoarVM/pluggable-spesh: 9eebf10d05 | (Jonathan Worthington)++ | src/spesh/plugin.c | 15:59
Make spesh plugins work correctly with the JIT
We already put a deopt annotation on the prepargs that we are going to delete as part of the optimization. Steal that and put it on the guards, so that we get a viable position to use for deopt_offset. It doesn't matter that this will not result in a very precise position, since this is just used to answer the question of "which (if any) of the inlines are we in". All the guards will fall within the inlines.
16:58 raiph left
17:04 robertle joined
17:06 zakharyas left, zakharyas joined
17:30 domidumont joined
18:04 zakharyas left
18:32 lizmat left
18:47 lizmat joined
18:53 ggoebel left
19:00 ggoebel joined
19:07 domidumont left
19:12 brrt joined
19:47 zakharyas joined
robertle | folks, I am wondering whether you could help me understand something about GC in general. a bit off topic, but perhaps you find it interesting as well. | 19:59 | |
I am building a toy scheme interpreter, and I have some memory management and GC already. my heap objects are not movable, so that the immediate value on the stack or in another heap object can just be a direct pointer to the heap object. | 20:00 ||
20:00 brrt left
japhb | robertle: gchandbook.org/ | 20:01 | |
jnthn | It can be a direct pointer even if they are movable. :) | ||
(You just need to be able to find all the pointers :) | |||
robertle | this works, but I cannot compact my heap, I can't do generational GC, and my allocation is quite involved as I need to find an empty slot | ||
japhb | I was serious about that link. | ||
robertle | japhb: I'll definitely read it! | 20:02 | |
japhb | It (and its predecessor) are very good. :-) | ||
robertle | jnthn: right, but finding all pointers is quite hard too | ||
jnthn | I read that book before doing the MoarVM GC. It was *very* helpful :) | ||
robertle: Well, at the very least it enforces that you know certain things | 20:03 | ||
japhb | Heh | ||
robertle | scheme creates insane amounts of short-lived garbage in the form of cons cells, so I would like to play with a different strategy. one where I can just use a quick bump allocator into a new generation, and then gc-copy into a different generation | ||
jnthn | e.g. you can't just do a conservative stack scan | ||
japhb | robertle: half-space compacting nursery sounds like something useful | 20:04 | |
(MoarVM uses that too) | |||
jnthn | Yeah, I picked semispace for Perl 6 'cus it does the very same | ||
Tons of short-lived Scalars for example |||
And Str and Int and friends are immutable |||
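For reference, a minimal sketch of the "quick bump allocator into a new generation" mentioned above; the names (nursery, nursery_alloc) are illustrative and not taken from MoarVM or the toy interpreter.

```c
/* Sketch only: allocation into a semispace nursery is a pointer bump plus an
 * overflow check; when the nursery fills up, the caller runs a collection
 * (copying survivors out) and then retries. */
#include <stddef.h>

typedef struct {
    char *alloc;   /* next free byte in the nursery */
    char *limit;   /* one past the end of the nursery */
} nursery;

static void *nursery_alloc(nursery *n, size_t size) {
    size = (size + 15) & ~(size_t)15;   /* keep allocations 16-byte aligned */
    if (n->alloc + size > n->limit)
        return NULL;                    /* nursery full: collect, then retry */
    void *p = n->alloc;
    n->alloc += size;
    return p;
}
```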
robertle | the problem I currently have is that I seem to have a fundamental problem understanding something. | ||
20:05 brrt joined
japhb | jnthn: How many semispace copies before promote to second gen these days? | 20:05 | |
robertle | so my first attempt was to have my immediate values still be pointers, but to boxes in a fixed location. that box then points to the actual heap. this way I can move the actual entry | ||
jnthn | @japhb 1 (well, 2 if you count the copy into gen2) | ||
robertle | but that stinks big time: allocating the boxes is about as complicated as my previous allocation scheme | ||
jnthn | So there's just a flag in the header saying "we already saw this in the nursery" | 20:06 | |
robertle | unsurprising really, they are also on the heap and immovable..., just smaller | ||
japhb | nodnod | ||
robertle: You are going to recreate the history of GC manually if you don't watch out. ;-) | |||
jnthn | robertle: Yeah, I suspect you need to go all-in with moving | ||
japhb | s/manually/the hard way/ | ||
robertle | I tried to understand how this works in moar, but don't get it: | ||
if you have a value in say a register, and it refers to an object on the heap. does it refer directly to the heap object? or via an indirection? | 20:07 | ||
japhb | robertle: ... also, to the top of the object? To the GC header? To the object field that's the actual referenced thing? | 20:08 | |
robertle | not sure, moar is so much more complicated than my toy that I can't really say. but really, anything on the heap that can be moved and that gets GCed | 20:09 | |
what's the most simple thing that still ends up on the heap? | |||
japhb | I suspect Scheme has simple enough primitives that you don't need to take references to object fields, as you are taking a reference to an entire object or a primitive directly. | ||
robertle | correct, I only ever reference objects | 20:10 |
japhb | robertle: Careful to separate "implementation language's heap" (malloc and friends) and "target language's heap" (cons and friends) | ||
robertle | ok, I meant target language heap | 20:11 | |
jnthn | robertle: It refers directly to the object's memory, which starts with a predictable header | ||
japhb | And for the latter, the MoarVM equivalent is I believe MVMCollectable or so. | ||
jnthn | robertle: That header lets us find out what kind of thing it is, so we can in turn discover its pointers | ||
robertle | ok, that much I understand. | ||
jnthn | robertle: We also have metadata telling us which registers hold objects | ||
The big change in terms of what you work with inside of your GC is probably going to be that you get an extra LoI | 20:12 | ||
So the queue of things to process is not pointers to objects, but rather pointers to pointers to objects | |||
robertle | "LoI"? | ||
jnthn | So that the GC can update them | ||
Level of Indirection | |||
japhb | Level of Indirection | ||
jinx! | |||
jnthn | :) | ||
jnthn gotta go, but bbi20 | 20:13 | ||
japhb | o/ | ||
jnthn | (happy to continue answering questions then :)) | ||
robertle | hold on a moment. so in my very simple case I have values e.g. on the stack. in the case of e.g. a string, that value has a type tag and a pointer to the string object on the target language heap | 20:14 | |
you are saying that is basically the same in moar? the value in a register directly references target heap memory? |||
japhb | robertle: MoarVM has virtual registers. | 20:15 | |
There is separate handling of pointers in machine registers, which is what the MVMROOT macro is for. | 20:16 | ||
brrt | actually, being pedantic, MVMROOT is for collectable objects that don't live in registers, since registers are GC roots by default | 20:17 | |
japhb | (It marks that pointer as being in the "root set" which is the list of starting points, the roots of the forest of object graphs, that will eventually lead to all live objects in the target language's heap | ||
robertle | ok, but still. you have a pointer to the heap object. so if you move the object during GC, you update the register? | ||
japhb | ) | ||
brrt | and registers are marked as containing collectable objects | ||
japhb | brrt, apologies for that. | ||
brrt | my apologies for pedantry :-) | ||
robertle: correct. the GC manages all the pointers to objects that might be moved | 20:18 |
japhb | Yes, GC is generally done at one additional level of indirection, so it operates on pointers to pointers, because it needs the pointers to be *themselves* writable. | ||
brrt | so suppose object A refers to object B; the GC takes a pointer to the pointer from A to B | ||
robertle | right, we are getting closer to where my brain gets stuck: so your GC has a pointer to the register, so that it can update it when it moves the object the register points to | 20:19 | |
brrt | 'virtual' register, but yes | ||
robertle | but surely there can be multiple registers pointing to the same object | ||
and also slots in objects on the heap | |||
brrt | effectively more like a stack address | ||
yes | |||
so these are managed in an array of pointer pointers | 20:20 | ||
in c, this is a pointer pointer pointer :-) | |||
japhb | .oO( You had me at *** ... ) | 20:21 |
robertle | and also slots in objects on the heapto the same object? | ||
brrt | all of 'm | 20:22 | |
robertle | what? I didn't mean to write that last line, doesn't make any sense! what I wanted to say: | ||
brrt | also, because of generational GC, we also maintain all cross-generation pointers | ||
robertle | and also slots in objects on the heap that point to the same object? | ||
brrt | yes | ||
robertle | very simple case: one object on the heap, two registers that point to that object. GC kicks in. what happens? | 20:23 | |
brrt | GC creates a list of pointers to the registers | 20:24 | |
GC moves the object | |||
GC updates the registers through the pointers | |||
done | |||
(a 'list' being a contiguous array of memory) | 20:25 | ||
i.e. an array | |||
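The shape of that array, sketched in C with hypothetical names (obj, worklist) rather than MoarVM's actual structures: because objects move, the list holds the addresses of the references themselves, so each one can be rewritten in place after the move.

```c
/* Sketch only: the GC work list stores obj** entries - pointers to the slots
 * (registers, object fields) that hold object pointers - so the collector can
 * update each reference after it moves the object it points to.
 * A worklist initialised to all zeroes starts empty. */
#include <stdlib.h>

typedef struct obj obj;    /* whatever a heap object looks like */

typedef struct {
    obj ***items;          /* addresses of the slots that hold obj* values */
    size_t count, cap;
} worklist;

static void worklist_add(worklist *wl, obj **slot) {
    if (wl->count == wl->cap) {
        wl->cap = wl->cap ? wl->cap * 2 : 64;
        wl->items = realloc(wl->items, wl->cap * sizeof *wl->items);
    }
    wl->items[wl->count++] = slot;   /* e.g. the address of a register slot */
}
```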
robertle | ok, but I don't see how that is done efficiently: you'd have to update the list of pointers when you move the object, i.e. go through the whole list and update every pointer that points to the old location, for every object. | 20:26 |
I don't see how I can make that not quadratic | |||
brrt | it seems very linear time to me :-) | 20:27 | |
you only update each reference to the old position once | 20:28 | ||
and you don't need to look them up because you've stashed them in a list | |||
robertle | right, but you need to scan the list of all pointers. and you need to do that for each object moved | ||
brrt | but you scan all objects in a separate phase, once, then figure out which ones you need to move | 20:29 | |
robertle | ok, so I have a list of all objects that need moving. possibly with an old and new location | ||
brrt | yes. and you *also* have a list of all references to those objects | ||
robertle | hold on, that may be the critical part | 20:30 | |
where do you keep that list of per-object references? | |||
brrt | you create it while scanning the heap for all reachable objects | 20:32 | |
robertle | k, but you somehow have to link it from the object itself? | ||
brrt | every object you encounter, you maybe put it on the list to be moved, and all references to objects you encounter, you put on the list of references-to-update | ||
robertle | getting closer, this is great | 20:34 | |
so what data do you have as part of the heap object to support this? I would imagine you either need to store a pointer from the object to the list of references to it, or have a slot in which you temporarily store the new location | 20:35 | ||
dude that book isn't exactly cheap! but there you go... | 20:37 | ||
brrt | i'm confused now... what we have is a flag that says if something is first- or second-gen (i.e. moveable or not), and a function that lists all references within the object | 20:38 |
anyway, i'm afk. i wish you good luck :-) | |||
20:39 brrt left
20:52 zakharyas left
jnthn | robertle: Perhaps the key part missing here is the need for a forwarding pointer | 20:57 | |
robertle | explain please | ||
jnthn | If we have 3 references to the same object, then when we discover each of them we'll put a pointer to the location of the reference into our "todo" list | 20:58 |
We do need to update all of those references when we move the object, *but* we only want to move the object once | 20:59 | ||
robertle | ah, wait! | ||
don't say anymore | |||
I get it: when you move the object, you put a pointer in its place that refers to the new location. | 21:00 ||
right? | |||
jnthn | Yeah. I guess in your existing collector you already have some kind of "seen" bit in your object headers to know which objects you already marked, so you don't get stuck when marking self-referential data structures? | 21:01 | |
In a moving collector that bit really becomes an "I already moved this" | 21:02 | ||
robertle | yes, I have two bits of GC management per heap object | ||
jnthn | So there's two cases when processing a pointer to an object reference | ||
The first one is that you follow the pointer to the object and observe that it doesn't have the "already moved" bit set |||
In that case, you 1) copy the object's memory to the new location where it will live, 2) set the "already moved" bit, 3) save the new address inside the *old* memory of the object (which is safe because you already copied it), and 4) update the reference that you saw to point to the new location | 21:03 ||
The second case is that you follow the pointer to the object and see that the object was already moved. You then find the address written in 3) and do 4) | 21:04 ||
robertle | cool, understood. and I don't even need an extra bit: i currently have a "mark" bit. if I immediately move the object when I mark it, it's the same :) | 21:05 | |
jnthn | Right :) | ||
robertle | beautiful, that solves my problems and isn't even so much work to do from where I currently stand | ||
jnthn | In the MoarVM collector, the second case is handled here: github.com/MoarVM/MoarVM/blob/mast...ect.c#L214 | ||
There's only one line of actual work in there, the rest is all just debug/assertion :) | 21:06 | ||
And here is the first case: github.com/MoarVM/MoarVM/blob/mast...ect.c#L312 | 21:07 | ||
It's a bit more involved here as there's the generational thing going on, but you can see the memcpy and the bit being set | ||
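The two cases jnthn describes, sketched in C with hypothetical names (obj, process_slot, tospace_alloc); the real MoarVM code is in the collect.c links above, this is just the skeleton of the algorithm.

```c
/* Sketch of processing one work-list entry in a copying collector: 'slot' is
 * the address of a reference; either copy the object and leave a forwarder in
 * the old memory, or follow a forwarder that is already there. */
#include <stdlib.h>
#include <string.h>

#define OBJ_MOVED 0x1

typedef struct obj {
    unsigned flags;          /* includes the "already moved" bit */
    struct obj *forwarder;   /* new address, valid once OBJ_MOVED is set */
    size_t size;             /* total size of the object, payload included */
} obj;

/* stand-in for a bump allocation into tospace */
static void *tospace_alloc(size_t size) { return malloc(size); }

static void process_slot(obj **slot) {
    obj *o = *slot;
    if (o->flags & OBJ_MOVED) {
        *slot = o->forwarder;                 /* case 2: just follow the forwarder */
    } else {
        obj *copy = tospace_alloc(o->size);   /* case 1: copy it ... */
        memcpy(copy, o, o->size);
        o->flags |= OBJ_MOVED;                /* ... mark the old memory as moved ... */
        o->forwarder = copy;                  /* ... leave the new address behind ... */
        *slot = copy;                         /* ... and fix up this reference */
    }
}
```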
robertle | great, and it also solves one of my worries which is the extra memory I need for GC and heap management: most objects are cons cells, and they are just 16 bytes on the heap. I really need to avoid large management data | 21:08 |
jnthn | process_worklist is *the* heart of the MoarVM collector, really | ||
Pretty much everything else is just supporting that algorithm :) | 21:09 | ||
robertle | totally unrelated to this question: but does moar do any of the GC work in parallel with heap-mutating code? or at least do some stop-the-world GC work in parallel? | ||
jnthn | (Which in places is still quite tricky, for example sync-up of threads) | ||
No and yes | |||
The first case tends to be called a concurrent collector (it collects concurrent with the running code) | 21:10 | ||
robertle | yeah | ||
jnthn | The second is a parallel collector | ||
robertle | so moar is non-concurrent but parallel? | ||
jnthn | MoarVM's collector is parallel - multiple threads join in to do the collecting work - but it's barely concurrent | ||
At the end of the collection process, it reaches a point where individual threads can finish up their bit and then go on their way as soon as they have | 21:11 | ||
But for most of the time it needs the world to be stopped | |||
There's a latency/throughput trade-off here | 21:12 | ||
robertle | ok, and what did you mean with the "sync-up of threads" before? | ||
jnthn | Taking all the running threads and making them stop their work and come join in with GC | ||
That's what the stuff in src/gc/orchestrate.c does | |||
robertle | ok, I see | 21:13 | |
jnthn | But it also has to account for stuck threads | ||
e.g. those that are doing blocking I/O or sleep or waiting on a mutex | |||
In MoarVM, when going to wait on such a thing, the thread flags itself as unable to join in with GC | 21:14 |
robertle | ah, darn! my toy doesn't have threads yet, so I am good | ||
jnthn | Yeah, that makes things far easier :) | ||
When you do, you'll notice that being first to move an object is a data race, and will need a scheme to deal with it :) | |||
The MoarVM scheme is that every object has an owner, and we delegate its GC work to the thread responsible for its owner | 21:15 | ||
robertle | you mean with multiple threads doing GC? right, that's why I was asking... | ||
jnthn | That's what the "work passing" stuff in collect.c is doing | ||
A potential alternative is to race to write the forwarder pointer | 21:16 | ||
But there's a tricky detail there, I guess, because how do you know if the address there is a forwarder or not? | 21:17 | ||
One way to deal with that is to make every single object 1 pointer bigger so that can always be zeroed to indicate "not moved" | |||
But that makes every object bigger | |||
Another would be to require the platform can do a double-cas and then install the pointer and a header bit together | 21:18 | ||
But that may not be portable enough | |||
robertle | my heap is segmented into arenas, and each arena has its own list of things that still need to be done. so in theory I could shard by arena. I then would need to protect the list when appending/consuming... | ||
jnthn | Yeah | ||
Yeah, that's what MoarVM is doing I guess. Each thread has its own nursery |||
robertle | you would need the pointer and the header bit to be quite close together to be cas-able. I currently have them quite far apart | 21:19 | |
jnthn | It's the forwarder in the object and the header bit in the object, so they're close enough | ||
But you still need a double-cas operation | |||
robertle | btw, this is the toy: github.com/robertlemmen/keiryaku | 21:20 | |
jnthn | I guess yet another scheme is to have two bits, CAS in place the "I'm going to write a forwarder" bit, if you win then write the forwarder, then cas in an "I'm done writing the forwarder". If you lose the first race then you have to spin. | ||
I guess with enough clever you could use that as a fallback on platforms without double-cas and use the double-cas where possible :) | 21:21 | ||
But we'd actually then have a new problem if we did that in MoarVM | |||
robertle | ?? | 21:22 | |
jnthn | We rely on the things in a particular thread's fromspace (bit of memory we copy from) to land up in the same thread's tospace. | 21:23 | |
Which in turn means we know we'll always end up with no more in tospace than fromspace, so we can't ever overflow it | |||
We'd need a new scheme to deal with that | |||
Also, the current design tries to keep things local, and we'd lose that too | 21:24 | ||
So maybe the current work-pass scheme ain't so bad after all :) | |||
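A sketch of the two-bit scheme jnthn outlined above, using C11 atomics and hypothetical names; this is not MoarVM's work-passing code, just what "racing to write the forwarder" could look like without a double-CAS.

```c
/* Sketch only: threads race to CAS in a "claimed" bit; the winner copies the
 * object and then publishes a "done" bit, losers spin until the forwarder is
 * visible. Not MoarVM's actual scheme (MoarVM passes work to the owning thread). */
#include <stdatomic.h>
#include <stdlib.h>

enum { FWD_CLAIMED = 0x1, FWD_DONE = 0x2 };

typedef struct obj {
    _Atomic unsigned gc_bits;
    struct obj *_Atomic forwarder;
    size_t size;
} obj;

static void *tospace_alloc(size_t size) { return malloc(size); }  /* stand-in */

static obj *forward(obj *o) {
    unsigned expected = 0;
    if (atomic_compare_exchange_strong(&o->gc_bits, &expected, FWD_CLAIMED)) {
        obj *copy = tospace_alloc(o->size);        /* we won the race */
        copy->size = o->size;
        /* ... copy the rest of the payload into 'copy' here ... */
        atomic_store(&copy->gc_bits, 0);           /* the copy starts out unclaimed */
        atomic_store(&copy->forwarder, NULL);
        atomic_store(&o->forwarder, copy);
        atomic_fetch_or(&o->gc_bits, FWD_DONE);    /* publish the forwarder */
        return copy;
    }
    while (!(atomic_load(&o->gc_bits) & FWD_DONE)) /* we lost: wait for the winner */
        ;
    return atomic_load(&o->forwarder);
}
```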
robertle | hmm, all very difficult. do you have much in terms of benchmarks or so to test the GC against? | 21:25 | |
jnthn | I've mainly tuned it on real-world workloads, since we have plenty of those :-) | 21:26 | |
CORE.setting compilation is quite good for measuring how well generational things are working | |||
Hammering a multi-threaded web server isn't bad for looking at the threading case | 21:27 | ||
I rewrote the thread sync-up algorithm at some point after looking at profiles of the latter | |||
robertle | aren't they very fickle? I have an app at work for example (java). it creates shitloads of garbage, so really the main performance factor is the nursery size. if short-lived shit makes it into the real heap, then performance goes down the drain... | ||
jnthn | That's a factor too, though also a big nursery means longer collects and less cache locality in the collects | 21:28 | |
What I did find was very worth tuning was the decision about when to do full collects | 21:29 | ||
robertle | exactly. so more tuning required than one wants to do... | ||
more generally: are you happy with it's performance in the web service (i.e. cro I guess) case? | |||
jnthn | Pretty sure a tuning there got 10% off CORE.setting compilation by putting into MoarVM an algorithm that used the overall heap size to decide how often to collect gen2 | ||
robertle | I played with that a while ago because it's also what I do at work, and have to admit the results seemed a bit mixed to me | 21:30 | |
jnthn | Happy with performance with Cro overall? Not really yet, though it's easily handling my production workloads. | 21:31 |
But GC isn't much of a factor there, I don't think. | |||
robertle | what do you think are the big possible performance gains still to be had then? | ||
jnthn | A lot more of the cost comes from concurrency control overheads in Supply | ||
Which is in no small part because LEAVE is essential for safely releasing locks, but also a pig. | 21:32 | ||
I'm overall quite happy with the GC itself. We could do with escape analysis and "stack" allocation. | 21:33 | ||
Which would mean that a lot less things ever end up being the GCs job | |||
robertle | ah! I have read and tried to understand dybvig's scheme paper, which I believe is at its core about escape analysis and figuring out when stuff can safely be put on the stack | 21:34 |
but it's mind-boggling | |||
jnthn | The other thing is that we've just a bunch more things to do to turn Perl 6 programs into better quickened bytecode and/or machine code | ||
I'll probably spend much of next week re-working Scalar and assignment | 21:35 | ||
Because at the moment we can't optimize out assignment type checks or lower it into something cheap, despite it being a really common thing. | |||
robertle | hmm, yeah. I noticed that there is quite some work going into hand-crafting core functionality in nqp. shouldn't it be rakudo that generates reasonable code from perl6? | ||
jnthn | The problem is that a lot of the properties that stop it generating simpler code are things that aren't so easy to do static analysis on | 21:36 |
robertle | are you saying the language shouldn't be quite so late-binding? ;) | ||
jnthn | Not really, I'm saying that Rakudo's code-gen isn't really where we can make the gains | 21:37 | |
robertle | ok, but if that is the case, why does it help if people write clever nqp implementations of stuff? | ||
jnthn | And where we do need to generate different/better code, the main reason is so the VM can do a better job | 21:38 | |
Because they aren't writing Perl 6 code by that point, and thus avoiding various of the trickier-to-analyze things | |||
Same reason that NQP programs generally run faster | 21:40 | ||
robertle | hm, ok | ||
jnthn | Gotta go again for a bit | 21:41 | |
robertle | anyway thanks for the chat, the forwarding-pointer-in-old-heap location was exactly what I was missing. I am sure that is going to make it relatively easy to switch to a bump allocator and generational GC. | ||
I need to be in bed as well :) | 21:42 | ||
jnthn | :-) | ||
'night o/ | |||
22:26 robertle left
23:44 Kaiepi left