#moarvm on 24 January 2016 - Raku Programming Language Log

01:56 colomon joined 03:04 colomon joined 05:35 Hotkeys joined
Hotkeys	hello	05:35
	as of github.com/MoarVM/MoarVM/commit/ba...f1ef94cd25 moar won't build on windows (for me on two machines) because it cannot find msinttypes/inttypes.h	05:36
	It appears that $config{'toolchain'} isn't initialized	05:42
	"Configuring native build environment ................... Use of uninitialized value $config{"toolchain"} in string eq at Configure.pl line 266."
	I'm not really sure what a good solution here would be, so I'm just reporting it
	oops sorry	05:46
	looks like this was already fixed and rakudo just hasn't bumped the moar version yet	05:47
	my apologies
07:28 domidumont joined 07:33 domidumont joined 10:13 leont joined 12:58 colomon joined 13:07 FROGGS joined 13:52 FROGGS_ joined 15:24 zakharyas joined
timotimo	jnthn: i'm interested in seeing whether an optimization that stores single-grapheme strings inside the string struct itself, rather than mallocing a single-grapheme buffer, would pay off; how do i best measure this? potentially by making my old gdb thingie that goes through the whole heap work again against latest moar?	16:33
jnthn	Maybe. Though .comb of a long string may also show it up in its memory use :)	16:37
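The idea timotimo describes can be sketched roughly as follows. This is only an illustration under stated assumptions: `SketchString`, `Grapheme`, and the `sketch_string_*` functions are hypothetical names and do not reflect MoarVM's actual MVMString layout. A one-grapheme string keeps its grapheme in the union slot that would otherwise hold the buffer pointer, so no malloc happens for it.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical types for illustration; NOT MoarVM's real MVMString. */
typedef int32_t Grapheme;

typedef struct {
    uint32_t num_graphs;
    union {
        Grapheme *heap;    /* valid when num_graphs > 1 */
        Grapheme  inline1; /* valid when num_graphs == 1 */
    } storage;
} SketchString;

/* Copy n graphemes into s. Returns 0 on success, -1 if malloc fails. */
static int sketch_string_init(SketchString *s, const Grapheme *src, uint32_t n) {
    s->num_graphs = 0;
    s->storage.heap = NULL;
    if (n == 0)
        return 0;
    if (n == 1) {
        /* Single grapheme: store it in the struct, no allocation. */
        s->storage.inline1 = src[0];
        s->num_graphs = 1;
        return 0;
    }
    s->storage.heap = malloc((size_t)n * sizeof(Grapheme));
    if (!s->storage.heap)
        return -1;
    memcpy(s->storage.heap, src, (size_t)n * sizeof(Grapheme));
    s->num_graphs = n;
    return 0;
}

/* Read grapheme i, picking the storage that matches the length. */
static Grapheme sketch_string_get(const SketchString *s, uint32_t i) {
    assert(i < s->num_graphs);
    return s->num_graphs == 1 ? s->storage.inline1 : s->storage.heap[i];
}

/* Release the heap buffer, if there is one. */
static void sketch_string_free(SketchString *s) {
    if (s->num_graphs > 1)
        free(s->storage.heap);
    s->num_graphs = 0;
    s->storage.heap = NULL;
}
```

Every reader of the storage has to branch on the length, which is the kind of added complexity jnthn asks about next.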
timotimo	right
jnthn	How common are single-char strings?
	We need to make sure the opt is worth the complexity.	16:38
timotimo	i'd start by looking at what "the empty program" ends up having
jnthn	nod
timotimo	well, that's exactly what i want to find out :)
	alternatively, is there a single point where MVMString gets created? perhaps.	16:39
	mostly in the encode and decode functions, right?
	and perhaps substr and flatten ropes in the ops
jnthn	Well, a lot of places use the codepoint and grapheme iterators.	16:40
	All the encode/decode functions certainly do that.
timotimo	oh, wait, MVMString is also a REPR by itself
jnthn	aye
timotimo	but its initialize method wouldn't be late enough to catch the size of the created string	16:41
	i ought to ponder this a bit more
	BBIAB
	taking hashes of our strings is also driven by the codepoint iterator now?	16:57
	if so, i must have missed the commits that made it so during the bigger nfg refactor or wherever that happened	16:58
	the more interesting optimization may be - if the hash stuff is already taken care of - to actually use unsigned-8bit-storage for known-to-be-ascii strings
jnthn	No, hashing them always turns them into 32-bit grapheme buffers at the moment.	16:59
	That'll want to change	17:00
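A minimal sketch of what "that'll want to change" could look like: hashing through a per-grapheme accessor so that 8-bit storage never has to be widened into a 32-bit buffer first. All names here (`AsciiOrWideStr`, `grapheme_at`, `sketch_hash`) are hypothetical, and FNV-1a is only a stand-in; it is not claimed to be the hash function MoarVM uses.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical string view with two storage kinds; not MoarVM's struct. */
typedef enum { STORAGE_8, STORAGE_32 } StorageKind;

typedef struct {
    StorageKind kind;
    size_t num_graphs;
    union {
        const uint8_t *g8;  /* known-to-be-ASCII strings */
        const int32_t *g32; /* general grapheme buffer */
    } data;
} AsciiOrWideStr;

/* Fetch one grapheme as a 32-bit value regardless of storage kind. */
static int32_t grapheme_at(const AsciiOrWideStr *s, size_t i) {
    return s->kind == STORAGE_8 ? (int32_t)s->data.g8[i] : s->data.g32[i];
}

/* FNV-1a over each grapheme's four bytes (low byte first). The result
 * depends only on the grapheme values, not on how they are stored. */
static uint32_t sketch_hash(const AsciiOrWideStr *s) {
    uint32_t h = 2166136261u; /* FNV-1a 32-bit offset basis */
    for (size_t i = 0; i < s->num_graphs; i++) {
        uint32_t g = (uint32_t)grapheme_at(s, i);
        for (int b = 0; b < 4; b++) {
            h ^= (g >> (8 * b)) & 0xFFu;
            h *= 16777619u; /* FNV-1a 32-bit prime */
        }
    }
    return h;
}
```

Because the same string hashes identically in either storage, a hash table keyed on such strings would not need to care which representation a given key uses.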
timotimo	ah
	we put basically every string into a hash while compiling to build the string heap, so at least in that place it won't help if that optimization lands	17:01
jnthn	aye
	I think it may be a reasonable optimization, though if it causes a lot of code churn to do it now I'll be a tad hesitant
timotimo	that's fair	17:02
jnthn	Though if the remedy to "causes a lot of code churn" is "OK, here's some patches that first refactor things so such changes are easy", I'll be less hesitant :)
timotimo	this neat little article here says malloc(1) gives you 32 bytes of used-up memory	17:04
	8 bytes for tracking data and then it's padded up to the minimum malloc will allocate, which seems to be 32
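The arithmetic from the article can be written down as a small model. The 8-byte header and 32-byte minimum are the figures quoted in the chat; the 16-byte rounding granularity is an added assumption (typical of 64-bit allocators, but not stated in the log), and `sketch_chunk_size` is a hypothetical helper, not a real allocator API.

```c
#include <stddef.h>

/* Figures quoted in the chat. */
#define SKETCH_HEADER_BYTES 8u  /* per-allocation tracking data */
#define SKETCH_MIN_CHUNK    32u /* smallest chunk handed out */
/* Assumption, not from the chat: chunks are rounded to 16 bytes. */
#define SKETCH_ALIGN        16u

/* Estimate how many bytes a malloc(request) consumes under this model. */
static size_t sketch_chunk_size(size_t request) {
    size_t need = request + SKETCH_HEADER_BYTES;
    size_t rounded = (need + SKETCH_ALIGN - 1) / SKETCH_ALIGN * SKETCH_ALIGN;
    return rounded < SKETCH_MIN_CHUNK ? SKETCH_MIN_CHUNK : rounded;
}
```

Under this model a 4-byte buffer for one 32-bit grapheme costs 32 bytes, eight times its payload. Real allocators differ, so the numbers are only indicative.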
	huh, i can probably find out how often a one-character-string gets allocated with a little perf script	17:09
	find all calls to malloc that have the size set to 1 * 32bit, kick out anything that doesn't have a frame from string/*.c in it, and presto!
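The chat proposes doing this measurement with a perf script. As a different, simpler technique for the same question, here is a rough in-process sketch: a counting wrapper that string code would call instead of malloc. `counting_malloc` and both counters are hypothetical names; the wrapper does no call-stack filtering, so it only answers the question if it is used solely from the string allocation sites.

```c
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical counters for illustration only. */
static unsigned long total_allocs = 0;
static unsigned long single_grapheme_allocs = 0;

/* Forward to malloc while tallying requests sized for exactly one
 * 32-bit grapheme (the "1 * 32bit" case from the chat). */
static void *counting_malloc(size_t size) {
    total_allocs++;
    if (size == sizeof(int32_t))
        single_grapheme_allocs++;
    return malloc(size);
}
```

Comparing `single_grapheme_allocs` with `total_allocs` after running "the empty program" would give a first estimate of how common single-grapheme buffers are.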
	Caution: When you run your program in the Visual Studio or with any debugger attached, by default the malloc behaviour is changed a lot, Low Fragmentation Heap is not used and a memory overhead may be not representative of real usage - huh! TIL!	17:16
18:34 lizmat joined 19:17 virtualsue joined 19:21 domidumont joined 19:43 ggoebel14 joined 20:00 colomon joined 20:04 domidumont joined 22:32 kjs_ joined 22:51 colomon joined
