00:08 evalable6 joined 01:05 evalable6 joined
MasterDuke timotimo: doh, thanks. getting farther in stage parse now 01:09
samcv MasterDuke, can i see your patch so far? 01:10
MasterDuke samcv: github.com/MasterDuke17/MoarVM/tre...ng_storage 01:12
Geth MoarVM: c5c389101d | (Samantha McVey)++ | .appveyor.yml
Appveyor: disable VS2017 builds

We can't reenable them until we know where SetEnv.cmd is, since nmake isn't in the path without running it.
01:13
01:56 ilbot3 joined
samcv yeah i did 01:58
i didn't know it was that low, the binsize
what is the size of it?
i see MVM_FSA_BINS is 96. that can't be 96 bytes can it?
i mean if it's really that small, then why don't we just use alloca in MVM_string_join? 02:05
since we don't need it to persist
MasterDuke i don't remember the exact value, but a quick fprintf shows a byte size of 772 equals a bin of 96 02:07
samcv so you can only allocate up to 772 bytes before it just mallocs it then?
MasterDuke up to something about 772, don't know the exact value 02:08
760 i think 02:10
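(A minimal sketch of the bin arithmetic being discussed, assuming 96 bins with an 8-byte step; only MVM_FSA_BINS = 96 comes from the log, the granularity and names are assumptions, not the actual MoarVM source.)

```c
/* Rough sketch (not the actual MoarVM source) of a fixed-size allocator
 * front end with 96 bins at an assumed 8-byte granularity. With these
 * numbers the largest request a bin can serve is 96 * 8 = 768 bytes;
 * anything bigger falls through to plain malloc, which is in the same
 * ballpark as the ~760-772 byte cutoff observed above. */
#include <stdlib.h>

#define FSA_BINS      96   /* cf. MVM_FSA_BINS */
#define FSA_BIN_BYTES  8   /* assumed per-bin step size */

static void *fsa_alloc_sketch(size_t bytes) {
    size_t bin = (bytes - 1) / FSA_BIN_BYTES;  /* 1..8 -> bin 0, 761..768 -> bin 95 */
    if (bin >= FSA_BINS)
        return malloc(bytes);                  /* too big for any bin: plain malloc */
    /* ... otherwise this would pop an entry from the bin's free list / page ... */
    return malloc((bin + 1) * FSA_BIN_BYTES);  /* stand-in for the bin allocation */
}
```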
geekosaur samcv, you might want to be careful with alloca anyway. although on currently supported platforms up to 3k is probably safe 02:12
(4k is asking for trouble if it causes multiple pages of stack segment to be allocated, and you need to be aware of anything else that also uses or allocates from stack) 02:13
samcv geekosaur, so you think i should lower it to 3000? 02:14
i have it set to 4096 right now. i'm fine with lowering it
MasterDuke .tell timotimo down to only 12 errors in stage parse, all exactly the same (gist updated with an example)
yoleaux MasterDuke: I'll pass your message to timotimo.
geekosaur if you occasionally see segfaults in the code that uses it, you're running into the problem 02:15
samcv but at 760 bytes we could just alloca the join buffer onto the stack since that's not very much and it'd be faster than FSA, since we don't need to keep it around
geekosaur (basically, whether something counts as a stack allocation or a bad pointer is determined by an OS heuristic that can guess wrong, so allocating too much on stack can cause the next use of the stack (function calls, another allocation, etc.) to be mistaken for a bad pointer) 02:16
MasterDuke samcv: for `pieces` in MVM_string_join?
samcv yeah
MasterDuke sure, give it a shot 02:17
geekosaur and this is determined by stack pages, so the only safe use is if it allocates exactly one more page to stack. page size is 4096 bytes on supported platforms
basically, you can try it; if you see occasional segfaults then back the size down and see if they go away
samcv well i haven't seen any segfaults on this, or freebsd or on alpine with musl 02:18
but i can back it down from 4096
02:19 zakharyas joined
Geth MoarVM: 474ab7cdd1 | (Samantha McVey)++ | src/strings/ops.c
In KMP index use malloc not FSA. Set max stack alloc to 3000

Reduce stack alloc from 4096 to 3000, as recommended by
  geekosaur++. Use malloc instead of FSA because FSA would just
malloc anyway since it's larger than the max FSA amount.
05:25
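(A minimal sketch of the stack-or-heap pattern that commit describes, assuming the 3000-byte cap geekosaur recommended; the function and buffer names are illustrative, not MoarVM's actual API.)

```c
/* Small scratch buffers go on the stack via alloca, anything over the cap
 * is heap-allocated and freed before returning. 3000 bytes stays safely
 * under a 4096-byte stack page, per the discussion above. */
#include <alloca.h>
#include <stdlib.h>
#include <string.h>

#define MAX_STACK_ALLOC 3000

void use_scratch_buffer(size_t num_pieces) {
    size_t bytes = num_pieces * sizeof(void *);
    void **pieces;
    int on_stack = bytes <= MAX_STACK_ALLOC;

    if (on_stack)
        pieces = alloca(bytes);   /* released automatically when we return */
    else
        pieces = malloc(bytes);   /* too big for the stack: heap-allocate */
    if (!pieces)
        return;

    memset(pieces, 0, bytes);
    /* ... use pieces for the duration of this call only ... */

    if (!on_stack)
        free(pieces);
}
```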
samcv MasterDuke, if i bump MVM_FSA_BINS by 2x we don't overflow it during core setting compilation, well before it said 6.2GB max allocated 05:35
after it showed 5.1
increased it by 4x and it gets 1.5GB of mallocs 05:41
seems to shave 2s off core setting compilation 05:46
06:21 domidumont joined 06:24 domidumont joined 06:31 domidumont joined 06:53 brrt joined 08:12 leont joined 08:29 robertle_ joined
jnthn samcv: What does it do to total memory use, though? Also, what about the memory use of perl6 -e '' 09:13
dogbert17 to get everyone started: dilbert.com/strip/2017-10-02 09:19
09:19 brrt joined
nwc10 nice punchline in the 3rd panel 09:19
09:24 ilbot3 joined
dogbert17 so the gains will be realized later 09:24
09:29 ilbot3 joined 09:56 brrt joined 10:10 lizmat joined
brrt jnthn: thanks for reviewing! 10:34
timotimo jnthn: we could have a point where we allocate fewer buckets in the same page, for the very big pages 10:54
yoleaux 02:14Z <MasterDuke> timotimo: down to only 12 errors in stage parse, all exactly the same (gist updated with an example)
jnthn timotimo: We could, yeah, I pondered that before. It's probably sensible. 10:55
brrt: Welcome :-) Thanks for writing us a new JIT ;)
timotimo oh, the review is done? awesome!
brrt++
jnthn Yeah, provided brrt is happy for it to do so, it can go in :) 10:56
timotimo i really like the sound of that
lizmat is looking forward to the final grant report :-) 10:57
jnthn I semi-pondered "it's the Star release this month" but...given how many far more risky things landed in Rakudo this month, I'd say expr JIT is some way down the risks list :)
timotimo yeah 10:59
i'll look into bigger bins getting smaller :)
jnthn I suspect if I work on anything else in Rakudo ahead of this month's release, it'll be hyper/race, which are so broken at the moment anyway I almost can't make it worse :)
timotimo i hear ya :|
jnthn But yeah, overall let's try and be a bit cautious about what we put in over the week leading up to the next bunch of releases 11:00
The Star ones do get wider use
timotimo we still have one and a half weeks or so, right? 11:01
jnthn Yeah, indeed 11:02
timotimo could even add logic to the fsa to make the first page for a given bin smaller than all the rest 11:03
to better handle cases like "bin 91 gets five items ever, but 90 and 92 get hundreds"
we have setup_bin and add_page, so the distinction is already there in code 11:04
oh, no, not quite
oh, no, it does add a page
a very first page 11:05
i made it limit page sizes to 32kbyte, which means a page for bin 95 holds 42 elements 11:18
and i might make the size limit for the very first page half or quarter that
jnthn OK 11:19
Let's try that :)
timotimo let's see, sam wanted to bump the count by 4 ... or really 4x?
oh wow 11:20
for perl6 -e i now get a good view of which sizes get how many pages allocated 11:21
11:21 domidumont joined
timotimo a whole lot of 'em only get the initial page, even though i quartered its size 11:21
hm, the maxresident difference is rather small it seems 11:23
hm, it looks like i might actually be using more memory
nope, there was some other difference in my measurements 11:24
only like 80k on -e '' 11:25
the difference is more pronounced for -e 'say "hi"' 11:27
jnthn Save 80k?
nwc10 at a guess, is that because -e '' doesn't even allocate some bucket sizes?
jnthn Yeah, probably that
timotimo m: say 75236 / 75458
camelia 0.997058
timotimo m: say 75236 - 75458
jnthn ooh, lunch time :)
camelia -222
timotimo about 200k saved in that scenario
let's do a real measurement: core setting compilation 11:29
11:43 AlexDaniel joined
AlexDaniel squashable6: status 11:45
squashable6 AlexDaniel, Next SQUASHathon in 2 days and ≈22 hours (2017-10-07 UTC-12 to UTC+14). See github.com/rakudo/rakudo/wiki/Mont...Squash-Day
11:50 zakharyas joined
timotimo looks like it actually gets worse from my changes 11:50
Geth MoarVM/fsa_tune_page_sizes: 15ba542eea | (Timo Paulssen)++ | src/core/fixedsizealloc.c
limit FSA pages to 32k (8k for very first page)

helps perl6 -e '' and -e 'say "hi"' a lot, but seems to actually increase memory usage in a core setting compilation.
11:52
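(A hedged sketch of the page-size tuning in that branch: cap the bytes per FSA page at 32 KB, 8 KB for a bin's very first page, and derive how many items fit; the minimum-items guard and all names/constants are illustrative assumptions, not the branch's actual code.)

```c
/* With 96 bins at an assumed 8-byte step, bin 95 holds 768-byte items,
 * so a 32 KB page holds 32768 / 768 = 42 of them - matching the "42
 * elements" figure above. The first page for a bin is kept smaller so
 * rarely-used bins waste less memory. */
#define PAGE_BYTES_CAP        32768
#define FIRST_PAGE_BYTES_CAP   8192
#define MIN_ITEMS_PER_PAGE        8   /* guard against pages of 0 or 1 items */

static unsigned items_per_page(unsigned item_bytes, int is_first_page) {
    unsigned cap   = is_first_page ? FIRST_PAGE_BYTES_CAP : PAGE_BYTES_CAP;
    unsigned items = cap / item_bytes;
    return items < MIN_ITEMS_PER_PAGE ? MIN_ITEMS_PER_PAGE : items;
}
```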
timotimo though of course the results will be different still with more stuff using the fsa 11:54
lizmat jnthn timotimo: could you give me a sanity check wrt BUILDALL?
if a class has an empty BUILDPLAN (like a mixin that doesn't add any attributes)
doesn't that imply I don't need to generate a BUILDALL for that class, as its first parent will have the correct BUILDALL already generated ? 11:55
the invocant signature might not be 100% correct, but still valid anyway
timotimo hm, that does sound sensible 11:57
maybe i want something a little bit faster than the core setting compilation for measuring this :| 11:58
jnthn lizmat: If it has no attributes, then yeah, that sounds reasonable 12:03
lizmat that would save quite a few generated methods, as it's quite common to only mixin methods, and not attributes 12:04
timotimo good point
12:10 AlexDaniel joined
timotimo 1342402 is the average before 12:17
1341298 is the average after
"1338332".."1343264" 12:18
that's the minmax of "after"
lizmat 1100 bytes difference ?
timotimo "1339700".."1343508" 12:19
that's the minmax of before
that's kbytes
lizmat ah, so ~1MB difference
or a bit more... 12:20
timotimo m: say 1338332 - 1343264; say 1339700 - 1343508
camelia -4932
-3808
timotimo that's how noisy the measurement is 12:21
the measurements for "with my patch" are further apart 12:22
i'm not sure if it makes sense to use these measurements, given the amount of noise is ~4x the difference
Geth MoarVM/even-moar-jit: dc3d40fb22 | (Bart Wiegmans)++ | 4 files
Improve the expression JIT documentation

Add a document describing its most important components (expression template processor / tree builder, tiler, and register allocator).
12:27
MoarVM/even-moar-jit: 6de7455e9a | (Bart Wiegmans)++ | 2 files
More documentation fixes

Some of the things in tiles.md were no longer true
MoarVM/even-moar-jit: 7f7ce9ca40 | (Bart Wiegmans)++ | 3 files
^cu_string - is lazy-loaded so use wrapper

The direct access of MVMCompUnit->body.strings was a legacy from simpler days when compunit strings were loaded eagerly. As they're now using lazy loading, that isn't really valid anymore.
Possible future development would be to force eager loading during JIT compilation and/or upgrading to second-generation memory.
12:28
brrt oh, oops, we're seeing a segv
12:28 AlexDaniel joined
Geth MoarVM/even-moar-jit: 33270f003d | (Bart Wiegmans)++ | src/jit/macro.expr
MVM_cu_string - second argument is *cu

Not idx, oops
12:31
timotimo i wonder what the main source of nondeterminism in core setting compilation is, the one that causes memory usage to vary so drastically 12:40
could it just be spesh? 12:41
brrt tbh i don't find spesh to be very nondeterministic 12:45
timotimo core setting compilation doesn't start any other threads, so it also won't do the logs it gets in different orders every time 12:46
lizmat random hash order ? 12:50
timotimo hm, how often do we iterate over hashes in compilation i wonder
lizmat I have no idea :-) 12:51
brrt i expect quite often 12:55
timotimo but we don't randomize hashes yet, do we? 12:59
lizmat timotimo: not sure 13:03
14:26 zakharyas joined 14:43 AlexDaniel joined
timotimo is idly playing around with systemtap 14:56
oh geez 14:57
tried to record stack traces for every fsa_alloc hit
it's filling up my disk good
i can't stop it o_O 14:58
now it did stop and the resulting file is apparently b0rken 14:59
timotimo frees up some disk space ... 15:02
15:06 brrt joined
brrt i'm investigating three spectest failures 15:28
t/spec/S29-os/system.rakudo.moar test 35 15:29
t/spec/S28-named-variables/init-instant.t
t/spec/S17-supply/watch-path.t
they are individually successful 15:30
so, any objections to me pushing the merge button? :-)
Zoffix \o/ \o/ \o/ 15:31
jnthn I've seen the first happen for a while
Not every time, but now and then
Others I dunno about
But...yeah, let's merge it
timotimo yeeeeaaahhhh
jnthn Sounds like timing or other issues if they work outside of harness
timotimo a sizable portion of allocations is from the nfa (in perl6 -e 'say "hi"') 15:33
almost a quarter 15:34
but that's only "how many times is the allocator called", it ignores the size of each allocation
and i think it skipped some events because I/O was too slow
jnthn I already reduced its allocations by a good bit before, I think
iirc the remaining one is the result 15:35
Which we need to hand back, and might live on for a while
timotimo yeah, it's not bad 15:36
the code, i mean
just an observation
another big chunk is from hash entries
brrt thank you everybody for your incredible patience 15:38
timotimo not sure i can get much sensible information out of this any more
but it was good to refresh my memory of how the systemtap portion of perf works 15:39
brrt also, geth is dead, maybe 15:40
jnthn Aww, no commit report
brrt++ though
Already built it here :)
brrt cool :-)
brrt hoping for the best
[Coke] did it merge to master? (do I need to build rakudo with nqpmaster and moarmaster?) 15:42
jnthn [Coke]: To MoarVM master, yes. No version bumps as yet 15:43
[Coke] building on win & mac.. 15:44
(src\profiler\heapsnapshot.c(823): warning C4293: '<<': shift count negative or too big, undefined behavior) 15:49
Zoffix wow 15:50
brrt++
\o/
brrt i'll take a look, although that is not part of the branch :-) 15:51
[Coke] brrt: that was not meant for you specifically. ;) 15:52
brrt [Coke] what platform are you on? 15:54
32 bit by any chance?
seems like your long is not long enough :-)
anyway, rather than 1l, probably better to write UINT64_C(1)
which i think is defined in stdint.h
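(As a hedged illustration of brrt's point: on LLP64 Windows `long` is 32 bits, so a literal like `1l` cannot be shifted past bit 31. The helper name and shift count below are made up, not the code in heapsnapshot.c.)

```c
#include <stdint.h>

/* `1l << n` with n >= 32 is undefined on 64-bit Windows, where long is only
 * 32 bits, and triggers warning C4293. UINT64_C(1) from <stdint.h> forces a
 * 64-bit constant so the shift is always well defined. */
static uint64_t nth_bit(unsigned n) {
    return UINT64_C(1) << n;   /* instead of 1l << n */
}
```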
[Coke] brrt: that is on a win64 (I think) win 10 vm.
brrt hmm, that's somewhat surprising 15:55
[Coke] 64-bit, x64
brrt i think i may have seen that on the JIT as well on windows
[Coke] using ms vs Community 2017 15:56
brrt anyway, can you pls check if this helps: gist.github.com/bdw/cb8ecce419ec0a...2e883b5dc9
i have a VM as well, i just don't feel like starting it up :-)
15:56 zakharyas joined
[Coke] brrt: let me finish the as-is build first, then will re-try that. 15:56
brrt also, i will *not* be online for the coming afternoon / night, so, if any problems arise, i can't respond; if trouble is severe, you know where the revert button is 15:57
i don't expect it very much, but just so you know :-)
[Coke] spectest clean on mac. 15:58
brrt \o/
[Coke] nativecall cpp tests still failing on win 10 `nmake test`, no change there. kicking off win10 spectest. 15:59
(as I recall, we have a lot of win failures atm so I'm not sure this will show anything :|) 16:01
if `perl6 -V | grep -i jit` has moar::jit_arch set to a value, does that mean I have a jit? 16:02
(or is there a better way to tell?) 16:06
bartolin it would be great if someone could take a look at github.com/MoarVM/MoarVM/pull/714 . currently opening a socket to a remote host is broken on freebsd.
jnthn hm, thought i already reviewed/merged that one... 16:10
Geth MoarVM: d04c8dccbc | usev6++ | src/io/syncsocket.c
Fix getaddrinfo failing with EAI_HINTS on FreeBSD

This fixes spectest failures (e.g. in S32-io/IO-Socket-INET.t) on FreeBSD.
According to the man page of getaddrinfo() '[a]ll other elements of the addrinfo structure passed via hints must be zero or the null pointer'
  (similar wording on Linux). This requirement is actually enforced on
FreeBSD:
   github.com/freebsd/freebsd/blob/89...nfo.c#L429
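(A minimal sketch of the pattern that commit describes: zero the whole hints struct before filling in the fields you need, since FreeBSD enforces that the unused members are zero. The particular family/socktype values are illustrative, not necessarily what syncsocket.c sets.)

```c
#include <string.h>
#include <sys/types.h>
#include <sys/socket.h>
#include <netdb.h>

int resolve_host(const char *host, const char *port, struct addrinfo **res) {
    struct addrinfo hints;
    memset(&hints, 0, sizeof(hints));   /* the crucial part: zero everything first */
    hints.ai_family   = AF_UNSPEC;      /* IPv4 or IPv6 */
    hints.ai_socktype = SOCK_STREAM;
    return getaddrinfo(host, port, &hints, res);   /* 0 on success */
}
```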
MoarVM: 3da7cce276 | (Jonathan Worthington)++ (committed using GitHub Web editor) | src/io/syncsocket.c
Merge pull request #714 from usev6/getaddrinfo_hints

Fix getaddrinfo failing with EAI_HINTS on FreeBSD
MoarVM: 23c16b3031 | (Patrick Sebastian Zimmermann)++ | 2 files
Probe for gcc -Werror=* support. This allows building MoarVM on older GCCs.

The -Werror=* probe has to run before setting the cc/ldmiscflags. The compiler_usability check only makes sense before the -Werror=* probe. Thus that one is also moved earlier.
16:11
MoarVM: 48f5efaaf9 | (Jonathan Worthington)++ (committed using GitHub Web editor) | 2 files
Merge pull request #696 from patzim/master

Probe for gcc -Werror=* support
bartolin thanks anyway :-) 16:12
Geth MoarVM/master: 9 commits pushed by Mario++, M++, (Jonathan Worthington)++ 16:13
[Coke] looks like win64 spectest just hung with 3 remaining tests. 16:42
16:45 travis-ci joined
travis-ci MoarVM build canceled. Jonathan Worthington 'Merge pull request #696 from patzim/master 16:45
travis-ci.org/MoarVM/MoarVM/builds/282804503 github.com/MoarVM/MoarVM/compare/3...f5efaaf92a
16:45 travis-ci left 16:55 lizmat joined 17:20 domidumont joined
Zoffix stresstests and version bumps 17:47
"2017.09.1-553-ga4fef0b" that's quite a high number :D 17:50
18:03 travis-ci joined
travis-ci MoarVM build passed. Jonathan Worthington 'Merge pull request #692 from duke-m/patch-2 18:03
travis-ci.org/MoarVM/MoarVM/builds/282805138 github.com/MoarVM/MoarVM/compare/4...fef0bd36cc
18:03 travis-ci left 18:20 AlexDaniel joined 19:09 brrt joined 19:21 zakharyas joined
samcv jnthn, 4x is max 65.5MB of memory usage. compared to 56.7MB 19:30
timotimo samcv: is that actually going from ~90 bins to ~360? 19:34
samcv yeah
96*4
and with 4x we only have 1.2GB alloced with FSA instead of 6.2GB
timotimo wow 19:35
samcv for core setting compilation. doing 2x, we have 5.1GB malloced in FSA
sorry i should say malloced, not alloced
jnthn samcv: Remind what you're tweaking this for? :)
samcv also core setting is 2s faster
reducing the mallocs
that the FSA has to do
it seems to make many GB more than is really needed 19:36
optimally
jnthn Yeah, but I thought you had a particular use case that 4x covers? 19:37
samcv i was recording the setting compilation
jnthn Ah, OK
I thought it was something about a KMP table fitting in or something 19:38
samcv from 6.2GB malloced to 1.2GB
nah
jnthn ah, OK 19:39
But we gain 10MB extra memory use for doing nothing?
timotimo did i already push my tuning branch for the fsa page size?
it would probably want a minimum items per page, too
otherwise it'll go down to 0 items per page, or 1 19:40
also, perhaps we should make bigger steps after a given page size 19:42
item size i mean 19:45
jnthn Yeah, possibly that also 19:48
samcv jnthn, yep we gain 10MB extra memory doing nothing 19:49
timotimo samcv: check the fsa_tune_page_sizes branch 19:50
give it a minimum "effective_item_count"
then let's see if it helps at all
samcv how do i get a branch i don't have
timotimo it should be enough to, after "git fetch", just git checkout fsa_tune_page_sizes
it ought to set it up to track origin/that_branch_name for you 19:51
samcv that does not work
oh i got it now
jnthn samcv: OK, then further tweakery needed to try and have our cake and eat it, IMO
I suspect something along the lines of what timotimo has looked at may do it
samcv and timotimo's branch has 52.8MB peak. so down 4mb 19:52
will try the setting now
timotimo i tried measuring the core setting
and the difference between two runs was enormous 19:53
samcv you mean two runs of the same branch?
timotimo yes 19:54
samcv interesting...
timotimo so you might have to be extra careful in your measurements, too 19:55
samcv so i got 1.2GB peak. which is similar to having the bin 4x 19:57
so that seems good
err wait. no it's 1.6GB
i was looking at the peak. 1.6G is the amount allocated by the FSA in total
timotimo you're using heaptrack for this?
samcv err no. the amount allocated by malloc 19:59
my bad
but that's much closer to the 1.2Gb i got :)
yeah 20:00
i'm looking at the total amount that was malloc'd and comparing it. though i should also compare the peak usage as well
20:04 leont_ joined 20:40 lizmat joined
timotimo i wonder if we'd benefit from a "pre-size this hash for n elements" operation that we can call in the deserialization code 20:46
MasterDuke timotimo: did you try with MVM_SPESH_BLOCKING=1? did that reduce the variability? 20:49
timotimo i did not
MasterDuke oh, and how were you measuring? 21:03
timotimo just "time" on the commandline
MasterDuke nice and simple (and doesn't slow the build!) 21:05
samcv i'm going to case the switches for 8bit and ascii strings being joined (flat) 21:31
tests show 2.4x speed improvement joining very long 8bit strings 21:32
(that are flat)
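(A hedged sketch of what such a flat 8-bit fast path can look like in general: when every piece is a flat byte buffer, the result can be built with plain memcpy instead of walking a grapheme iterator per piece. The types here are invented for the example and are not MoarVM's string representation.)

```c
#include <stdlib.h>
#include <string.h>

struct Piece8 { const unsigned char *bytes; size_t len; };

/* Join n flat 8-bit pieces with a flat 8-bit separator by copying bytes. */
static unsigned char *join_flat_8bit(const struct Piece8 *pieces, size_t n,
                                     const struct Piece8 *sep, size_t *out_len) {
    size_t total = n > 1 ? (n - 1) * sep->len : 0;
    size_t i, pos = 0;
    unsigned char *out;

    for (i = 0; i < n; i++)
        total += pieces[i].len;

    *out_len = total;
    out = malloc(total ? total : 1);
    if (!out)
        return NULL;

    for (i = 0; i < n; i++) {
        if (i) { memcpy(out + pos, sep->bytes, sep->len); pos += sep->len; }
        memcpy(out + pos, pieces[i].bytes, pieces[i].len);
        pos += pieces[i].len;
    }
    return out;
}
```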
lizmat m: use experimental :collation; $*COLLATION.set(:!tertiary); dd "a" coll "A" # samcv: is that correct ? 21:33
camelia Order::More
21:33 zakharyas joined
samcv ah. quaternary needs to be disabled 21:33
that breaks ties by codepoint
m: use experimental :collation; $*COLLATION.set(:!tertiary, :!quaternary); dd "a" coll "A" 21:34
camelia Order::More
samcv err maybe i spelled it wrong
lizmat yeah :-)
samcv m: use experimental :collation; $*COLLATION.set(:!tertiary, :!quaternary); dd "a" coll "A"
camelia Order::More
samcv m: use experimental :collation; $*COLLATION.set(:!tertiary, :!quaternary); say $*COLLATION
camelia collation-level => 5, Country => International, Language => None, primary => 1, secondary => 1, tertiary => 0, quaternary => 0
samcv 5 21:35
that sounds right
lizmat 5?
samcv hm
lizmat but More should be Same, right?
samcv yeah 5 is right. but More should be same when quaternary is removed 21:36
looks like a bug. will look at it in about an hour when i get back
lizmat shall I rakudobug it ?
samcv yeah
lizmat oki
samcv: RT #132216 21:40
synopsebot RT#132216 [new]: rt.perl.org/Ticket/Display.html?id=132216 'a' coll 'A" not Same but More
Geth MoarVM: d5db8486bb | (Timo Paulssen)++ | src/spesh/stats.c
skip stats for frames beyond spesh max bytecode size

an example file with a gigantic mainline - a huge hash literal - spent more than 95% of its time inside by_offset for the benefit of a frame that was going to be ignored by the planner anyway. This makes it as fast as running without spesh.
21:59
jnthn Hm, though that happens on another thread... 22:00
Nice catch, though :-)
timotimo it does
jnthn Though I wonder 22:01
timotimo but the regular thread waits for spesh to reach its gc sync point
jnthn Ah
Maybe we should not log such huge frames at all..hmm
timotimo that'd be a check inside the instructions that do logging
jnthn Though we'd have to look at the bytecode size on frame entry when we're logging
timotimo yes
jnthn No, I was thinking of doing it in MVM_frame_invoke
If we don't give it an entry record or correlation ID
Then it won't log anything for it 22:02
And given we'll never spesh it, that's fine
timotimo oh, that would prevent all ops from logging with a check that's already in place anyway
jnthn Yeah
So then we'd not even write into the log
timotimo yes, that'd increase spesh efficiency, too
jnthn aye 22:03
May help some of the giant test files too :)
timotimo it could very well!
jnthn Was the giant file with a hash in actually constructing it in the module mainline?
If it's all constant data, sticking a BEGIN in front of it would mean we make it at compile time and serialize it, which'd be faster still :) 22:04
timotimo yup, it's just "our %systems = ( ... )" 22:05
but you're right
i hadn't thought of that
a bit shameful that we use about a gig maxrss to compile this beast 22:06
i'll put "constant" in front, that should have the same effect
jnthn Yeah
Then it should load faster still 22:07
timotimo i'll have numbers in a mo'
ah, snap 22:08
it then has to be %( ) rather than just ( )
wow, yeah 22:10
that's much, much faster
from about 0.88 down to about 0.26 22:11
now i'm checking if the association id thing works 22:16
yeah, it has the speed increase effect 22:17
i'll run "make test" and "make spectest" this time, though
MasterDuke association id?
timotimo yeah, in order to do spesh logging we give every frame an ID 22:18
if a frame has no ID, no logging can take place
MasterDuke ah
timotimo that's not quite accurate 22:20
it also interplays with the simulated stack and such 22:21
i.e. spesh recreates what it thinks the callstack looked like when the log was created, so we don't have to do too complicated computations when creating the individual log entries
MasterDuke and we can just skip all that if the frame size is too big? 22:25
timotimo aye
jnthn: you think this can have bad effects if a huge frame comes between two frames that get speshed?
Geth MoarVM: d0646fafb9 | (Timo Paulssen)++ | 2 files
don't even generate log entries for huge frames

not giving a frame a correlation ID prevents any logging from taking place; the logs are not filled with useless data, and there are fewer runs for the spesh worker to do.
22:27
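(A hedged sketch of the approach in that commit: decide at frame-invocation time whether the frame gets a correlation ID at all, so the existing "no ID, no logging" check covers huge frames for free. The threshold and names are placeholders, not MoarVM's actual identifiers.)

```c
#define SPESH_MAX_BYTECODE_SIZE 4096   /* assumed planner cutoff, not MoarVM's real value */

struct Frame { unsigned bytecode_size; unsigned correlation_id; };

static unsigned next_correlation_id(void) { static unsigned id = 0; return ++id; }

static void assign_correlation_id_sketch(struct Frame *frame) {
    if (frame->bytecode_size > SPESH_MAX_BYTECODE_SIZE)
        frame->correlation_id = 0;   /* no ID: the logging ops see this and skip the frame */
    else
        frame->correlation_id = next_correlation_id();
}
```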
timotimo wouldn't have been able to come up with this as fast if not for MasterDuke being my rubber duck :) 22:28
jnthn timotimo: It'll cope
The worst that'd happen is it infers something wrong from the stats and sticks a guard in that always fails, but that would be quite unlikely
MasterDuke "those who can't do, duck"? 22:29
timotimo haha
hm, lock-async seems to be hanging, but i think it also hangs for others 22:30
nope, it finished
just took a while
jnthn That one's odd; on my office machine it always completes fast. In my VM at home it often does, then occasionally goes super slow. 22:32
Always completes eventually though
22:40 bloatable6 joined 22:45 buggable joined 22:57 arnsholt joined