#moarvm on 29 October 2015 - Raku Programming Language Log

00:25 tokuhirom joined 01:09 tokuhirom joined 01:20 TimToady joined 02:37 vendethiel joined 02:47 ilbot3 joined 04:08 TimToady joined 04:13 KDr2 joined 05:24 ingy joined 05:36 FROGGS joined 06:41 FROGGS_ joined 06:47 FROGGS__ joined 06:57 nwc10 joined 07:44 FROGGS joined 08:38 kjs_ joined 09:24 zakharyas joined 10:12 kjs_ joined
jnthn	.tell Hotkeys I didn't actually change how \r behaves yet, just did the groundwork. Also, unless you're building a MoarVM at HEAD, rather than the version Rakudo will pick by default, you'll not have any of my changes yet anyway.	10:12	Copy link Message link Add to gist Remove
	oh, no bot		Copy link Message link Add to gist Remove
	Hotkeys: I didn't actually change how \r behaves yet, just did the groundwork. Also, unless you're building a MoarVM at HEAD, rather than the version Rakudo will pick by default, you'll not have any of my changes yet anyway.		Copy link Message link Add to gist Remove
	psch: \r\n will become a synthetic, not identical to \n, just to be clear.	10:13	Copy link Message link Add to gist Remove
10:48 brrt joined
brrt	\o	10:49	Copy link Message link Add to gist Remove
	guess who had to fix... yet another compilation bug this morning?	10:50	Copy link Message link Add to gist Remove
jnthn	brrt? :)	10:52	Copy link Message link Add to gist Remove
brrt	jnthn++ :-P		Copy link Message link Add to gist Remove
	apparantly, accessing the lower bytes of rsp-rdi (registers 4-7) dynamically in x64 means you have to add a REX byte	10:53	Copy link Message link Add to gist Remove
jnthn	This REX byte seems to cause plenty of fun...	10:54	Copy link Message link Add to gist Remove
brrt	because otherwise the address is understood as the second byte of the lower 4 (rax-rbx) registers		Copy link Message link Add to gist Remove
	even if you don't actually address any of the regular extended registers		Copy link Message link Add to gist Remove
	'fun'		Copy link Message link Add to gist Remove
	why is this? GOD ONLY KNOWS	10:55	Copy link Message link Add to gist Remove
	i'm also seeing that my approach to register-allocation-over-conditional-and-call-boundaries was oversimplisitc	10:58	Copy link Message link Add to gist Remove
	as it happens, online algorithms suck		Copy link Message link Add to gist Remove
	whereby 'suck' has a technical definition meaning 'make things much more complicated and difficult to analyse'		Copy link Message link Add to gist Remove
	anyway... i was hoping i could get the compiler to run with just bugfixing the current setup. but the complexity of it feels like it's spiralling	11:03	Copy link Message link Add to gist Remove
	so i'm wondering if i should move the register allocator to an offline phase, as well as having the tiles in linear memory order		Copy link Message link Add to gist Remove
	one of the things that bothered me slightly is that if i linearize the tiles to an array, then i can't splice in spills and loads easily.	11:05	Copy link Message link Add to gist Remove
	which means one of three things: a): i linearize to something else than an array, like a linked list, which is kind of not-so-fun with regards to allocation (or i should use the spesh allocator, but I don't like that much);	11:06	Copy link Message link Add to gist Remove
	b): i add a second array which is walked next to the first array that holds spills and stores (i kind of like that solution, but it is complex)	11:07	Copy link Message link Add to gist Remove
	c): i do spills and stores inline		Copy link Message link Add to gist Remove
	but i'm not sure how c) interoperates with the idea of making the register allocation step an offline step	11:08	Copy link Message link Add to gist Remove
jnthn	Hmmm		Copy link Message link Add to gist Remove
brrt	(having register alloc as a offline step is basically compiler best practice, or so i've heard)\		Copy link Message link Add to gist Remove
jnthn	For (a) what's wrong with using the spesh allocator?		Copy link Message link Add to gist Remove
brrt	consistency. i use the spesh allocator for nothing, and then bam!, it reappears	11:09	Copy link Message link Add to gist Remove
	although......		Copy link Message link Add to gist Remove
	hmmm		Copy link Message link Add to gist Remove
	i could actually use the spesh allocator to hold the info nodes		Copy link Message link Add to gist Remove
	the info node array is quite redundant as all the tree 'pointers' and constants also get a info node	11:10	Copy link Message link Add to gist Remove
timotimo	well, the spesh allocator is good for things that are unevenly sized and that you're going to throw away completely at the end		Copy link Message link Add to gist Remove
brrt	aye		Copy link Message link Add to gist Remove
timotimo	so it seems like a good fit		Copy link Message link Add to gist Remove
brrt	it is a good fit		Copy link Message link Add to gist Remove
	brrt wonders if i want random access to the tiles		Copy link Message link Add to gist Remove
	ok, that is a good idea, i think	11:14	Copy link Message link Add to gist Remove
timotimo	i've heard a thousand times that linked lists never outperform arrays even if you do inserts in the middle ... or something like that		Copy link Message link Add to gist Remove
	because ... CACHES!		Copy link Message link Add to gist Remove
jnthn	yeah but the spesh allocator sticks the nodes in order anyway :P		Copy link Message link Add to gist Remove
timotimo	L1 cache can give you multiple gigs per second throughput! doesn't matter that it's still small and has to grab data from RAM at a much, much slower rate all the time	11:15	Copy link Message link Add to gist Remove
jnthn	That's why I picked that kind of design		Copy link Message link Add to gist Remove
	Thus a bunch of MVMSpeshIns will be continguous, unless you hit a point something was spliced in.		Copy link Message link Add to gist Remove
	So it's relatively cache friendly		Copy link Message link Add to gist Remove
brrt	i think knuth's old quote applies really well here	11:16	Copy link Message link Add to gist Remove
	timotimo: just do multiple gigs of computation on 8k of memory or so :-P	11:17	Copy link Message link Add to gist Remove
	brrt wonders if that is actually done anywhere		Copy link Message link Add to gist Remove
timotimo	i'm still wondering if my idea for a Big Data product that handles "Big Data data sets as large as 100x your L1 cache effectively" will be seen as revolutionary and awesome	11:18	Copy link Message link Add to gist Remove
brrt	lol	11:21	Copy link Message link Add to gist Remove
	'big' data like ... a gigabyte :-o	11:22	Copy link Message link Add to gist Remove
	fwiw, did anybody see the 'go is a poorly designed language' article anywhere?		Copy link Message link Add to gist Remove
	it's funny because all the things the author considers as 'poor design' are quite logical imho		Copy link Message link Add to gist Remove
timotimo	oh, does it say "its types are spelled totally differently from C"?	11:23	Copy link Message link Add to gist Remove
brrt	no, actually, it doesn't	11:24	Copy link Message link Add to gist Remove
	it says 'i want my negative array indexing to work like python and it doesn't :-('		Copy link Message link Add to gist Remove
	while negative array indices are a huge cost in a potential fast path		Copy link Message link Add to gist Remove
	jnthn heard Go had attracted a bunch of Python folks, but isn't sure how accurate that is	11:25	Copy link Message link Add to gist Remove
brrt	perhaps not when they are constant (you can constant-fold it away), but definitely variable indices		Copy link Message link Add to gist Remove
	yeah, well, if i were to go and blog about how python doesn't support sigils, i'd look ridiculous, no?		Copy link Message link Add to gist Remove
timotimo	wouldn't that be fun?	11:26	Copy link Message link Add to gist Remove
brrt	yes. yes it would		Copy link Message link Add to gist Remove
	'splicing stuff out doesn't look easy' - that's because it isn't easy	11:27	Copy link Message link Add to gist Remove
	and cheap		Copy link Message link Add to gist Remove
	'declaring a variable in a new scope using shorthand notation shadows my outer scope variable' - i'm not even sure what anybody should expect	11:28	Copy link Message link Add to gist Remove
	i'm going to write an article about how go doesn't have a whateverstar and how that makes it a sucky language	11:29	Copy link Message link Add to gist Remove
timotimo	A/B-test it against an article about python not having a whateverstar	11:31	Copy link Message link Add to gist Remove
brrt	that...	11:32	Copy link Message link Add to gist Remove
	is an excellent idea		Copy link Message link Add to gist Remove
dalek	arVM: 385e498 \| jnthn++ \| src/strings/normalize.c: Make NFG algorithm use Unicode Grapheme Clusters. As described in Annex #29. We do all of it except the CRLF case, as enabling that even breaks our ability to parse Perl 6 code (will need to figure out why). Aside from the CRLF case, though, we now pass all the Unicode grapheme boundary tests (that is, we get the .chars that are expected).	12:13	Copy link Message link Add to gist Remove
	arVM: 82f93f7 \| jnthn++ \| src/strings/normalize.h: Toss #define we ended up not needing.	12:14	Copy link Message link Add to gist Remove
	arVM: 3519077 \| jnthn++ \| src/strings/normalize.h: Slightly simplify a conditional.		Copy link Message link Add to gist Remove
jnthn	Time to see what the spectest fallout of the NFG algorithm change will be... :)	12:15	Copy link Message link Add to gist Remove
nwc10	spectests weren't clean before	12:19	Copy link Message link Add to gist Remove
lizmat	yeah, were 4 files failing for me yesterday eve		Copy link Message link Add to gist Remove
nwc10	they are in a state of sin a bit too often for my liking		Copy link Message link Add to gist Remove
jnthn	With MOAR_REVISION or with master?	12:20	Copy link Message link Add to gist Remove
	Seems 7 test files are vicitms of the change	12:23	Copy link Message link Add to gist Remove
	Well, have tests that are vicitms	12:24	Copy link Message link Add to gist Remove
	lunch &	12:27	Copy link Message link Add to gist Remove
nwc10	jnthn: "nom", I believe is the culprit	12:38	Copy link Message link Add to gist Remove
	more spectests fail, but eyeballing the summary, doesn't look like anything surprising	13:01	Copy link Message link Add to gist Remove
	jnthn back	13:06	Copy link Message link Add to gist Remove
brrt	\o jnthn	13:09	Copy link Message link Add to gist Remove
jnthn	Turns out 1 failing file was 'cus I needed to patch Str.perl (now done)	13:19	Copy link Message link Add to gist Remove
	Next 2 were tests that don't make sense under NFG semantics.		Copy link Message link Add to gist Remove
	And that we didn't spot last time around 'cus the NFG algo was insufficient.		Copy link Message link Add to gist Remove
	m: say uniprop("\x1B3D", 'General_Category')	13:30	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«Mc␤»		Copy link Message link Add to gist Remove
jnthn	m: say "\x1B3D".NFD	13:31	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«NFD:0x<1b3c 1b35>␤»		Copy link Message link Add to gist Remove
jnthn	m: say "\x1B3D".chars		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«1␤»		Copy link Message link Add to gist Remove
jnthn	m: say "\x1B3D".ord		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«6973␤»		Copy link Message link Add to gist Remove
jnthn	wtf		Copy link Message link Add to gist Remove
	m: say 0x1B3D	13:32	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«6973␤»		Copy link Message link Add to gist Remove
jnthn	Locally		Copy link Message link Add to gist Remove
	> say "\x1B3D".ord		Copy link Message link Add to gist Remove
	6972		Copy link Message link Add to gist Remove
	o.O		Copy link Message link Add to gist Remove
	m: say Uni.new(0x1B3D).Str.ord.base(16)	13:34	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«1B3D␤»		Copy link Message link Add to gist Remove
jnthn	> say Uni.new(0x1B3D).NFC	13:35	Copy link Message link Add to gist Remove
	NFC:0x<1b3d>		Copy link Message link Add to gist Remove
	Not NFC that got busted		Copy link Message link Add to gist Remove
	> say Uni.new(0x1B3D).Str.ord.base(16)		Copy link Message link Add to gist Remove
	1B3C		Copy link Message link Add to gist Remove
	I don't even...	13:36	Copy link Message link Add to gist Remove
	m: say Uni.new(0x1B3D).NFD.NFC	13:45	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«NFC:0x<1b3c 1b35>␤»		Copy link Message link Add to gist Remove
jnthn	ah	13:46	Copy link Message link Add to gist Remove
brrt	what, how		Copy link Message link Add to gist Remove
jnthn	m: say uniname(0x1B3D)		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«BALINESE VOWEL SIGN LA LENGA TEDUNG␤»		Copy link Message link Add to gist Remove
jnthn	m: say uniname(0x1B3C)		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«BALINESE VOWEL SIGN LA LENGA␤»		Copy link Message link Add to gist Remove
jnthn	m: say uniname(0x1B35)	13:47	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«BALINESE VOWEL SIGN TEDUNG␤»		Copy link Message link Add to gist Remove
jnthn	m: say uniprop(0x1B3C, 'General_Category')		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«Mn␤»		Copy link Message link Add to gist Remove
jnthn	m: say uniprop(0x1B3D, 'General_Category')		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«Mc␤»		Copy link Message link Add to gist Remove
jnthn	m: say uniprop(0x1B35, 'General_Category')		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«Mc␤»		Copy link Message link Add to gist Remove
jnthn	Yowser. That's a bit of an interesting problem.	13:48	Copy link Message link Add to gist Remove
nwc10	It's not at all obvious to me why (or what's wrong)	13:49	Copy link Message link Add to gist Remove
	I have not read this yet: morepypy.blogspot.co.at/2015/10/pyp...us+Blog%29		Copy link Message link Add to gist Remove
jnthn	nwc10: Well, it came from a test regression		Copy link Message link Add to gist Remove
	brrt should probably subscribe to their blog since it is interesting	13:50	Copy link Message link Add to gist Remove
jnthn	m: say uniprop(0x1B3D, 'NFC_QC')		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar da8881: OUTPUT«Y␤»		Copy link Message link Add to gist Remove
nwc10	I'm just looking at what's on planetpython.org/		Copy link Message link Add to gist Remove
brrt	although i recently lost all my subscriptions when i forgot to copy a opml file		Copy link Message link Add to gist Remove
jnthn	So, that character passes the NFC quick-check		Copy link Message link Add to gist Remove
	Which implies "we're already in NFC"	13:51	Copy link Message link Add to gist Remove
brrt	uhuh		Copy link Message link Add to gist Remove
jnthn	But NFC should afaiu be stable		Copy link Message link Add to gist Remove
	Such that if you compute NFD and then again compute NFC, you get the same thing back		Copy link Message link Add to gist Remove
	That's not happenign here		Copy link Message link Add to gist Remove
	*happening		Copy link Message link Add to gist Remove
nwc10	we're into "#11907 Looking for a compiler bug is the strategy of LAST resort. LAST resort." ?	13:52	Copy link Message link Add to gist Remove
jnthn	Not yet		Copy link Message link Add to gist Remove
	I need to go look at our NFD -> NFC	13:53	Copy link Message link Add to gist Remove
	But NFC is defined in terms of NFD		Copy link Message link Add to gist Remove
13:57 rarara_ joined
	brrt wonders what the current state of the art is in rubyland, since ruby may be even more comparable to perl6 in terms of indirections	13:58	Copy link Message link Add to gist Remove
jnthn	Well, I can't find a way we're inconsistent with the actual Unicode data files	14:07	Copy link Message link Add to gist Remove
	So yeah, it's very odd. I have a case where a string passes the NFC QuickCheck, but actually computing NFC on that string doesn't give identity	14:16	Copy link Message link Add to gist Remove
nwc10	use more 'coffee'; ?	14:17	Copy link Message link Add to gist Remove
jnthn	And it's a one-char string so I don't think I could be getting the use of NFC_QC wrong.		Copy link Message link Add to gist Remove
14:37 tokuhiro_ joined
jnthn	OK, seems we have something wrong in our canonical composition	14:40	Copy link Message link Add to gist Remove
	Yeah, nailed it I think	14:56	Copy link Message link Add to gist Remove
brrt	that was fast	14:57	Copy link Message link Add to gist Remove
jnthn	The number of characters added by patch to time spent ratio is pretty awful :P	14:59	Copy link Message link Add to gist Remove
brrt	aw, there goes your enterprise points	15:00	Copy link Message link Add to gist Remove
jnthn	:P	15:01	Copy link Message link Add to gist Remove
	brrt has seen a lot of articles lately about how COBOL was making a comeback	15:03	Copy link Message link Add to gist Remove
	maybe we should have an Inline::COBOL		Copy link Message link Add to gist Remove
	or just port moar to COBOL for enterprise points		Copy link Message link Add to gist Remove
jnthn	If your goal is -Osalary, COBOL may well be one of the best languages to learn :)	15:05	Copy link Message link Add to gist Remove
dalek	arVM: 5ff3001 \| jnthn++ \| src/strings/normalize.c: Fix a canonical composition bug. We didn't admit various starter/starter composition cases. This bug actually managed to survive despite us passing the complete Unicode normalization test suite, because we never hit this code path before thanks to the NFC_QC property. Now, thanks to NFG_QC, we can hit it in some more cases (this does perhaps point to a future optimization). Fixes a spectest regression.	15:09	Copy link Message link Add to gist Remove
brrt	i don't even...	15:11	Copy link Message link Add to gist Remove
jnthn	:)	15:13	Copy link Message link Add to gist Remove
brrt	understand how or why that makes a difference :-)	15:14	Copy link Message link Add to gist Remove
jnthn	brrt: Sometimes, two characters that are not combining chars are composed into a single char when canonicalizing.	15:15	Copy link Message link Add to gist Remove
	brrt: And the bug I just fixed meant we didn't let that happen.	15:16	Copy link Message link Add to gist Remove
	Now I'm down to one affected test file	15:17	Copy link Message link Add to gist Remove
	Bizzarely, S32-io/IO-Socket-INET.t		Copy link Message link Add to gist Remove
	m: say uniname(0xbeef)	15:21	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 3cc195: OUTPUT«<Hangul Syllable>␤»		Copy link Message link Add to gist Remove
jnthn	haha		Copy link Message link Add to gist Remove
	m: say uniname(0xbabe)		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 3cc195: OUTPUT«<Hangul Syllable>␤»		Copy link Message link Add to gist Remove
jnthn	Pro tip: when writing tests and wanting some random "Unicode character", don't just spell cute words :)	15:22	Copy link Message link Add to gist Remove
	Otherwise you might (or in this case, will!) end up with something that will, under NFG, end up combining with the previous grapheme.		Copy link Message link Add to gist Remove
brrt	no, i stil don't get it	15:37	Copy link Message link Add to gist Remove
	but i'll accept that for now	15:38	Copy link Message link Add to gist Remove
jnthn	brrt: I only get it in so far as "I read the Unicode spec and know what the terms mean"		Copy link Message link Add to gist Remove
	I don't know anything about the Balianese language and the specific thing that's going on with these chars.	15:39	Copy link Message link Add to gist Remove
brrt	the NFG business seems similar in a way to the x86 instruction encoding business		Copy link Message link Add to gist Remove
	you think you fixed it, but no, something funny happens	15:40	Copy link Message link Add to gist Remove
	only in specific magical cases		Copy link Message link Add to gist Remove
jnthn	Well, this wsan't even an NFG bug, just an NFC one :)		Copy link Message link Add to gist Remove
brrt	fair enough :-)		Copy link Message link Add to gist Remove
jnthn	It's not that bad, tbh. It's just that humans are darn creative about their writing systems.		Copy link Message link Add to gist Remove
	Hangul is probably the biggest offender in terms of amount of code we have to write just for it.	15:41	Copy link Message link Add to gist Remove
nwc10	t/spec/S15-nfg/cgj.rakudo.moar	15:50	Copy link Message link Add to gist Remove
	TODO passed: 5-8		Copy link Message link Add to gist Remove
jnthn	Yup :)		Copy link Message link Add to gist Remove
	Oh man. The \r\n => 1 grapheme thing may be a bit fraught	16:12	Copy link Message link Add to gist Remove
nwc10	why so?		Copy link Message link Add to gist Remove
jnthn	Well, NQP can't even parse a simple test file with a \r\n in it any more	16:14	Copy link Message link Add to gist Remove
	And then the error handling code it uses to try and report that fails too	16:15	Copy link Message link Add to gist Remove
	And the REPL hangs	16:17	Copy link Message link Add to gist Remove
nwc10	step away from the keyboard, and make a curry?	16:18	Copy link Message link Add to gist Remove
	the "error reporting" thing sounds like a bug that needs fixing whatever else happens next.		Copy link Message link Add to gist Remove
jnthn	aye		Copy link Message link Add to gist Remove
	Ah	16:21	Copy link Message link Add to gist Remove
	One possible problem is that concatenation of a \r and \n doesn't produce the grapheme		Copy link Message link Add to gist Remove
dalek	arVM: f1a216d \| jnthn++ \| src/strings/nfg.c: Update concat code in prep for \r\n as grapheme.	16:28	Copy link Message link Add to gist Remove
jnthn	Aha	16:35	Copy link Message link Add to gist Remove
	say(?("\r\n" ~~ /\v/))	16:36	Copy link Message link Add to gist Remove
[Coke]	oho?		Copy link Message link Add to gist Remove
jnthn	That doesn't match		Copy link Message link Add to gist Remove
	So, that's almost certainly the source of the \r\n -> 1 grapheme bustage		Copy link Message link Add to gist Remove
	I suspect fixing this is going to need an NQP bootstrap updage	16:37	Copy link Message link Add to gist Remove
	*update		Copy link Message link Add to gist Remove
	But worse, it probably also brings up the issues with NFG and the NFA engine		Copy link Message link Add to gist Remove
TimToady	m: say "\x037e".ord.base(16)	16:45	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 3cc195: OUTPUT«3B␤»		Copy link Message link Add to gist Remove
jnthn	m: say uniname(0x037E)	16:46	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 3cc195: OUTPUT«GREEK QUESTION MARK␤»		Copy link Message link Add to gist Remove
TimToady	is there an explanation for why we lose track of GREEK QUESTION MARK?		Copy link Message link Add to gist Remove
jnthn	Almost certainly :)		Copy link Message link Add to gist Remove
	m: say uniprop(0x037E, 'Decomp_Spec')		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 3cc195: OUTPUT«003B␤»		Copy link Message link Add to gist Remove
jnthn	^^		Copy link Message link Add to gist Remove
	'cus Unicode says we should as part of normalization		Copy link Message link Add to gist Remove
	See singleton equivalence in unicode.org/reports/tr15/ for more info	16:47	Copy link Message link Add to gist Remove
TimToady	k		Copy link Message link Add to gist Remove
jnthn	(There's a handful of 'em)		Copy link Message link Add to gist Remove
[Coke]	TimToady: I thought we already said that on #perl6. my bad.	16:49	Copy link Message link Add to gist Remove
	jnthn: it makes us impervious to the "mess with your friend's code" meme that was going around recently.		Copy link Message link Add to gist Remove
jnthn	[Coke]: Nah, there's still plenty of other ways to do that :)	16:51	Copy link Message link Add to gist Remove
TimToady	jnthn: btw		Copy link Message link Add to gist Remove
	m: say uniprop(0x1B3D) # don't need to type General_Category every time		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 3cc195: OUTPUT«Mc␤»		Copy link Message link Add to gist Remove
TimToady	that's the default		Copy link Message link Add to gist Remove
jnthn	Darn, wish I'd know that earlier :P		Copy link Message link Add to gist Remove
	After today I probably don't need to look up gen cats again for another few months :P		Copy link Message link Add to gist Remove
	TimToady: I assume you want us to end up with \r\n as a synthetic so we totally follow the Unicode grapheme cluster rules? :)	16:53	Copy link Message link Add to gist Remove
TimToady	unless there's some showstopper reason we can't		Copy link Message link Add to gist Remove
jnthn	Not that I see so far, it's just a bit of hunting down the badass umptions in the code... :)	16:54	Copy link Message link Add to gist Remove
TimToady	at least it makes it easier to match \n in regex :)		Copy link Message link Add to gist Remove
jnthn	Yeah		Copy link Message link Add to gist Remove
TimToady	and certainly \v should match it too		Copy link Message link Add to gist Remove
jnthn	For sure; just patched that locally	16:55	Copy link Message link Add to gist Remove
	Though it's a hack :/		Copy link Message link Add to gist Remove
	And will break LTM of \V until I more generally fix the NFG/NFA interaction		Copy link Message link Add to gist Remove
	The NFA design we inherited is rather into doing nqp::ord		Copy link Message link Add to gist Remove
	And so loses synthetics		Copy link Message link Add to gist Remove
	Not a huge engineering problem to fix, I don't think. Just another thing to do.	16:56	Copy link Message link Add to gist Remove
TimToady	though even if we have an nqp::gord or so, we're still in trouble if we go with per-string or per-domain tables	16:57	Copy link Message link Add to gist Remove
jnthn	Indeed. I don't want to go that way.		Copy link Message link Add to gist Remove
	Better to actually pass one-char strings.		Copy link Message link Add to gist Remove
	At the "API"		Copy link Message link Add to gist Remove
	(We can still keep it all integers on the inside at actual matching time)	16:58	Copy link Message link Add to gist Remove
TimToady	let's keep doing it right, and then think about optimization		Copy link Message link Add to gist Remove
jnthn	Well, we're not doing it right yet...but yeah.		Copy link Message link Add to gist Remove
TimToady	*righter		Copy link Message link Add to gist Remove
jnthn	Otherwise, I think the NFG algo tweaks have turned out OK.	16:59	Copy link Message link Add to gist Remove
TimToady	any feel on input performance degradation?		Copy link Message link Add to gist Remove
17:00 tokuhiro_ joined
TimToady	presumably shouldn't be much if most of the file is quick-reject ASCII	17:00	Copy link Message link Add to gist Remove
jnthn	Yeah, it's a little slowdown for ASCII 'cus we have to care about \r now	17:01	Copy link Message link Add to gist Remove
TimToady	well, except insofar as \r\n is ... yeah		Copy link Message link Add to gist Remove
jnthn	And we're more careful over controls		Copy link Message link Add to gist Remove
	When we have to do the full analysis it's more costly		Copy link Message link Add to gist Remove
	But I computed us an NFG quickcheck property		Copy link Message link Add to gist Remove
	So we should only be doing the hard work when we really need to	17:02	Copy link Message link Add to gist Remove
	m: say uniprop('x', 'NFG_QC')		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 3cc195: OUTPUT«0␤»		Copy link Message link Add to gist Remove
jnthn	Ah, build is behind		Copy link Message link Add to gist Remove
	But yeah, we leak that property to userspace as if it was a normal Unicode property, when it's in fact one we've made up		Copy link Message link Add to gist Remove
	Dunno if that bothers you.	17:03	Copy link Message link Add to gist Remove
TimToady	not much		Copy link Message link Add to gist Remove
jnthn	k		Copy link Message link Add to gist Remove
	Ended up with all the control chars being NFG terminators, btw.	17:04	Copy link Message link Add to gist Remove
TimToady	though if the UC adopts the "NFG" term we could eventually get a name collision, but so far they seem to prefer Normalization_Form_Grapheme and such		Copy link Message link Add to gist Remove
jnthn	Which was something you suggested before.		Copy link Message link Add to gist Remove
	Well, they do call their quickcheck properties NFC_QC for example	17:05	Copy link Message link Add to gist Remove
TimToady	well, hopefully then they won't adopt it and flip the sense :)		Copy link Message link Add to gist Remove
jnthn	If they did, they'd make it inconsistent with how the other quickcheck properties work		Copy link Message link Add to gist Remove
	And they don't seem that crazy. :)		Copy link Message link Add to gist Remove
	Or at least, no more crazy than you have to be to try and bring some order to the world's writing systems...	17:06	Copy link Message link Add to gist Remove
TimToady	in the backlog: "never hit this codepath"	17:08	Copy link Message link Add to gist Remove
	do we have any plans for a code coverage tool?		Copy link Message link Add to gist Remove
	so we can tell if there are glaring blind spots in roast?	17:09	Copy link Message link Add to gist Remove
jnthn	Well, a user-level thing "not yet"		Copy link Message link Add to gist Remove
	Do we have the tech to hack something up to tell us where our roast blind spots are? That's easier.	17:10	Copy link Message link Add to gist Remove
	The cross-thread write logging and the profiler use bytecode instrumentation, and it's easy enough to write extra ones of those		Copy link Message link Add to gist Remove
TimToady	well, thinking setting-level mostly there		Copy link Message link Add to gist Remove
jnthn	My Big Plan is to turn the instrumentation stuff into a kind of meta-interpreter framework so you can write stuff like profilers and coverage tools and debuggers in NQP or Perl 6 code	17:11	Copy link Message link Add to gist Remove
	But that's not going to happen this side of 6.c	17:12	Copy link Message link Add to gist Remove
	Anyway, I can do a couple-of-hours hack solution to get an approximate answer to "what is roast not covering"	17:15	Copy link Message link Add to gist Remove
	And maybe it'll be inspiration for somebody to go and make a good one :)		Copy link Message link Add to gist Remove
	OK, I've got NQP patches that get all but one of the NQP tests passing	17:17	Copy link Message link Add to gist Remove
	(With \r\n as a grapheme)	17:18	Copy link Message link Add to gist Remove
	It'll need a rebootstrap, alas	17:19	Copy link Message link Add to gist Remove
	Hm, and it doesn't make it all the way through the Perl 6 build either.	17:20	Copy link Message link Add to gist Remove
TimToady	we have 5 comments in nqp that mention things to change after a rebootstrap :)	17:22	Copy link Message link Add to gist Remove
	which I'm sure were put there at least one reboot ago...		Copy link Message link Add to gist Remove
jnthn	:)	17:23	Copy link Message link Add to gist Remove
	Time for me to go cook us something tasty :)		Copy link Message link Add to gist Remove
	jnthn is happy to have this bit of the NFG work nearly done	17:26	Copy link Message link Add to gist Remove
	Guess I need to worry about the threads/IO things next week		Copy link Message link Add to gist Remove
	Well, various I/O things...	17:27	Copy link Message link Add to gist Remove
	Anyway, away for a bit		Copy link Message link Add to gist Remove
18:06 kjs_ joined 18:17 tokuhiro_ joined 18:18 zakharyas joined 18:28 leont joined 18:40 FROGGS joined 18:42 vendethiel joined 19:24 kjs_ joined 19:54 tokuhiro_ joined 19:55 rarara_ joined 20:10 kjs_ joined 20:37 kjs_ joined 21:19 kjs_ joined 21:33 zakharyas joined 22:20 tokuhiro_ joined

Please report any issues / comments / feature requests as an issue on App::Raku::Log.

Thank you!