#perl6-dev on 9 October 2016 - Raku Programming Language Log

dalek	p: cea29a3 | (Zoffix Znet)++ | docs/ops.markdown: Fix broken example	00:02
	ast: 13a41ba | usev6++ | S03-metaops/reduce.t: Unfudge passing tests on JVM (RT #126899)	06:19
synopsebot6	Link: rt.perl.org/rt3//Public/Bug/Displa...?id=126899
FROGGS	o/	06:59
bartolin	\o	07:19
[Tux]	This is Rakudo version 2016.09-148-g6977b87 built on MoarVM version 2016.09-34-g082c989	07:55
	csv-ip5xs 3.207
	test 16.439
	test-t 7.390
	csv-parser 19.802
dalek	p: 1a37f62 | FROGGS++ | tools/build/MOAR_REVISION: bump moar for num->bigint fix	11:02
	kudo/nom: 228cbc3 | FROGGS++ | tools/build/NQP_REVISION: bump nqp/moar for num->bigint fix	11:03
lizmat	Files=1146, Tests=53276, 220 wallclock secs (13.30 usr 3.87 sys + 1339.82 cusr 126.48 csys = 1483.47 CPU)	11:28
psch	hm, NQP_VERBOSE_EXCEPTION makes this leaking unwind another bit less comprehensible :l	11:39
yoleaux2	8 Oct 2016 23:49Z <Zoffix> psch: we do have the .tell bot. It just got stuck behind the netsplit
psch	apparently the EX_CAT_RETURN leaks from the &last call itself
	which is weird, i think? because, should we even return from there when we throw a CX::Last in the first place..?
	r: say +(gather do for ^4 { .take; last if $++; CONTROL { default { .perl.say; $_ } } }).cache;	11:49
camelia	rakudo-moar 228cbc, rakudo-jvm 2a1605: OUTPUT«CX::Take.new␤CX::Take.new␤CX::Take.new␤CX::Take.new␤0␤»
psch	everything is weird /o\
	r: say +(gather do for ^4 { .take; last if $++; CONTROL { default { .perl.say; .rethrow } } }).cache;
camelia	rakudo-jvm 2a1605: OUTPUT«CX::Take.new␤Error in socket connection:org.perl6.nqp.runtime.UnwindException␤ at org.perl6.nqp.runtime.ThreadContext.<init>(ThreadContext.java:125)␤ at org.perl6.nqp.runtime.GlobalContext.getCurrentThreadContext(GlobalContext.java:340)␤ at org.perl…»
	..rakudo-moar 228cbc: OUTPUT«CX::Take.new␤Trying to unwind over wrong handler␤»
psch	r: say +(gather do for ^4 { .take; last if $++; CONTROL { when CX::Last { .perl.say; .rethrow } } }).cache;	11:54
camelia	rakudo-moar 228cbc: OUTPUT«Trying to unwind over wrong handler␤»
	..rakudo-jvm 2a1605: OUTPUT«Error in socket connection:org.perl6.nqp.runtime.UnwindException␤ at org.perl6.nqp.runtime.ThreadContext.<init>(ThreadContext.java:125)␤ at org.perl6.nqp.runtime.GlobalContext.getCurrentThreadContext(GlobalContext.java:340)␤ at org.perl6.nqp.runtime.G…»
psch	i mean, is that such a weird thing to do..?
dogbert17	o/ anyone there who can answer a valgrind question or has everyone grown tired of them?	12:38
	there seem to be files with the extension '.S', what kind of files are those?	12:39
moritz	dogbert17: assembler code	12:42
	(before running them through the preprocessor)	12:43
dogbert17	moritz: thanks	12:45
	the thing is that sometimes when valgrinding spectests I get the error 'Syscall param write(buf) points to uninitialised byte(s)' and then it looks like valgrind restarts	12:56
	if I rerun the test where this happened the error is gone, just wondering if it's something which should be reported?	12:57
	if anyone feels like giving it a once over a gist can be found here: gist.github.com/dogbert17/3f9cfcee...9ff49134ae	13:00
Zoffix	\o :)	15:27
cowens	Okay, I don't want to be an asshole about this, so you can tell me to shut up about it and I will. But I read the discussion about my issues with the Str type being stored as NFC and I wanted a chance to clarify some points.
Zoffix	Sure, go ahead. Don't worry about being an asshole.
timotimo	yup, go ahead, it's fine	15:28
	though i'm AFK as of right now
cowens	I saw unicode.org/faq/normalization.html#1 being used as justification for throwing away user's data. That FAQ covers comparison not storage.
psch	NFG is not NFC, for the record	15:29
cowens	I completely agree that "e\x[0301]" and "e\xe9" should compare as the same
nine	cowens: I don't know if you've read the channel log. So, in short the sentiment seems to be: yes, there are use cases dealing with broken systems where the normalization hurts and those cases need better support. The default and priority however should be to support people who want to deal with texts.	15:30
	cowens: when you talk about "throwing away data", you mean the exact representation of characters at a byte level. Whether this representation actually represents data could be disputed. For many people and many cases it won't matter. The actual data is the text, whether it's represented as bytes in UTF-8 encoded Unicode on a hard drive, or spoken, or written down by hand.	15:32
FROGGS	cowens: read this and the following few lines: irclog.perlgeek.de/perl6-dev/2016-...i_13363590
	cowens: TL;DR: it might not be "coming soon", but it will come some day	15:33
nine	cowens: in other cases, it will of course matter if the exact same representation is still available. The question is, how common this really is.
psch	uhm, this feels a bit like "ganging up" to me? i was under the impression cowens was referring to exactly that linked discussion
	and, just maybe, we should wait for the clarification?	15:34
	FROGGS waits :o)
cowens	Sorry, I lost connection somehow	15:35
	Yes I read those logs
	The Unicode document being used to justify forcing NFC does not recommend storing the data as NFC	15:37
	it recommends comparing the data as the NFC
	it has a FAQ that even covers whether or not you should normalize unnormalized data automatically	15:38
nine	If I understood jnthn++ correctly, we store strings in NFG, because it makes accesses O(1) instead of O(n). Something that will matter to many users.
cowens	and it says to make it behave like it is in NFC, but it does not say to normalize it
	amortized O(1) indexing can be achieved without throwing away data	15:39
FROGGS	by duplicating data?
cygx	cowens: have you seen www.reddit.com/r/perl/comments/557...ks/d8jlxji ?	15:40
nine	cowens: did you get my remarks about "throwing away data" before you lost connection?
cygx	in a perfect Unicode-aware world, the Perl6 behaviour is ok
	the problem is that the p6 story is still lacking in the non-perfect world we live in	15:41
cowens	There will never be a perfect Unicode-aware workd
	world
	And even in such a world, the string "e\x[301]" is a valid string.	15:42
	but it isn't in Perl 6
FROGGS	m: say "e\x[301]"	15:43
camelia	rakudo-moar 228cbc: OUTPUT«é␤»
cygx	cowens: semantically, p6 thinks of strings not as codepoint sequences, but equivalence classes of such
	that's perfectly ok in my book as long as we still have a good story for the real-world problems	15:44
	cygx is going to gist something
FROGGS	well, usually you are not interested in codepoints either
psch	a Str consists of graphemes, an Uni consists of codepoints
FROGGS	I think the exact bytes of source and NFG strings are interesting
cowens	Let me approach this from a different angle.
psch	so, yes, Uni is what wouldn't ever throw away data, and it's underimplemented
	but a Str is about what can be read on the screen	15:45
	that's how i understand jnthn++'s explanation yesterday
cowens	Is there a reason to discard the user's data for strings? Shouldn't the only goal be treating comparisons as equal?
cygx	psch: from what I can gather, jnthn prefers generating synthetic codepoints over Uni
psch	cygx: but that's what Str does, isn't it?
nine	cowens: I already wrote that reason?
	cowens: I start to think, you're not getting my messages, as you don't react to them in any way :/	15:46
cowens	which was what? O(1) indexing? You don't need to throw away data for that
nine	Does anyone else read me?
FROGGS	nine: I do :o)
nine	cowens: and how not?
cowens	simplest implementation would be an array and a sparse array
cygx	psch: generating synthetics for non-canonical codepoint sequences which can then be round-tripped
cowens	one code point graphemes go in the array	15:47
psch	cygx: i have no idea what you are trying to tell me, sorry :)
FROGGS	cowens: problem is when you split texts or do other stuff, you'd have to operate on two "strings", rather than one
cowens	multi code point graphemes are stored in the array as a sentinel value
	then stored in the sparse array at that point.	15:48
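The array plus sparse array layout cowens describes might look roughly like the toy model below. It is a Python sketch under loud assumptions: the class name `GraphemeString` and the `SENTINEL` value are hypothetical, and grapheme segmentation is simplified to "a base character plus any following combining marks" instead of the full Unicode segmentation rules.

```python
import unicodedata

SENTINEL = -1  # hypothetical marker meaning "look in the sparse side table"

class GraphemeString:
    """Toy model: a dense array of graphemes plus a sparse side table."""

    def __init__(self, text: str):
        self.cells = []   # one entry per grapheme: a code point or SENTINEL
        self.multi = {}   # grapheme index -> original tuple of code points
        cluster = []
        for ch in text:
            # Simplification: a non-combining character starts a new grapheme.
            if cluster and unicodedata.combining(ch) == 0:
                self._flush(cluster)
                cluster = []
            cluster.append(ord(ch))
        if cluster:
            self._flush(cluster)

    def _flush(self, cluster):
        if len(cluster) == 1:
            self.cells.append(cluster[0])
        else:
            self.multi[len(self.cells)] = tuple(cluster)
            self.cells.append(SENTINEL)

    def __len__(self):
        return len(self.cells)

    def grapheme(self, i: int) -> str:
        """One list index plus at most one dict lookup."""
        cell = self.cells[i]
        codes = self.multi[i] if cell == SENTINEL else (cell,)
        return "".join(map(chr, codes))

    def __str__(self):
        return "".join(self.grapheme(i) for i in range(len(self)))

s = GraphemeString("re\u0301sume\u0301")
print(len(s), [hex(ord(c)) for c in s.grapheme(1)])
```

The original code points survive a round trip through `str(s)`. Any slicing or splitting would have to carry the matching side-table entries along, which is the cost FROGGS raises above.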
	Split should work on the grapheme level in Str
FROGGS	the implementation of Str is meant to be fast, and you cant be fast when you have two copies of the string data to work with
cowens	You have to have a lookup table now don't you?	15:49
FROGGS	to split something? no
psch	m: say Uni.new(0x65, 0x301) [&($_)] Uni.new(0xe9) for &[==], &[eq]
camelia	rakudo-moar 228cbc: OUTPUT«False␤True␤»
cowens	You don't in that array + sparse array either	15:50
	I mean to print the data back out
FROGGS	to print a Str we turn it back to NFC or something, yes	15:51
	but still, all string operations deal with a single stream of one element things
nine	cowens: split right now can work on an array of 32 bit ints where each element is a grapheme. Same as all other string operations. I don't see how you could achieve the same performance when using two data structures? Wouldn't that at least give you worse cache locality, not to speak of the additional operations necessary?
cowens	Hey, I was talking about O(1) which is what I saw people talking about.	15:52
	Yeah, it won't be as performant, but it would be better than Uni and it wouldn't throw away data	15:53
	Given that the choice was to go with Rat over floating point as the default, I assumed Perl 6 was supposed to favor integrity over speed.	15:54
nine	cowens: but now we're talking about trade offs. You want to make all string operations in Perl 6 slower for covering some use cases. That's totally ok, but you got to admit, that as with all judgement calls, opinions may differ :)
psch	i don't think Rat is about integrity over speed	15:55
	Rat is about doing the obviously right thing
	because, well, 0.1 + 0.2 is equal to 0.3
	and in the same vein, é is equal to, well, the other one that also looks like é
nine	Which for texts means keeping the integrity of texts, not necessarily the integrity of byte code level representations of texts.
psch	(i'm not sure which of those two i'm typing with my locale, hence my phrasing :) )	15:56
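psch's Rat example can be reproduced outside Perl 6. The sketch below uses Python's `fractions.Fraction` as a stand-in for Rat, which is an assumption made for illustration; it only contrasts exact rational arithmetic with binary floating point.

```python
from fractions import Fraction

# Binary floating point cannot represent 0.1, 0.2 or 0.3 exactly.
float_equal = (0.1 + 0.2 == 0.3)

# Exact rationals, standing in for Perl 6's Rat.
rat_equal = (Fraction(1, 10) + Fraction(2, 10) == Fraction(3, 10))

print(float_equal, rat_equal)
```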
cowens	Well, my file contains "re\x[301]sume\x[301]" the obvious thing to happen when I append "Chas's" with Perl is that it should be "Chas's re\x[301]sume\x[301]"
FROGGS	cowens: why do you need the original bytes ooc?	15:57
cowens	Many reasons: legacy systems, audit controls, etc.
cygx	rfc: gist.github.com/cygx/b545c206a0f7c...4b26afccf6
cowens	If I round trip a file in Perl 6 strings, it might not get the same md5sum even if nothing changed	15:58
FROGGS	cowens: well, if you just want to write it from a to b without modification and you know that checksums are involved then perhaps use Buf	15:59
	I mean, for now at least
cowens	It might not have no modification
FROGGS	what I usually do with text is interaction with humans
psch	Buf will not normalize
	and they dont care about byte orderings	16:00
cowens	say you have a process that needs to change names in a file
	but if the names aren't present, no changes should be made
FROGGS	cowens: Buf is meant to be string like
	having subbuf instead of substr etc
cowens	the typical way of doing that is a filter
FROGGS	but I guess not being able to regex match can be a showstopper	16:01
cowens	Perl 6 changes things even when the user didn't ask for them to be changed if he or she uses a string
psch	except Perl 6 doesn't write that if you don't tell it to?
FROGGS	yes, and in >95% of the cases the user wont care
cowens	Also, Buf is bytes. I want to work at the grapheme level
psch	i mean, are you memmapping the file..?
FROGGS	but (s)he will be more happy with faster string ops
cowens	But I don't want working at the grapheme level to force me into normalized data	16:02
	that goes back to the Rat vs floating point argument
	I don't mind there being an optional lossy string type
	that is faster
psch	that's not the Rat vs floating point argument	16:03
cowens	But the default being lossy is very surprising
psch	that's the reason why the Rat vs floating point argument was made
	we pick Rat because it does the right thing with the representation
cowens	NFC isn't the right thing for strings
	It is an optional thing for strings	16:04
psch	if you cannot visually distinguish them, they are identical
	that is what text means
	i understand that you disagree with assertion
	but it's the one the core devs arrived at, and why Str defaults to NFG
	*with this assertion
cygx	what psch said: any system that does treat canonically equivalent text as different is arguably broken
timotimo	psch: so I and l are the same thing? :P
cygx	as there are a lot of broken systems, you need a good fallback story	16:05
psch	timotimo: they look different here :)
cygx	Perl6 currently lacks such a story
psch	cowens: now, the concern that absolute byte level integrity is hard currently is absolutely valid
timotimo	i can visually distinguish the one ê from the other ê by looking at the bytes in the file with a hex editor :P :P
FROGGS	cowens: there will be a solution... but it most likely wont be the default
cowens	Yeah, I can tell that there is some cool aid I haven't drunk	16:06
cygx	FROGGS: it's entirely possible to echive something like open(:compat) and
psch	oh come on
cygx	*achieve
FROGGS	yes, or have a pragma or something else
cygx	...and have it do the mostlyright thing
psch	"your arguments are bad because i don't like the tradeoff" isn't reasonable
FROGGS	or a mixed in role that keeps the original bytes or a string
cowens	psch that cuts both ways	16:07
FROGGS	there are several ways of doing it, and I'm not the one proposing a good solution here :o)
cowens	Performance is being given more importance than correctness for strings, but not for numbers
	That is the trade off. And it is very odd to outsiders	16:08
Zoffix	But the strings are "correct". The trade off is in their representation.
cygx	cowens: but normalization is the correct approach in a perfect Unicode-aware world
FROGGS	cowens: do operations on stuff that cannot be normalized to a single codepoint in all the languages you know
cygx	(or rather treating equivalent things as equivalent)
cowens	No, the graphemes are correct, but the underlying data is lost
FROGGS	cowens: and then count these that dont split in the middle of a grapheme
Zoffix	The underlying data is bytes, so the string operations are correct :)	16:09
psch	but the graphemes are the relevant data in a Str
	if you care about bytes, use Buf
	or maybe Uni, eventually
Zoffix	And once we have user-land encoders, I imagine this type of stuff would be even more convenient, no?
psch	or maybe some up-and-coming prgama
	*pragma
FROGGS	strings in general are not about bytes... they are about visible characters, whatever that means
	psch is getting too heated	16:10
	sorry, clinking myself out here :S
FROGGS	yeah, a break sounds sane :o)
	dinner &
cygx	FROGGS: user-perceived characters as approximated by equivalent grapheme clusters, if you want to get technical
	;)	16:11
cowens	On a more positive note: should all of the string functions be in stringy?
Zoffix	Don't think so, considering:
	m: say Buf ~~ Stringy
camelia	rakudo-moar 228cbc: OUTPUT«True␤»
cowens	Should string functions for Uni be on the grapheme level, code point level, or take an adverb with a default
Zoffix	And, I mean, .match is a string function, etc	16:12
	I'd imagine on grapheme level, though I'm not familiar with Uni.
timotimo	there's still the possibility of giving every grapheme that would be changed by normalization a synthetic that writes back what it once was. problem solved forever.
cygx	generic functions should operate on codepoints, string-specific ones might want to aoerce
timotimo	that will, however, make things like string comparison a bit more complicated	16:13
cowens	Yeah, that was one of my O(1) solutions
cygx	*coerce
cowens	But it has complexity attack issues and isn't as performant
timotimo	i don't really remember what the particular arguments against that were that were brought forth
cowens	What does NFG do when it runs out of synthetics?	16:14
cygx	die because you've exhausted your RAM ;)
timotimo	at the moment we don't have a mechanism for that, but we can GC through existing strings and throw out unused synthetics
	but yeah, you'll need a lot of input data to get to that point, and later you'll need all of that data to stick around in memory so GC won't free it up	16:15
cowens	So, the table is interpreter level, not string level?
timotimo	that's right
	we have a lock-free datastructure that does that stuff
	i haven't looked closely at that yet
cowens	I had assumed string level because of a comment in the docs, but I guess you just reserve the right to have it at string level
timotimo	btw, we currently have an entirely real attack based on NFG, compared to the pretty difficult-to-pull-off attack of exhausting the table space	16:18
	m: say "a" ~ "\c[COMBINING ACUTE]" xx 100000000
camelia	rakudo-moar 228cbc: OUTPUT«===SORRY!=== Error while compiling <tmp>␤Unrecognized character name COMBINING ACUTE␤at <tmp>:1␤------> say "a" ~ "\c[COMBINING ACUTE⏏]" xx 100000000␤»
timotimo	.u combining
yoleaux2	U+0300 COMBINING GRAVE ACCENT [Mn] (◌̀)
	U+0301 COMBINING ACUTE ACCENT [Mn] (◌́)
	U+0302 COMBINING CIRCUMFLEX ACCENT [Mn] (◌̂)
timotimo	m: say "a" ~ "\c[COMBINING GRAVE ACCENT]" xx 100000000
camelia	rakudo-moar 228cbc: OUTPUT«Memory allocation failed; could not allocate 800000000 bytes␤»
timotimo	oh, hehe
	m: say "a" ~ "\c[COMBINING GRAVE ACCENT]" xx 10000
camelia	rakudo-moar 228cbc: OUTPUT«à ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀…»
timotimo	no, actually, that won't do it
	m: say "a\c[COMBINING GRAVE ACCENT]" ~ "\c[COMBINING GRAVE ACCENT]" xx 10000	16:19
camelia	rakudo-moar 228cbc: OUTPUT«à̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀ ̀…»
timotimo	how do i ...
	probably have to go via Uni?
	anyway. trying to add that kind of thing to our trie datastructure exhausts the C stack
	FROGGS thinks that synthetics have to be kept at interpreter level because otherwise you could not compare strings	16:22
timotimo	you can, you just have to compare synthetics that you come across
	if we have different synthetics for the same character but different normalizations, we'd have to do some extra work anyway	16:23
cowens_	So, if I were to start implementing string functions for Uni, where should I put them? Should we have parallel functions in the different classes (Buf/Uni/Str)?
FROGGS	how so? their made up negative number certainly wont be the same
timotimo	i meant compare the underlying data for the synthetics when their indices don't match
	and even if the indices match, if the table is per-string, same index doesn't mean same character
FROGGS	well, that'd be slower again :o)
	aye	16:24
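The interpreter-level synthetic table discussed above can be modelled with a toy sketch in Python. All names here (`SyntheticTable`, `TABLE`, `to_graphemes`) are hypothetical. The real MoarVM structure is described above only as a lock-free trie, so this dict-based version is an assumption made for illustration, and grapheme segmentation is again simplified to base character plus combining marks.

```python
import unicodedata

class SyntheticTable:
    """Toy shared table: multi-code-point grapheme <-> negative integer."""

    def __init__(self):
        self.by_codes = {}  # tuple of code points -> synthetic (negative int)
        self.by_id = []     # position n holds the codes for synthetic -(n + 1)

    def intern(self, codes):
        if codes not in self.by_codes:
            self.by_id.append(codes)
            self.by_codes[codes] = -len(self.by_id)
        return self.by_codes[codes]

    def lookup(self, synthetic):
        return self.by_id[-synthetic - 1]

TABLE = SyntheticTable()  # one table shared by every string

def _emit(out, cluster):
    if len(cluster) == 1:
        out.append(cluster[0])
    elif cluster:
        out.append(TABLE.intern(tuple(cluster)))

def to_graphemes(text):
    """Return one int per grapheme: a code point or a negative synthetic."""
    out, cluster = [], []
    for ch in unicodedata.normalize("NFC", text):
        if cluster and unicodedata.combining(ch) == 0:
            _emit(out, cluster)
            cluster = []
        cluster.append(ord(ch))
    _emit(out, cluster)
    return out

a = to_graphemes("7\u0308")    # no precomposed form exists, so a synthetic
b = to_graphemes("x7\u0308")
print(a, b, TABLE.lookup(a[0]))
```

Because the table is shared, the same grapheme gets the same negative number in every string, so equality stays a plain integer comparison. With per-string tables, or with synthetics that preserve the original normalization, a comparison would have to inspect the underlying code points, as timotimo notes.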
timotimo	honestly, i'd be okay with a flag on strings that says "uses normalization-conserving synthetics"
	and only compare underlying synthetic data if that flag is set on one or both of the strings	16:25
	and we'd probably also want some kind of support for strings with different normalizations in code as literals
cowens_	I think S15 covers that
	qq:nfd"e\x[301]"	16:26
timotimo	ah
	i was thinking of a way to write "this string has mixed normalizations in it"
cowens_	That is called Uni	16:27
	Based on what people have been saying
timotimo	not easy to type, though :)
cowens_	Oh, yeah, I hate that syntax
timotimo	well, you can type it Uni.new(<1 2 3 4>), can't you?
cygx	"foo{ Uni.new(...).non-normalized-synthetic-Str }bar"
cowens_	Yeah, that is even worse	16:28
timotimo	if you just call it .nnsS ... :P
cowens_	Or .SaneStr (half kidding)	16:29
timotimo	should we consider people would expect code like 'say "asdf"' piped through xxd to have the same bytes as the part between the " in the code?
cowens_	Seriously, If I start implementing string functions for Uni, where should I put them: in Uni or stringy?
p3rln00b	m: 7 ~ "\x[308]" x 150_000
camelia	rakudo-moar 228cbc: OUTPUT«(signal SEGV)»
timotimo	for now, i'd just put them into Uni	16:30
	they can probably be moved up later, right?
	p3rln00b: thank you
cowens_	Should they share names with Str or have some sort of prefix?	16:31
	there was talk of subbuf instead of substr	16:32
timotimo	subuni? :P
cowens_	Yeah, that one is easy
	timotimo does the AFK-dance again
cygx	o/	16:35
cowens_	A parting thought: It feels to me like Str is changing "consumed" to "ate" and people are saying I shouldn't be upset because they mean the same thing.	16:42
p3rln00b	hah. Debugging something and dumping things and apparently we have something with 73 multi candidates for it :}	17:28	Copy link Message link Add to gist Remove
	(unless I'm misinterpreting my data)	17:29	Copy link Message link Add to gist Remove
	m: &postcircumfix:<[ ]>.candidates.elems.say	17:32	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 228cbc: OUTPUT«65␤»		Copy link Message link Add to gist Remove
	geekosaur is noodling something re strings...	17:44	Copy link Message link Add to gist Remove
	I kinda want to reflavor things. Str -> UI string, Uni -> unified string (may not be unicode any more), Buf8 -> octet string. files and streams are Buf8 when raw or Uni when not. Uni lets you define ranges (including "from X until further notice") with encodings (and, in a more general implementation, encoding can include compression and/or encryption)	17:46	Copy link Message link Add to gist Remove
	also reinterpretations, as needed for IRC where the command framing is an ISO8859 but the payload may be a different ISO8859 or UTF8, or when an HTTP stream sends a Content-Type and/or Content-Encoding	17:47	Copy link Message link Add to gist Remove
	people kinda want to believe "string is string" but that has not been true since people finally realized the world is not US ASCII carrying US English	17:48	Copy link Message link Add to gist Remove
nine	English has never been ASCII only		Copy link Message link Add to gist Remove
geekosaur	...but the computer representation was	17:49	Copy link Message link Add to gist Remove
	basically the thing I'm thinking about is that perl 5 had IO layers on filehandles; that's the wrong place since stream interpretations need to change as they go. I want layers on streams, where a stream is a generalized notion of "string"	17:50	Copy link Message link Add to gist Remove
Zoffix	m: sub foo (int $x) { $x.say }; my Int $b = 2; foo $b;	17:54	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 228cbc: OUTPUT«2␤»		Copy link Message link Add to gist Remove
Zoffix	Would you say that's a bug?		Copy link Message link Add to gist Remove
lucasb	that's expected unboxing, no?	17:56	Copy link Message link Add to gist Remove
	otherwise 'my int $x = $an-Int' would also have to be a bug	17:57	Copy link Message link Add to gist Remove
Zoffix	Fair enough. What about this one:		Copy link Message link Add to gist Remove
	m: sub foo (Int $x) { $x.say }; my int $b = 2; foo $b;		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 228cbc: OUTPUT«2␤»		Copy link Message link Add to gist Remove
Zoffix	Would you say that is a bug?		Copy link Message link Add to gist Remove
lucasb	and that is expected boxing	17:58	Copy link Message link Add to gist Remove
geekosaur	auto(un)boxing should probably be a thing unless you want to go the strict typing route a la Haskell		Copy link Message link Add to gist Remove
Zoffix	Fair enough. Now, with the above in mind, which multi should this call; the native or the Int?: multi foo (Int $x) { $x.say }; multi foo (int $x) { $x.say }; foo 2;	17:59	Copy link Message link Add to gist Remove
	Or to mix it up: multi foo (Int $x, Int $y) {}; multi foo (int $x, int $y) {}; my int $b = 2; foo 2, $b;	18:01	Copy link Message link Add to gist Remove
geekosaur	if a multi matches the boxing state, use it, otherwise if a multi matches the type but different box/unbox, use it, otherwise if you have a coercion you can use, box and coerce.	18:02	Copy link Message link Add to gist Remove
	there might be a pragma for strict boxing if someone wants to write haskell in perl6		Copy link Message link Add to gist Remove
	java's had autoboxing for some time now, it's not an unexplored concept	18:03	Copy link Message link Add to gist Remove
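Since Java's autoboxing is cited as prior art, here is a minimal sketch of how Java itself resolves overloads when both a primitive and a boxed candidate exist. It is Java, not Perl 6, and it only illustrates Java's rule (exact boxing state first, boxing or unboxing only when nothing else applies); it makes no claim about what Rakudo does or should do. The class and method names are hypothetical.

```java
// Illustration only: Java overload resolution, not Perl 6 multi dispatch.
// All names here (BoxingDispatch, foo, boxedOnly, unboxedOnly) are made up.
class BoxingDispatch {
    // Candidate taking the unboxed (primitive) type.
    static String foo(int x) { return "int"; }

    // Candidate taking the boxed (object) type.
    static String foo(Integer x) { return "Integer"; }

    // Only a boxed candidate exists, so a primitive argument must be boxed.
    static String boxedOnly(Integer x) { return "Integer"; }

    // Only an unboxed candidate exists, so a boxed argument must be unboxed.
    static String unboxedOnly(int x) { return "int"; }

    public static void main(String[] args) {
        int unboxed = 2;
        Integer boxed = Integer.valueOf(2);

        System.out.println(foo(unboxed));        // candidate matching the primitive wins
        System.out.println(foo(boxed));          // candidate matching the object wins
        System.out.println(foo(2));              // a literal 2 is a primitive int in Java
        System.out.println(boxedOnly(unboxed));  // no exact match, so the value is autoboxed
        System.out.println(unboxedOnly(boxed));  // no exact match, so the value is unboxed
    }
}
```

Java tries candidates that need no boxing conversion first and only then considers boxing or unboxing, which is close in spirit to the "match the boxing state, otherwise box/unbox" ordering described above.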
Zoffix	Is literal 2 a "boxed state"?	18:05	Copy link Message link Add to gist Remove
geekosaur	maybe a restatement of that which-to-call: if you have an unboxed value, first try to match the unboxed type exactly. if no multi matches, box it and do normal multi resolution		Copy link Message link Add to gist Remove
	the one hard rule is that the only "coercion" available for an unboxed value is to box it		Copy link Message link Add to gist Remove
	that question makes me think I need to restate that again	18:06	Copy link Message link Add to gist Remove
	because that question does not make sense...		Copy link Message link Add to gist Remove
Zoffix	Well, is literal 2 "boxed"?	18:07	Copy link Message link Add to gist Remove
geekosaur	I am inclined to consider it boxed, and unbox it if needed		Copy link Message link Add to gist Remove
	mostly because in perl6 literal 2 can coerce to various types, and coercion is only possible for boxed values		Copy link Message link Add to gist Remove
Zoffix	OK	18:08	Copy link Message link Add to gist Remove
geekosaur	(yes, technically you can pretend otherwise but you're really just manipulating a separate box to do so)		Copy link Message link Add to gist Remove
lucasb	I think this question, whether a literal integer is native or not, is a very relevant question that should be answered in the FAQ	18:09	Copy link Message link Add to gist Remove
	I think a reasonable approach would be for it to be a native, if it fits the native range of values, otherwise it would be Int	18:10	Copy link Message link Add to gist Remove
Zoffix	I think that's already how it is in the optimizer.		Copy link Message link Add to gist Remove
lucasb	but this thinking doesn't work for strings, right? is a literal string a 'str' or Str?		Copy link Message link Add to gist Remove
	geekosaur: haskell has something like polymorphic constant, right? do you think this concept describes what we are talking about?	18:12	Copy link Message link Add to gist Remove
	I read about it in that book learn haskell for great good		Copy link Message link Add to gist Remove
geekosaur	yes and no. haskell also has strict typing and full type inference, so you can give a meaning to a polymorphic constant more easily	18:13	Copy link Message link Add to gist Remove
	"polymorphic constant" has to do something else in a perl6-like type world		Copy link Message link Add to gist Remove
lucasb	5 :: Int		Copy link Message link Add to gist Remove
	5 :: Integer		Copy link Message link Add to gist Remove
	5 :: Float		Copy link Message link Add to gist Remove
	this enforces the type in the constant, no?	18:14	Copy link Message link Add to gist Remove
	(in Haskell)		Copy link Message link Add to gist Remove
geekosaur	when you write "5" in Haskell, the compiler sees (as specified by the language definition): fromInteger (5 :: Integer)	18:15	Copy link Message link Add to gist Remove
	if you write "5 :: Int" this becomes "fromInteger (5 :: Integer) :: Int" so the instance (fromInteger :: Integer -> Int) is selected	18:16	Copy link Message link Add to gist Remove
lucasb	hmm, interesting. thanks for clarifying		Copy link Message link Add to gist Remove
geekosaur	one complication in the boxing/unboxing discussion is that in haskell an unboxed value is never polymorphic, because if you do not know the type statically you can't know what the value actually is --- or even how large it is.	18:19	Copy link Message link Add to gist Remove
	perl 6 "unboxed" values are actually boxed, in that sense	18:20	Copy link Message link Add to gist Remove
	but in an OO language "unboxed" means "not an object", not "does not have a type witness"	18:22	Copy link Message link Add to gist Remove
	so perl6 can actually do some polymorphism there... the problem being that coercions are object methods, so you still really want an object to do a coercion	18:23	Copy link Message link Add to gist Remove
nine	[</win 13	18:40	Copy link Message link Add to gist Remove
	2016.10 release is next weekend, isn't it?		Copy link Message link Add to gist Remove
Zoffix	Right	18:41	Copy link Message link Add to gist Remove
	NeuralAnomaly, status		Copy link Message link Add to gist Remove
NeuralAnomaly	Zoffix, [✘] Next release will be in 5 days and 9 hours. Since last release, there are 41 new still-open tickets (38 unreviewed and 0 blockers) and 149 unreviewed commits. See perl6.fail/release/stats for details		Copy link Message link Add to gist Remove
nine	Ok, then I won't bother submitting 2016.09 for openSUSE. Rakudo fails the tests on i586 and this issue should be fixed in 2016.10 anyway	18:42	Copy link Message link Add to gist Remove
dalek	p: af751d3 \| FROGGS++ \| tools/build/MOAR_REVISION: bump moar for better error messages	19:18	Copy link Message link Add to gist Remove
lizmat	m: sub a(&b) { b(); b() }; a sub{ say "foo" } # I'm surprised this works		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 228cbc: OUTPUT«foo␤foo␤»		Copy link Message link Add to gist Remove
lizmat	I'd expect an error because of the lack of whitespace between "sub" and "{"		Copy link Message link Add to gist Remove
dalek	kudo/nom: a92f092 \| FROGGS++ \| / (2 files): improve nativecall error messages, fixes RT #129353	19:19	Copy link Message link Add to gist Remove
synopsebot6	Link: rt.perl.org/rt3//Public/Bug/Displa...?id=129353		Copy link Message link Add to gist Remove
lizmat	m: sub c(\sub) { sub{"foo"} }; say c(my %h) # expected Any here	19:23	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 228cbc: OUTPUT«sub () { #`(Sub\|61604456) ... }␤»		Copy link Message link Add to gist Remove
lizmat	m: sub c(\sup) { sup{"foo"} }; say c(my %h) # any other name than "sub" does	19:24	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 228cbc: OUTPUT«(Any)␤»		Copy link Message link Add to gist Remove
lizmat	is this a bug or a DIHWIDT ?		Copy link Message link Add to gist Remove
geekosaur	I think unless a prototype is mandatory, it's not really possible to distinguish whether it's an anon sub or a hash?	19:26	Copy link Message link Add to gist Remove
	or you want to require a brace between sub and { to indicate an anon sub		Copy link Message link Add to gist Remove
	er, a space		Copy link Message link Add to gist Remove
lizmat	yeah, I expected the space to be mandatory, like lack of it is mandatory with calling foo()	19:27	Copy link Message link Add to gist Remove
	to distinguish itself from a hash lookup		Copy link Message link Add to gist Remove
Zoffix	ISAGN for nqp version of dd :/	20:51	Copy link Message link Add to gist Remove
	Spent about 30 compilations already trying to dump a simple array, just to be greeted with one error or another after parse stage heh	20:54	Copy link Message link Add to gist Remove
dalek	kudo/nom: f4a8a69 \| lizmat++ \| src/core/Str.pm: Looks like we don't need 'try' here any more First stab at making Str.match faster	21:19	Copy link Message link Add to gist Remove
	kudo/nom: 13f4798 \| lizmat++ \| src/core/CompUnit/RepositoryRegistry.pm: Don't use regexp based split for pathspec Use fast str based split, and just trim the whitespace fastly later, without needing to start up the whole regex engine. Found this when I broke Str.split with my work on Str.match	21:27	Copy link Message link Add to gist Remove
lizmat	this should have a visible effect on "make spectest" I hope		Copy link Message link Add to gist Remove
	good night, #perl6-dev	21:39	Copy link Message link Add to gist Remove
Zoffix__	night	21:41	Copy link Message link Add to gist Remove
Zoffix	m: sub foo (int $x) { $x.say }; foo 2	21:48	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 13f479: OUTPUT«2␤»		Copy link Message link Add to gist Remove
Zoffix	m: multi sub foo (int $x) { $x.say }; foo 2		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 13f479: OUTPUT«Cannot resolve caller foo(Int); none of these signatures match:␤ (int $x)␤ in block <unit> at <tmp> line 1␤␤»		Copy link Message link Add to gist Remove
Zoffix	^ adding to previous discussion. The latter is a bug then?		Copy link Message link Add to gist Remove
	m: multi sub foo (int $x) { $x.say }; foo my int $ = 2	22:04	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 13f479: OUTPUT«2␤»		Copy link Message link Add to gist Remove
Zoffix	m: multi sub foo (Int $x) { $x.say }; foo my int $ = 2		Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 13f479: OUTPUT«2␤»		Copy link Message link Add to gist Remove
Zoffix	m: multi foo (int $x) { $x.say }; my Int $x = 2; foo $x	22:05	Copy link Message link Add to gist Remove Run code
camelia	rakudo-moar 13f479: OUTPUT«Cannot resolve caller foo(Int); none of these signatures match:␤ (int $x)␤ in block <unit> at <tmp> line 1␤␤»		Copy link Message link Add to gist Remove
Zoffix	Stresstest is floppy as hell... 5 runs so far, only the first one succeeded. Different failures on each of the following runs :/	22:11	Copy link Message link Add to gist Remove
	t/spec/S05-substitution/subst.rakudo.moar (Wstat: 256 Tests: 183 Failed: 1)	22:16	Copy link Message link Add to gist Remove
	ZOFVM: Files=1194, Tests=129672, 137 wallclock secs (21.65 usr 3.14 sys + 2425.60 cusr 195.67 csys = 2646.06 CPU)	22:26	Copy link Message link Add to gist Remove
