#moarvm on 3 December 2024 - Raku Programming Language Log

Welcome to the main channel on the development of MoarVM, a virtual machine for NQP and Rakudo (moarvm.org). This channel is being logged for historical purposes. Set by lizmat on 24 May 2021.
Geth	MoarVM: MasterDuke17++ created pull request #1871: Add fast path when difference for 8-bit strings	03:13	Copy link Message link Add to gist Remove
09:40 sena_kun joined
Geth	MoarVM/main: 492e511f0d \| MasterDuke17++ (committed using GitHub Web editor) \| src/strings/ops.c Add fast path when difference for 8-bit strings If we're comparing 8-bit strings and there's a difference, we don't need to go through the generic grapheme-iterator path, since we know there won't be combining synthetics.	10:57	Copy link Message link Add to gist Remove
timo	hold up		Copy link Message link Add to gist Remove
lizmat	hold up?	10:58	Copy link Message link Add to gist Remove
timo	i was about to comment on this		Copy link Message link Add to gist Remove
lizmat	ah, ok ;-(		Copy link Message link Add to gist Remove
	I took nine's approval, and the code appeared simple enough		Copy link Message link Add to gist Remove
timo	8 bit grapheme storage, i.e. not "only in ascii range", includes synthetics		Copy link Message link Add to gist Remove
	we can't just say the lower of the synthetic codepoint is the lower one for real	10:59	Copy link Message link Add to gist Remove
	because those are allocated first-come-first-served		Copy link Message link Add to gist Remove
Geth	MoarVM/main: 639e401db3 \| (Elizabeth Mattijsen)++ \| src/strings/ops.c Revert "Add fast path when difference for 8-bit strings" This reverts commit 492e511f0df59fadc44c2fb690b3e877a5834f40.		Copy link Message link Add to gist Remove
timo	well, a full revert is maybe a bit much since we didn't bump yet		Copy link Message link Add to gist Remove
lizmat	better be safe than sorry, I'd say	11:00	Copy link Message link Add to gist Remove
timo	we will have to check if either of the two graphemes is a synthetic, in which case we can't do the fast path. we have to see if it's still faster to do it this way when the additional check goes in		Copy link Message link Add to gist Remove
lizmat	I was just about to say :-)		Copy link Message link Add to gist Remove
timo	i have to go AFK for a bit so i can't properly create a test case that shows this		Copy link Message link Add to gist Remove
lizmat	I'll keep my handz in daz pokkets	11:01	Copy link Message link Add to gist Remove
timo	but it'd probably look something like "create two buffers of utf8 bytes that are decoded in two different orders after program start which result in a character with lots of combiners on it so it's a synthetic, guaranteed. then compare strings that are less than 8 graphemes long, the same length, and end in one and the other synthetic, respectively"		Copy link Message link Add to gist Remove
	if my worry is correct, those would give different results based on which synthetic was registered first by decoding the buf	11:02	Copy link Message link Add to gist Remove
	we can't just create the buf from a string in the same program run because then the synthetic grapheme would be registered already at compile time and then depend on where it's seen when reading in the source code		Copy link Message link Add to gist Remove
lizmat	.oO( oh what a tangled web we weave :-)	11:03	Copy link Message link Add to gist Remove
timo	where does that come from btw?		Copy link Message link Add to gist Remove
lizmat	nosweatshakespeare.com/quotes/famo...-we-weave/	11:04	Copy link Message link Add to gist Remove
timo	m: my $with_a = Buf8.new(0x41, 0xCD, 0x99, 0xE2, 0x83, 0xB0); my $with_b = Buf8.new(0x42, 0xCD, 0x99, 0xE2, 0x83, 0xB0); say $with_a cmp $with_b	11:09	Copy link Message link Add to gist Remove Run code
camelia	===SORRY!=== Error while compiling <tmp> Undeclared name: Buf8 used at lines 1, 1. Did you mean 'buf8', 'Buf'?		Copy link Message link Add to gist Remove
timo	m: my $with_a = buf8.new(0x41, 0xCD, 0x99, 0xE2, 0x83, 0xB0); my $with_b = buf8.new(0x42, 0xCD, 0x99, 0xE2, 0x83, 0xB0); say $with_a cmp $with_b		Copy link Message link Add to gist Remove Run code
camelia	Less		Copy link Message link Add to gist Remove
timo	forgot to decode		Copy link Message link Add to gist Remove
	m: my $with_a = buf8.new(0x41, 0xCD, 0x99, 0xE2, 0x83, 0xB0); my $with_b = buf8.new(0x42, 0xCD, 0x99, 0xE2, 0x83, 0xB0); say $with_a.decode cmp $with_b.decode		Copy link Message link Add to gist Remove Run code
camelia	Less		Copy link Message link Add to gist Remove
timo	m: my $with_a = buf8.new(0x41, 0xCD, 0x99, 0xE2, 0x83, 0xB0); my $with_b = buf8.new(0x42, 0xCD, 0x99, 0xE2, 0x83, 0xB0); $with_b.decode; $with_a.decode; say $with_a.decode cmp $with_b.decode		Copy link Message link Add to gist Remove Run code
camelia	Less		Copy link Message link Add to gist Remove
timo	m: my $with_a = buf8.new(0x41, 0xCD, 0x99, 0xE2, 0x83, 0xB0); my $with_b = buf8.new(0x42, 0xCD, 0x99, 0xE2, 0x83, 0xB0); $with_a.decode; $with_b.decode; say $with_a.decode cmp $with_b.decode	11:10	Copy link Message link Add to gist Remove Run code
camelia	Less		Copy link Message link Add to gist Remove
timo	ok, with the changes from the PR these all still have to give "Less"		Copy link Message link Add to gist Remove
22:06 kjp left, kjp_ joined 22:43 kjp_ left, kjp joined 22:58 sena_kun left

Please report any issues / comments / feature requests as an issue on App::Raku::Log.

Thank you!