|
00:17
kjp left
|
|||
| ShimmerFairy | timo: The "A\c[ZWJ]🧀" string is in fact incorrect. ZWJ only joins an existing grapheme cluster unconditionally, it doesn't necessarily extend further. Only rules GB9c (thanks to InCB=Extend) and GB11 allow a ZWJ to be followed by some otherwise non-combining character. | 00:52 | |
| The issue is that current MoarVM handles rule GB11 by only checking for a preceding ZWJ and following Extended_Pictographic, which isn't enough. Rule GB11 requires you to look back further to find a string of Extend-type codes following another Extended_Pictographic. | 00:53 | ||
| As for the breaking function, what I wrote (most of) yesterday is essentially just the old function, but with 'MVMCodepoint a' replaced with state kept in the normalizer. However, by the end of yesterday I couldn't escape the feeling that this was fundamentally the wrong approach now, so I'm going to try something else first. | 01:29 | ||
| Regardless of the solution, the concat check task you mentioned definitely can't use the same function as the normalization task anymore, they just require different behaviors with the current grapheme rules. | 01:30 | ||
|
03:36
Voldenet_ joined,
vrurg joined,
nativecallable6 left
03:37
vrurg_ left,
Voldenet left,
JRaspass left,
Voldenet_ is now known as Voldenet
03:39
nativecallable6 joined
03:41
Voldenet_ joined
03:42
Voldenet left,
Voldenet_ is now known as Voldenet
03:43
nativecallable6 left
03:44
nativecallable6 joined
03:49
JRaspass joined,
nativecallable6 left,
nativecallable6 joined
10:14
committable6 left
10:15
committable6 joined
|
|||
| timo | thanks for the explanation, that's something i had missed before | 10:42 | |
|
11:14
woodi joined
11:15
woodi left
|
|||
| ShimmerFairy | Good news: I got grapheme parsing working, at least for the test strings "A\c[ZWJ]🧀" (2 graphemes) and "𑩐𝪄ꠂ्ႎ" (1 grapheme, InCB test)! Bad news: rakudo doesn't build. I managed to narrow the issue down to seemingly concating strings with certain characters in front. (Which shows up as hanging on parsing the > in "prefix:sym<++⚛>" in the Perl6 grammar.) | 16:36 | |
| I'll have to try fixing it tomorrow, but it's a good sign that it does function properly on easy test evals, and nqp's "make test" runs for the same roughly 19 seconds as before, so at first glance things haven't slowed horrifically. | 16:39 | ||
| japhb | ShimmerFairy++ | 16:52 | |
| librasteve_ | rakudoweekly.blog/2025/12/01/2025-...t-is-here/ | 19:42 | |
|
21:44
librasteve_ left
|
|||