🦋 Welcome to the IRC channel of the core developers of the Raku Programming Language (raku.org #rakulang). This channel is logged for the purpose of history keeping about its development | evalbot usage: 'm: say 3;' or /msg camelia m: ... | Logs available at irclogs.raku.org/raku-dev/live.html | For MoarVM see #moarvm
Set by lizmat on 8 June 2022.
00:12 lizmat joined, Geth joined 01:03 vrurg_ left
Geth rakudo: usev6++ created pull request #5708:
[JVM] Decont Routine object when looking for attribute
08:32
09:04 sena_kun joined
bartolin bisectable6: multi sub foo(Mu $a) is default { say "no named" }; multi sub foo(Mu $a, *%_) { say "with named" }; foo(42); foo(42, :foo) 09:45
bisectable6 bartolin, Will bisect the whole range automagically because no endpoints were provided, hang tight
bartolin, Output on all releases: gist.github.com/e3da7bf0cd63bc48ef...d7777ee2fc 09:46
bartolin, Bisecting by exit code (old=2021.09 new=2021.10). Old exit code: 1
09:48 sena_kun left
bisectable6 bartolin, bisect log: gist.github.com/39064be6b6685b4507...69e63c55f1 09:50
bartolin, Bisecting by output (old=2016.08.1 new=2016.09) because on both starting points the exit code is 1
bartolin, bisect log: gist.github.com/4efa5ab770655dcee6...c53ecf83dd
bartolin, Output on all releases and bisected commits: gist.github.com/902373a4744c6e5251...24bb082422
12:56 lizmat_ joined 12:58 lizmat left, lizmat_ left 12:59 lizmat joined
ugexe one millions conditionals has been precompiling for 40 hours now. i think i'm bout to cut it short 16:25
gist.github.com/ugexe/2435aacd7016...b9b7936617 16:28
timo would you like to try again to figure out where in the parsing process we are on your machine right now? 16:30
ugexe yeah sure 16:31
timo ok, you would "lldb --attach-name rakudo" or if more likely you would have the pid and --attach-pid 12345blah 16:32
ugexe yeah im attached 16:33
timo now we set a breakpoint to the nqp_nfa_run function, hold on 16:34
breakpoint set --name nqp_nfa_run
and "c" in order to run until we hit the breakpoint
ugexe (lldb) breakpoint set --name nqp_nfa_run 16:35
Breakpoint 1: where = libmoar.dylib`nqp_nfa_run + 40 [inlined] MVM_string_check_arg at ops.h:38:12, address = 0x0000000100a587e8
(lldb) c
Process 14888 resuming
timo print offset # gives you the position the nfa was run at
print target->body # lets you see the size of the string in question, in case it's just parsing something inside a shorter string 16:36
ah interesting, it might be past the parsing stage already
in that case, do "process interrupt" and we'll look at where the code currently is at
ugexe (lldb) print target->body
error: Process is running. Use 'process interrupt' to pause execution. 16:37
gist.github.com/ugexe/de94e0fcfba7...4bc8c0add3
timo yes, those commands only make sense after we have hit the breakpoint, but we might not be seeing any nfa any more if we're not doing parsing at the moment
ok, please see if "up" once or twice gives you a valid value for "tc" in the function's signature, one that is not "<unavailable>" 16:38
ugexe gist.github.com/ugexe/b5253885a038...d4c2eeee7a 16:39
timo we can `call MVM_dump_backtrace(tc)` which will cause moar to spit out a backtrace on its stderr, i.e. not visible in the lldb terminal
there are other ways to find out where we are, but this is the easiest
we might be in a very deep stack though
ugexe how do i capture `call MVM_dump_backtrace(tc)` if it doesn't go to the lldb terminal? 16:40
oh it would be in the process ive had running wouldnt it
timo yes
ugexe nothing happened 16:41
timo i'm not exactly sure how lldb behaves when you "call" something yourself and want to step in it
interesting. ok, let's keep looking
there's also MVM_dump_bytecode(tc) which also goes to the stderr of your process 16:42
ugexe im not sure if stderr from a precompiling process would make it to stderr of the parent process
timo oh!
yes you're right
ugexe if it did output anything it probably messed up something since presumably it is doing something with that stderr it captures
timo oh 16:43
would it think there has been an error of some kind?
ugexe im not sure honestly. i think it is used to emit precompilation dependency related information 16:44
which in this case might not be relevant since it is just a single module with no dependencies
timo we can just do this: print MVM_exception_backtrace_line(tc, tc->cur_frame, 0, *(tc->interp_cur_op)) 16:45
and then: print MVM_exception_backtrace_line(tc, tc->cur_frame->caller, 1, *(tc->interp_cur_op))
and then: print MVM_exception_backtrace_line(tc, tc->cur_frame->caller->caller, 1, *(tc->interp_cur_op))
and then: print MVM_exception_backtrace_line(tc, tc->cur_frame->caller->caller->caller, 1, *(tc->interp_cur_op))
etc. pp. 16:46
that's what MVM_dump_backtrace does, but it also printf's it to stderr and frees the string
which is fine for us to not do
the output of that will appear inside lldb 16:47
ugexe gist.github.com/ugexe/1b6364d72089...ef42d4ec45 16:48
timo after we know where we're at we can look at the rakudo/nqp source and see what variables we can look at to tell us something about our progress
ok, how about we "resume" then "process interrupt" again and see if we're still in that frame or somewhere completely different 16:50
i have no idea if STMTSTEMPS can ever become too big or something 16:52
ugexe gist.github.com/ugexe/b8a98eb29910...629d25566d 16:53
timo but what we can do is set a breakpoint in MVMHash_at_key which will let us look at a few hashes that get accessed, and with what key (with a little extra effort)
OK, that's inside garbage collection 16:54
i'd say we jump ahead a bit further again
ugexe gist.github.com/ugexe/74ed4a7f1753...45f4a9c54f 16:55
still gc i guess 16:57
timo yeah, let's look at how big this hash is
i believe "finish" will tell us the return vlue 16:58
ugexe gist.github.com/ugexe/2a1dd7bbdb81...20f61565f5
timo no it doesn't seem to tell us that, but we should be able to print the "elems" variable, which should become available and have the correct contents after a few "step" commands 16:59
or immediately
so `print elems`
ugexe (lldb) print elems 17:00
(MVMuint64) 2
timo oh ok, that's not a very interesting one i suppose
we can "c" again
ugexe gist.github.com/ugexe/fab10763ba11...c6a5d40f67 - i interrupted and continued a few times to get some more samples 17:02
timo BRB bathroom break
MVM_gc_collect_free_gen2_unmarked is a later phase in the garbage collection, so it'd be over in a moment 17:03
ugexe i guess i killed the process somehow 17:14
timo oh what does the log look like? 17:15
ugexe had opened a web tmate session i was going to pm you, and then went to continue the process in lldm and detach from the process, then lldb acted like it was froze for 2 minutes and then there is now there are no rakudo/moar processes running 17:16
timo huh, that's weird
not sure what might have happened there 17:17
did it save a core dump per chance?
you might find that in the "Terminal.app"
ugexe i dont have core dumps enabled
timo feels like i broke it >_< 17:18
lizmat perhaps try again with 100K instead of 1M? 17:19
timo Stage parse : 66.119 17:21
that is one deep backtrace ... looks like we compile an if/elsif/else as a very deeply nested, very one-sided tree 17:23
lizmat is that the same with "with / orwith / else ? 17:29
my guess: it probably is 17:30
ugexe i see, so extremely nested things have non-ideal performance characteristics. i think an issue here though is that it is hard to reason about what is actually deeply nested, as in this case (unlike the for-loop one) the code itself isn't nested 17:32
timo well, here we are sorting the keys of a hash with 10k entries 17:38
lizmat because of reproducible build? 17:44
timo it's taking an awfully long time sorting huh 17:51
lizmat this is an NQP sort ? 17:52
or a sort at MoarVM level ?
timo github.com/raku/nqp/blob/c4b44e979....nqp#L1409
it's the sorted_keys function from nqp/Hash.nqp 17:53
nqp/core/Hash.nqp
nqp src/core/Hash.nqp
ugexe "Make building Perl 6 code reproducible"
lizmat making %*STMTTEMPS a hash may not have been such a good idea in the end ? 17:55
if that weren't a hash, it wouldn't need to be sorted 17:56
timo gist.github.com/timo/dc80f576574a3...734429e231 the keys in the hash 18:01
--target=ast takes a long-ass time to dump the structure 18:05
Dec 01 19:06:09 pebbsi systemd-oomd[1852]: Killed /user.slice/user-1000.slice/user@1000.service/user.slice/libpod-75c648286b3a991c39e08ee4cb160bbf7e4af6c7dcd8abf5c4107325a434a2d1.scope/container due to memory used (32280829952) / total (33363832832) and swap used (17430986752) / total (19327344640) being more than 90.00% 18:06
ugexe yeah it feels like using a different data structure could potentially be helpful. i wonder if there is one that wouldnt tradeoff against the more common scenario much 18:08
timo i'm not sure why there's so many lowered lexicals in the first place 18:10
is it the $_ of every one of these blocks perhaps?
lizmat if we would have a MoarVM op that would search a string array for a string and return its index
then we wouldn't have to that in NQP and there's a good chance that it would actually beat hashes for N entries where N would be between 5 and 10 I'd say 18:11
*to do that
timo 5600 stack frames, asking moar-remote to "all lex" that :D 18:12
cpu usage is suspiciously low, there's probably some optimization possible here ... 18:13
- QAST::Var(local __lowered_lex_503) <wanted> $num 18:30
every one of these is an instance of "$num"?
worklist with items = 1725274, yeowch 18:47
looking at the "insert_bytecode" byetcode, it looks like we might be doing a lot of unnecessary boxing and unboxing of integers, i wonder why we are missing these 19:00
19:45 librasteve_ joined 19:56 gfldex left
Geth rakudo/azure_improvements_squashed: e1e348c75b | (Patrick Böker)++ | azure-pipelines.yml
Run tests irrespective of prev failure
20:24
20:35 sena_kun joined 20:55 gfldex joined
Geth rakudo/azure_improvements_squashed: 2cf6928091 | (Patrick Böker)++ | azure-pipelines.yml
more ext tests
22:09
22:31 sena_kun left
timo the amount of time spent in GC seems to be rather large for this stage 22:33
Geth rakudo/azure_improvements_squashed: f835704ff3 | (Timo Paulssen)++ (committed by Patrick Böker) | 2 files
Reimagine Azure CI

  - Prettier output (using the JUnit test result formatter)
  - Restructure making better use of parameters
  - Disable OOMing Win_JMV
  - Speed up CI on Windows by working around the compiler search
  - More granular output using `echo` commands
  - add a nodelay & blocking spec test run
  - persist build artifacts
  - Speed up testing by running tests in parallel
  - Bump the OS versions to latest as of now
22:46
[Coke] ls 22:52
timo i'm not sure i'm going to let this finish tbh 23:50
but i think using --optimize=0 on odd-or-even.rakumod can make it a lot faster
to clarify, i mean "on the order of 10 hours" rather than "on the order of 50 hours"