#raku-beginner on 27 December 2021 - Raku Programming Language Log

17:18 discord-raku-bot left, discord-raku-bot joined
tope	hi, totally new to raku and a question about grammars: is there some guide translating my "context-free grammar" thinking into thinking about grammars? I'm struggling with non-greedy matching in particular. I want to match something on the form `<text> <stuff>?` where `<stuff>` have a strict "structure" that is easy to match, but `<text>` is essentially "anything, but non-greedily" like `<-[\n]>*?`, i.e.	17:42	Copy link Message link Add to gist Remove
Anton Antonov	"[...] translating my "context-free grammar" thinking into thinking about grammars" -- Hmmm... should be straightforward. Do think in BNF of EBNF?	17:47	Copy link Message link Add to gist Remove
	Here is a more useful answer: `rule org-section { <stars-spec> <todo-marker>? <text> <tag-list>?}`	17:49	Copy link Message link Add to gist Remove
thowe	I haven't read the Lenz Grammars book yet...	17:50	Copy link Message link Add to gist Remove
	I have it, but haven't gotten that far.	17:51	Copy link Message link Add to gist Remove
Anton Antonov	Here is a more useful answer: `rule org-section { <stars-spec> <todo-marker>? <text> <org-tag-list>?}`		Copy link Message link Add to gist Remove
	Here is a more useful answer: `rule org-section { <stars-spec> <org-todo-marker>? <text> <org-tag-list>?}`		Copy link Message link Add to gist Remove
	```	17:53	Copy link Message link Add to gist Remove
	rule text { \w+ }		Copy link Message link Add to gist Remove
	rule marker { 'TODO' \| 'DONE' \| 'CANCEL' }		Copy link Message link Add to gist Remove
	rule org-tag { \w+ }		Copy link Message link Add to gist Remove
	rule org-tag-list { <org-tag> % ':' }		Copy link Message link Add to gist Remove
	```		Copy link Message link Add to gist Remove
	Well I have to say the above Raku code is just an example -- some small important tweaks might have to be done in order to work on actual org-mode sections.		Copy link Message link Add to gist Remove
	(Meaning, I just wrote that code here in the chat, did not try to run it.)		Copy link Message link Add to gist Remove
tope	@Anton Antonov#7232 But I'd like for `<text>` to maybe contain `:` and the like. E.g. `** Tags on the form :ARCHIVE: are special :tag1:tag2:`	17:55	Copy link Message link Add to gist Remove
Anton Antonov	@thowe Moritz Lenz' book "Parsing with Perl 6 Regexes and Grammars" was first Perl6 book I read.		Copy link Message link Add to gist Remove
thowe	I got both of his books and "Learning Perl 6".	17:56	Copy link Message link Add to gist Remove
	I started with his basics book, but then didn't touch it for a while and started again with Learning. Going through it now. Also watching a lot of Raku YouTube videos.	17:57	Copy link Message link Add to gist Remove
Anton Antonov	Then include ':' in the text rule : ` rule text { [ ':' \| \w]+ }`.		Copy link Message link Add to gist Remove
tope	but won't that just swallow the tag?		Copy link Message link Add to gist Remove
	my starting point was something like `^^ <stars> " " <todo>? <priority>? <text> <tags>? $$`. but `token text { <-[\n]>? }` however the non-greedy matching ? doesn't seem to work with tokens?		Copy link Message link Add to gist Remove
Anton Antonov	<@93032313142669312> Sure, it can do the swallowing, but you can use the appropriate regexes. I responded to you context-free-grammar thinking request.	17:59	Copy link Message link Add to gist Remove
	@tope#9134 Sure, it can do the swallowing, but you can use the appropriate regexes. I responded to your context-free-grammar thinking request.		Copy link Message link Add to gist Remove
tope	hmm yeah come to think of it I'm not sure how I'd translate this to context-free either. I'd maybe write something like `text_and_tags -> <tags> $ \| $ \| . <text_and_tags>`, so it keeps trying to match the ending or tags and only falls back to adding chars to the text when it fails (where alternative tries left-to-right in order until success)	18:05	Copy link Message link Add to gist Remove
	but I guess my problem is doing this kind of non-greedy matching in general		Copy link Message link Add to gist Remove
Anton Antonov	<@93032313142669312> Ok I will post a response with the next hour.	18:06	Copy link Message link Add to gist Remove
tope	or basic question: should I be using `regex` instead of `token` or `rule`? can token/rules do non-greedy matching?		Copy link Message link Add to gist Remove
Anton Antonov	<@93032313142669312> I see -- `regex` is for comprehensive matching; `rule` and `token` are for more streamlined and optimized parsing, without parsing too hard.	18:08	Copy link Message link Add to gist Remove
	Sorry, for being too vague, but the precise definitions are given in the documentation at raku.org .	18:09	Copy link Message link Add to gist Remove
tope	yeah I've gone through the grammar tutorial & the grammar page, though I feel I have this curse where what I want to do never matches the examples given in tutorials. and/or I'm not smart enough to see how / translate it.	18:10	Copy link Message link Add to gist Remove
	I could try searching github for people who have written other, more complicated parsers using grammar {}	18:11	Copy link Message link Add to gist Remove
Anton Antonov	<@93032313142669312> -- Ahh, I know that curse too! :0	18:12	Copy link Message link Add to gist Remove
	@tope#9134 -- Ahh, I know that curse too! 🙂		Copy link Message link Add to gist Remove
	<@93032313142669312> So, are you a fan of org-mode, or just cursed with it? (Because of a certain project or else...)		Copy link Message link Add to gist Remove
tope	No, I actually love org-mode. But use it mostly in lieu of markdown for writing rather than as an organizational tool, as I find markdown way too simplistic, but org-mode has more flexibility and formatting options as it comes with latex, tables, (advanced) footnotes, etc etc.	18:14	Copy link Message link Add to gist Remove
	and just in general prefer the org-mode syntax over markdown's, find it useful to have a separation of =typewriter= from ~code~ in formatting, prefer /italics/ etc.	18:16	Copy link Message link Add to gist Remove
Anton Antonov	<@93032313142669312> Ok. I use org-mode a lot in my projects.		Copy link Message link Add to gist Remove
tope	yeah and so I have a sort of blog/homepage/technical writeups written in markdown currently (mdbook), and looking into making some tools for myself to feed org-mode instead of markdown into mdbook -- and thought it could be a sort of beginner-project for learning raku	18:18	Copy link Message link Add to gist Remove
Anton Antonov	Ok, I am doing kind of the same things -- if you want we can "join forces."		Copy link Message link Add to gist Remove
tope	I used and loved perl5 15+ years ago, but since then all I've done is mostly python/rust/c++, however I do love a lot of the language features in raku		Copy link Message link Add to gist Remove
	esp. nice that raku is one of the few languages doing operators The Right Way(tm), i.e. completely free user-defined operators like in Haskell, rather than some static list of accepted symbols that can be overloaded.	18:20	Copy link Message link Add to gist Remove
Anton Antonov	<@93032313142669312> Here is what I have so far:	18:24	Copy link Message link Add to gist Remove
	```		Copy link Message link Add to gist Remove
	grammar OrgMode {		Copy link Message link Add to gist Remove
	rule TOP { [ <org-section-header> \| <line-spec> ]+ % "\n" }		Copy link Message link Add to gist Remove
	regex org-section-header {		Copy link Message link Add to gist Remove
	\| <org-section-header-not-tags> \h+ <org-tag-list>		Copy link Message link Add to gist Remove
	\| <org-section-header-not-tags> }		Copy link Message link Add to gist Remove
	regex org-section-header-not-tags { <org-stars-spec> [\h+]? <org-todo-marker>? [\h+]? <text> }		Copy link Message link Add to gist Remove
	token org-stars-spec { '*'+ }		Copy link Message link Add to gist Remove
	regex text { [\V]+ }	18:25	Copy link Message link Add to gist Remove
	token org-todo-marker { 'TODO' \| 'DONE' \| 'CANCEL' }		Copy link Message link Add to gist Remove
	token org-tag { ':' \w+ }		Copy link Message link Add to gist Remove
	regex org-tag-list { <org-tag>+ }		Copy link Message link Add to gist Remove
	}		Copy link Message link Add to gist Remove
	Produces this output:		Copy link Message link Add to gist Remove
	```		Copy link Message link Add to gist Remove
	** TODO First section｣		Copy link Message link Add to gist Remove
	org-section-header => ｢** TODO First section｣		Copy link Message link Add to gist Remove
	org-section-header-not-tags => ｢** TODO First section｣		Copy link Message link Add to gist Remove
	org-stars-spec => ｢**｣		Copy link Message link Add to gist Remove
	org-todo-marker => ｢TODO｣		Copy link Message link Add to gist Remove
	text => ｢First section｣		Copy link Message link Add to gist Remove
	｢** TODO First section :TAG1:TAG2｣		Copy link Message link Add to gist Remove
	org-section-header => ｢** TODO First section :TAG1:TAG2｣		Copy link Message link Add to gist Remove
	org-section-header-not-tags => ｢** TODO First section｣		Copy link Message link Add to gist Remove
	org-stars-spec => ｢**｣		Copy link Message link Add to gist Remove
	org-todo-marker => ｢TODO｣		Copy link Message link Add to gist Remove
	text => ｢First section｣		Copy link Message link Add to gist Remove
tope	ah hehe, you weren't kidding, you're actually writing an org-mode parser too	18:26	Copy link Message link Add to gist Remove
Anton Antonov	I did not define the `<line-spec>` , but yeah why not write an org-mode parse.		Copy link Message link Add to gist Remove
	I did not define the `<line-spec>` , but yeah why not write an org-mode parser.		Copy link Message link Add to gist Remove
	Note, that I am kind of dealing with the greediness in a sort of ad hoc manner with the rule `<org-section-header>`.	18:27	Copy link Message link Add to gist Remove
tope	yeah you're dealing with it by using `regex` that backtracks I guess?	18:28	Copy link Message link Add to gist Remove
Anton Antonov	Sure, that, but I also give precedence to parsing of section specs with a tags lists	18:30	Copy link Message link Add to gist Remove
	Sure, that, but I also give precedence to parsing of section specs with a tags lists.		Copy link Message link Add to gist Remove
	It is not a well-though grammar yet -- I just wrote it...	18:31	Copy link Message link Add to gist Remove
	But, as you suggested, I strongly suspect someone has already written an org-mode grammar in Raku and posted in the web...	18:32	Copy link Message link Add to gist Remove
tope	```		Copy link Message link Add to gist Remove
	grammar Test {		Copy link Message link Add to gist Remove
	token TOP { ^^ "*"+ <todo>? <prio>? <tt> $$ }		Copy link Message link Add to gist Remove
	token todo { "TODO" \|\| "DONE" \|\| "IDEA" \|\| "KILL" \|\| "PROG" }		Copy link Message link Add to gist Remove
	token prio { "[#" <[\V]> "]" }		Copy link Message link Add to gist Remove
	token tt { <tags>? $$ \|\| <[\V]> <tt> }		Copy link Message link Add to gist Remove
	token tags { ":" <tag>+ % ":" ":" }		Copy link Message link Add to gist Remove
	token tag { <[\w] + [\# @ %]>+ }		Copy link Message link Add to gist Remove
	}		Copy link Message link Add to gist Remove
	```		Copy link Message link Add to gist Remove
	this was my only idea, to use a sort of muncher. would need the actions to handle collecting the chars into the title i guess		Copy link Message link Add to gist Remove
	but of course this is just for a simple heading. my next quest is to figure out how one would express the fact that the <section> after a heading with X stars should only match headings with X+1 or more stars..	18:36	Copy link Message link Add to gist Remove
	i.e. if that sort of logic can be taken care of by the grammar-actions, without requiring extra logic outside of the grammar stuff	18:37	Copy link Message link Add to gist Remove
lizmat	and yet another Rakudo Weekly News hits the Net: rakudoweekly.blog/2021/12/27/2021-...-released/		Copy link Message link Add to gist Remove
Anton Antonov	Yes this can be done with grammars. I have implementations that have conditional parsing similar to what you describe. The book by M. Lentz mentioned above has examples for that kind of parsing.	18:42	Copy link Message link Add to gist Remove
tope	ah, thanks, I'll see if I can procure it and add it to my bedtime reading list	18:43	Copy link Message link Add to gist Remove
Anton Antonov	Here is my org-mode grammar so far:	18:44	Copy link Message link Add to gist Remove
	```		Copy link Message link Add to gist Remove
	grammar OrgMode {		Copy link Message link Add to gist Remove
	regex TOP { [ <empty-line> \| <org-section-header> \| <line-spec> ]* % \v }		Copy link Message link Add to gist Remove
	regex org-section-header {		Copy link Message link Add to gist Remove
	\| <org-section-header-not-tags> \h+ <org-tag-list>		Copy link Message link Add to gist Remove
	\| <org-section-header-not-tags> }		Copy link Message link Add to gist Remove
	regex org-section-header-not-tags { <org-stars-spec> [\h+]? <org-todo-marker>? [\h+]? <text> }	18:45	Copy link Message link Add to gist Remove
	token org-stars-spec { '*'+ }		Copy link Message link Add to gist Remove
	regex text { [\V]+ }		Copy link Message link Add to gist Remove
	token org-todo-marker { 'TODO' \| 'DONE' \| 'CANCEL' }		Copy link Message link Add to gist Remove
	token org-tag { ':' \w+ }		Copy link Message link Add to gist Remove
	Here is a parsing result of org-mode text:		Copy link Message link Add to gist Remove
	```		Copy link Message link Add to gist Remove
	｢** TODO First section :TAG1:TAG2		Copy link Message link Add to gist Remove
	** TODO Second section :TAG1:TAG3		Copy link Message link Add to gist Remove
	- This text 1		Copy link Message link Add to gist Remove
18:45 discord-raku-bot left, discord-raku-bot joined
	@tobe This package of mine evaluates Raku code sections in both Markdown and org-mode: modules.raku.org/dist/Text::CodePr...an:ANTONOV	18:49	Copy link Message link Add to gist Remove
	@tobe Do you use Babel-org-mode ?	18:50	Copy link Message link Add to gist Remove
19:05 mmat joined, mmat left
tope	@Anton Antonov#7232 thanks for the link, I'll look at that too! as for babel I'm not sure -- is babel just the part that allow you to evaluate code directly in org-mode? If so, then yes -- I've used it extensively to write literate documents with code. but if it's something else then no, I don't think so	19:33	Copy link Message link Add to gist Remove
Anton Antonov	@tobe -- Yes, that Babel is for "literate programming."	19:34	Copy link Message link Add to gist Remove
tope	@Anton Antonov#7232 btw it's `tope` not `tobe` so I missed the @ notifictions. but yes, cool, then I do. (I went through and solved all the problems on cryptohack.org interactively in an org document, for example.) Though I'm far from an emacs expert, I'm mostly just using doom-emacs + a few personal settings.	19:37	Copy link Message link Add to gist Remove
Anton Antonov	<@93032313142669312> Sorry for misspelling your identifier / name.	19:40	Copy link Message link Add to gist Remove
tope	no problem		Copy link Message link Add to gist Remove
Anton Antonov	<@93032313142669312> As for cryptography -- have tried to use Mathematica for cryptography? I am a big fan of Mathematica and I have programmed several types of connections between Mathematica and Raku. (Documenting them right now...)	19:42	Copy link Message link Add to gist Remove
	So, basically I want to use literate programming with Raku through Mathematica notebooks.		Copy link Message link Add to gist Remove
tope	Nah, I haven't, but I'm very comfortable with Python/SageMath and have built up a ton of personal tooling for cryptography-related stuff over some years where I've participated in CTF competitions, so never felt the need to look elsewhere	19:47	Copy link Message link Add to gist Remove
Anton Antonov	Ok, I am interested comparing programming languages over different computational workflows. I plan to design a conversational agent for Cryptography workflows in the next few months. (Using Raku of course.) So, I might ask for input from you.	19:49	Copy link Message link Add to gist Remove
	Ok, I am interested in comparing programming languages over different computational workflows. I plan to design a conversational agent for Cryptography workflows in the next few months. (Using Raku of course.) So, I might ask for input from you.		Copy link Message link Add to gist Remove
tope	have a lot of writeups for crypto-related problems at franksh.gitlab.io/ctf/ tho the old articles are probably a bit quirky since they were forcibly converted from org-mode documents to markdown+katex.		Copy link Message link Add to gist Remove
	(and they're probably not very useful to people who didn't play those CTFs and are thus not familiar with the problems)	19:51	Copy link Message link Add to gist Remove
Anton Antonov	Ah, back org-mdoe ?! 🙂 See this please: github.com/alainbebe/org-mode-gtk.raku		Copy link Message link Add to gist Remove
tope	ah yes, that's great, thanks		Copy link Message link Add to gist Remove
thowe	what does -Ofun mean?	21:56	Copy link Message link Add to gist Remove
	as seen in the weekly news		Copy link Message link Add to gist Remove
gfldex	thowe: Raku is optimised for fun.	22:02	Copy link Message link Add to gist Remove
thowe	I get that, but what is "-O" ?	22:05	Copy link Message link Add to gist Remove
	is that not a Raku lang thing?		Copy link Message link Add to gist Remove
	I feel that is some kind of idiom or inside joke that had something to do with Raku, but I don't see that when trying to search the docs for it.	22:06	Copy link Message link Add to gist Remove
gfldex	gcc -O3 your-file.cc	22:22	Copy link Message link Add to gist Remove
tope	syntax question: ```> say(<a b> X~ ^2); say(<a b>X~^2)		Copy link Message link Add to gist Remove
	(a0 a1 b0 b1)		Copy link Message link Add to gist Remove
	(S P)		Copy link Message link Add to gist Remove
	``` the first result I understand (and expected), the second result I have no idea what means / what happens.		Copy link Message link Add to gist Remove
gfldex	c++ is clearly optimised so you need 3 ppl to review your code :)	22:23	Copy link Message link Add to gist Remove
gfldex	m: dd <a b>X~^2;	22:34	Copy link Message link Add to gist Remove
	m: dd <a b>(X~^2);	22:36	Copy link Message link Add to gist Remove
thowe	Ah, compiler switch. Thanks. makes sense.		Copy link Message link Add to gist Remove
tope	yeah never mind I'm dumb, I though X~ was a separate operator, but X is a metaop, which modifies the infix `~^` to apply across the lists, basically xoring the chars	22:40	Copy link Message link Add to gist Remove

Please report any issues / comments / feature requests as an issue on App::Raku::Log.

Thank you!