#moarvm on 2 August 2017 - Raku Programming Language Log

00:24 harrisi joined 01:50 vendethiel joined 01:52 ilbot3 joined 02:00 https_GK1wmSU joined, https_GK1wmSU left
harrisi	jnthn: You're Jonathan Worthington, I'm assuming? If so, is anything relating to your dissertation public? It sounds as though it is researching exactly a project I've been interested in working on.	03:09	Copy link Message link Add to gist Remove
	jnthn: .. this is disappointing, but I found it actually. :)	03:13	Copy link Message link Add to gist Remove
05:53 brrt joined 06:27 brrt joined
harrisi	jnthn: hope you don't me bothering you, but the link to the slides for "All Your Dynamic Language Are Belong To Us (Guest Lecture at Stockholm University)" gives a 404, from your articles page (www.jnthn.net/articles.shtml). Do you have that up somewhere? It looked like it might be related to bytecode translation, and I've read all your other slides on it.	06:29	Copy link Message link Add to gist Remove
06:58 brrt joined
samcv	good *	07:19	Copy link Message link Add to gist Remove
brrt	good * samcv	07:21	Copy link Message link Add to gist Remove
samcv	going to update Unicode Database for unicode 10 and also add in names for Hangul (Korean) codepoints		Copy link Message link Add to gist Remove
	since that is done algorithmically. though i still have no clue why unicode didn't just put the damn names in the unicode data file...	07:22	Copy link Message link Add to gist Remove
	there's literally no reason i can think of why they can't have just written them in there		Copy link Message link Add to gist Remove
brrt	:-)	07:27	Copy link Message link Add to gist Remove
	ask them!		Copy link Message link Add to gist Remove
08:01 zakharyas joined
harrisi	jnthn: I have a lot more questions and comments, but I wanted to say that your paper now has one of my favorite quotes in any research paper I've ever seen. That quote being "As in real life, getting too intimate leads to excessive complexity." :)	08:07	Copy link Message link Add to gist Remove
08:17 vendethiel joined
jnthn	morning #moarvm o/	09:00	Copy link Message link Add to gist Remove
	harrisi: Yes, I'm that Jonathan. It's been years since I worked on bytecode translation though.	09:01	Copy link Message link Add to gist Remove
	I'm...not sure about those missing slides. Hmm. But I don't think that talk actually was about the bytecode translation work.	09:02	Copy link Message link Add to gist Remove
	Pretty sure it was about the idea of a common AST to compile different languages to.	09:03	Copy link Message link Add to gist Remove
brrt	good hi jnthn	09:13	Copy link Message link Add to gist Remove
jnthn	o/ brrt		Copy link Message link Add to gist Remove
	How'd the talk go yesterday?		Copy link Message link Add to gist Remove
brrt	oh, my talk is next week :-)		Copy link Message link Add to gist Remove
	on friday	09:14	Copy link Message link Add to gist Remove
	sorry for the misunderstanding		Copy link Message link Add to gist Remove
	anyway, that's basically what the GraalVM folks are doing nowadays		Copy link Message link Add to gist Remove
jnthn	Oh!		Copy link Message link Add to gist Remove
brrt	(a common AST)		Copy link Message link Add to gist Remove
jnthn	So at TPC in Amsterdam?	09:15	Copy link Message link Add to gist Remove
brrt	uhuh		Copy link Message link Add to gist Remove
	i'm a bit worried that i've been 'stuck' in this subject matter for so long that what is really obvious to me isn't to others, and that i have no idea of what that is		Copy link Message link Add to gist Remove
moritz	the "curse of knowledge"	09:20	Copy link Message link Add to gist Remove
jnthn	brrt: Are you expecting to merge "this week"?	09:21	Copy link Message link Add to gist Remove
brrt	ehm. maybe. i think i've uncovered another bug :-)		Copy link Message link Add to gist Remove
jnthn	ah		Copy link Message link Add to gist Remove
brrt	i'm not 100% sure it's not a template bug	09:22	Copy link Message link Add to gist Remove
	otherwise, yeah, this week would be good		Copy link Message link Add to gist Remove
	i'll do a pull-request		Copy link Message link Add to gist Remove
	if it causes a lot of fallout, we can always switch to an enable-on-demand model for the time being	09:23	Copy link Message link Add to gist Remove
nwc10	brrt: building even-moar-jit with ASAN. Into NQP and getting a lot of	09:29	Copy link Message link Add to gist Remove
	src/jit/linear_scan.c:433:32: runtime error: shift exponent 60 is too large for 32-bit type 'int'		Copy link Message link Add to gist Remove
	src/jit/linear_scan.c:429:26: runtime error: shift exponent 60 is too large for 32-bit type 'int'		Copy link Message link Add to gist Remove
	src/jit/linear_scan.c:437:27: runtime error: shift exponent 63 is too large for 32-bit type 'int'		Copy link Message link Add to gist Remove
09:30 dogbert17 joined
brrt	really? we shouldn't be doing that on an 32 bit int	09:30	Copy link Message link Add to gist Remove
	thanks!		Copy link Message link Add to gist Remove
	the constant 1 must be the thing that's intepreted as an int	09:32	Copy link Message link Add to gist Remove
	that's silly		Copy link Message link Add to gist Remove
nwc10	that's C		Copy link Message link Add to gist Remove
	(not looked at the code, but my innacurate/incomplete understanding of C is that the constant 1 has value 1 and type int)		Copy link Message link Add to gist Remove
09:35 AlexDani` joined
dogbert17	jnthn: is the new MoarVM, i.e. master, still supposed to be slower than e.g. 2017.07 when running multithreaded apps?	09:48	Copy link Message link Add to gist Remove
jnthn	Well, the main remaining slowdown is independent of that.	09:51	Copy link Message link Add to gist Remove
	(threads or not, I mean)	09:53	Copy link Message link Add to gist Remove
dogbert17	I have a script which is ~20% slower in mt mode but only 10% slower in st mode compared to 2017.07 (in 32 bit mode :) gist.github.com/dogbert17/fc516b36...98b635dae4	09:59	Copy link Message link Add to gist Remove
10:00 buggable joined
dogbert17	I have added callgrind results to the gist (MoarVM master)	10:09	Copy link Message link Add to gist Remove
jnthn	Yeah, a lot of time in multi-dispatch which is what I'm currently repairing/improving the spesh-time resolution of	10:18	Copy link Message link Add to gist Remove
dogbert17	cool, I could do another run when those fixes show up	10:20	Copy link Message link Add to gist Remove
jnthn	Yeah, was working on it, then realized I need to do a bit of work on deopt	10:24	Copy link Message link Add to gist Remove
brrt	nwc10, thanks for that	10:42	Copy link Message link Add to gist Remove
	today i learned, C stdint has macros INT64_C(...)		Copy link Message link Add to gist Remove
	and UNINT64_C(..)	10:43	Copy link Message link Add to gist Remove
Geth	MoarVM: 0585b8d49c \| (Jonathan Worthington)++ \| 10 files Store deopt one target directly in bytecode. Rather than needing to use the offsets to resolve them each deopt. This brings the interpreted approach in line with what the JIT does, but further to that it means that we can have a sequence of guards that will deopt to the same place, which will be used for upcoming improvements to invocation specialization.	10:46	Copy link Message link Add to gist Remove
brrt	i think i've found my bug	10:50	Copy link Message link Add to gist Remove
	it's related to the 'optimistic' store insertion		Copy link Message link Add to gist Remove
Geth	MoarVM: a2128f15ba \| (Jonathan Worthington)++ \| src/jit/graph.c Ensure JIT uses index same as interpreter.	10:51	Copy link Message link Add to gist Remove
brrt	wait, you made the deopt target an instruction parameter?	10:55	Copy link Message link Add to gist Remove
Geth	MoarVM: 6ce4a2d1dc \| (Jonathan Worthington)++ \| src/spesh/optimize.h Allow calls with more args to be optimized. 4 is a little low, given nameds can need a couple of slots.		Copy link Message link Add to gist Remove
jnthn	brrt: Yeah		Copy link Message link Add to gist Remove
brrt	jnthn++, then jnthn++, then jnth++		Copy link Message link Add to gist Remove
	jnthn++		Copy link Message link Add to gist Remove
jnthn	brrt: So could probably simplify the JIT code further		Copy link Message link Add to gist Remove
brrt	oh man	10:56	Copy link Message link Add to gist Remove
	i wonder why i never did that myself		Copy link Message link Add to gist Remove
	but anyway		Copy link Message link Add to gist Remove
	thanks!		Copy link Message link Add to gist Remove
	that helps a hole bundle		Copy link Message link Add to gist Remove
	*whole		Copy link Message link Add to gist Remove
jnthn	The reason I want to do this is that it will let me, hopefully, have multiple guards that deopt to the same point	10:57	Copy link Message link Add to gist Remove
brrt	that makes a bunch of sense	10:58	Copy link Message link Add to gist Remove
	it also is much simpler, i think		Copy link Message link Add to gist Remove
	because it avoids a lookup at guard-failure	10:59	Copy link Message link Add to gist Remove
jnthn	Need to think carefully about this though		Copy link Message link Add to gist Remove
	'cus where we put the guards matters for what of the call resolution we can eliminate	11:00	Copy link Message link Add to gist Remove
	I was pondering stacking guards before prepargs		Copy link Message link Add to gist Remove
brrt	uhuh		Copy link Message link Add to gist Remove
	the careful thinking is probably why i didn't do it		Copy link Message link Add to gist Remove
	:-P		Copy link Message link Add to gist Remove
jnthn	But if I do that and they fail then we need to keep the callee stored in a register	11:01	Copy link Message link Add to gist Remove
	So the deopt can find it		Copy link Message link Add to gist Remove
	We can still have optimized the resolution of it I guess		Copy link Message link Add to gist Remove
	Anyway, I guess I'll stack them up ahead of the prepargs	11:05	Copy link Message link Add to gist Remove
	Though deopt one points are presently always after the instruction		Copy link Message link Add to gist Remove
	And I'd like them before in this case I guess	11:06	Copy link Message link Add to gist Remove
	Maybe I should just spot this in the graph and handle it specially		Copy link Message link Add to gist Remove
	I figure in most cases prepargs will have a previous instruction	11:07	Copy link Message link Add to gist Remove
brrt	always, but not always within the same basic block	11:08	Copy link Message link Add to gist Remove
jnthn	Yeah, hmm	11:09	Copy link Message link Add to gist Remove
brrt	(i've debugged my issue though)	11:10	Copy link Message link Add to gist Remove
jnthn	I guess I could introduce a :predeoptonepoint or so		Copy link Message link Add to gist Remove
	That records the offset before the prepargs		Copy link Message link Add to gist Remove
brrt	(retrospectively, inc_i/dec_i were the wrong way to solve an understandable problem)	11:12	Copy link Message link Add to gist Remove
harrisi	jnthn: is there still source somewhere?	11:21	Copy link Message link Add to gist Remove
	jnthn: I'm only about half way through your dissertation (4:22 am here - need to sleep), but there are a couple of things I'm not sure are covered in that, that I am interested in.	11:22	Copy link Message link Add to gist Remove
	one, stack -> register has been a complicated thing to think through, which your paper is helping with. I planned on doing stack <-> stack (.pyc <-> .class, for example), but there are a lot of type issues I can imagine	11:24	Copy link Message link Add to gist Remove
jnthn	Good question on the source. I...actually don't know. I think I open sourced it after I was done with the dissertation	11:25	Copy link Message link Add to gist Remove
	It's been > 10 years by now :)		Copy link Message link Add to gist Remove
	In that time the Parrot project also ceased to be developed	11:26	Copy link Message link Add to gist Remove
harrisi	the above leads me to major question number two. if I'm going from python bytecode to java bytecode (.pyc -> .class), I have to handle types, since python bytecode is untyped and java bytecode is typed. I actually just got to the part in your paper about static analysis of .net instructions for handling stack types so I think looking more into the CLR in general could help. I looked a bit at the .pyc files of two python files that add integers and one that a	11:27	Copy link Message link Add to gist Remove
jnthn	(got cut off at "one that ad")		Copy link Message link Add to gist Remove
harrisi	jnthn: yeah, the slides say it's available in the parrot source but I think I'd have to look historically.		Copy link Message link Add to gist Remove
jnthn	I was going from static -> dynamic, which is an easier problem ;)		Copy link Message link Add to gist Remove
harrisi	one that adds integers and one that adds strings. identical instructions, but the metadata should provide everything I need, I hope.	11:28	Copy link Message link Add to gist Remove
	yeah, static -> dynamic seems somewhat easier, but I don't know much of anything. :)	11:29	Copy link Message link Add to gist Remove
	all I know is that your paper and slides have been helpful recently, so thank you.		Copy link Message link Add to gist Remove
jnthn	I couldn't imagine when I did it that people would care 10 years later :-)		Copy link Message link Add to gist Remove
	Glad it's of some use		Copy link Message link Add to gist Remove
harrisi	one more question before I go. why did you end up devoting time to parrot and other things instead of continuing with bytecode translation?	11:30	Copy link Message link Add to gist Remove
	was there something that seemed like a significant show-stopper?		Copy link Message link Add to gist Remove
	I'm sure it's in the paper but it's hard to wait!	11:31	Copy link Message link Add to gist Remove
Geth	MoarVM: 0ad9ce1412 \| (Jonathan Worthington)++ \| 5 files Make prepargs a pre-deopt-one point. Which is a deopt point where we record the offset before the instruction, so if we depot we will re-run that instruction.		Copy link Message link Add to gist Remove
jnthn	Well, I got into working on Parrot because I wanted to contribute something for the Perl 6 project, and that was the most practical place I could see to help with at the time.	11:32	Copy link Message link Add to gist Remove
	The bytecode translation stuff was an intresting dissertation project. If there'd been people using it, I'd probably have kept going. But Parrot never really got the up-take.	11:34	Copy link Message link Add to gist Remove
harrisi	I'm a little over a year into community college studying computer science (with a little bit of self teaching prior), but my dreams for various languages' bytecode to be translated between each other as needed seems.. very useful, and mostly very interesting. your work is actually the first I've seen about it which talks about this at all in detail. it's a little surprising.		Copy link Message link Add to gist Remove
jnthn	There has been work on this in the .Net/JVM space I believe	11:35	Copy link Message link Add to gist Remove
	ut I think		Copy link Message link Add to gist Remove
	*But I think part of the practical problem is that most VMs get instruction sets designed for their target language	11:36	Copy link Message link Add to gist Remove
harrisi	and there are projects like jython and ironruby but they're not bytecode translators, as far as I can tell.		Copy link Message link Add to gist Remove
jnthn	And not just that, but standard library functions too		Copy link Message link Add to gist Remove
harrisi	right. one of the most concerning things to me now is (thanks to your slides) the very same code behaving very different across languages.	11:37	Copy link Message link Add to gist Remove
jnthn	So at some point you reach I/O, for example, or concurrency primitives. Generally, languages do some amount of this stuff in their standard library, and then have the low-level bits call into VM-provided functions	11:38	Copy link Message link Add to gist Remove
	And those will differ too		Copy link Message link Add to gist Remove
harrisi	in one of your examples you used incrementing strings in python and javascript, I believe. being able to translate semantics easily and effectively seems very powerful though, and I suspect dealing with it at the bytecode level would be easier than source.		Copy link Message link Add to gist Remove
jnthn	Yes, in that the semantics of bytecode are far simpler		Copy link Message link Add to gist Remove
harrisi	it definitely is way out of my league, but I've been interested for awhile. :)	11:39	Copy link Message link Add to gist Remove
	right.		Copy link Message link Add to gist Remove
jnthn	Lunch time for me; bbiab		Copy link Message link Add to gist Remove
11:40 markmont joined
harrisi	jnthn: anyway, thank you, sincerely, for the work you did. if you for some reason you know more resources about this stuff that you could point me to, I'd be forever indebted!	11:40	Copy link Message link Add to gist Remove
	night, have a good lunch!	11:41	Copy link Message link Add to gist Remove
brrt	harrisi: the general rule is that the lower-level the 'language', the simpler the semantics, and the easier the translation	11:46	Copy link Message link Add to gist Remove
	you can translate everything to everything, it is just a lot of work, and you generally lose efficiency of representation		Copy link Message link Add to gist Remove
11:56 AlexDaniel joined, vendethiel joined 11:57 travis-ci joined
travis-ci	MoarVM build failed. Jonathan Worthington 'Make prepargs a pre-deopt-one point.	11:57	Copy link Message link Add to gist Remove
	travis-ci.org/MoarVM/MoarVM/builds/260157705 github.com/MoarVM/MoarVM/compare/6...d9ce1412f4		Copy link Message link Add to gist Remove
11:57 travis-ci left
harrisi	brrt: do you have any resources for that kind of thing for me? :)	12:02	Copy link Message link Add to gist Remove
brrt	no, that's just a general observation… :-)	12:04	Copy link Message link Add to gist Remove
	you might be interested in the graal project		Copy link Message link Add to gist Remove
	which does something similar today		Copy link Message link Add to gist Remove
	well, somewhat similar anyway		Copy link Message link Add to gist Remove
jnthn	harrisi: One thing that occurred to me over lunch about your mention of the python add instruction earlier: on today's JVM it probably best translates into an invokedynamic than then decides what the right thing to do is.	12:12	Copy link Message link Add to gist Remove
	huh, it looks successful then says at the end the build exited with 1? travis-ci.org/MoarVM/MoarVM/jobs/260157712	12:16	Copy link Message link Add to gist Remove
lizmat	jnthn: fwiw, I've seen that a few times in travis reports of rakudo builds as well	12:17	Copy link Message link Add to gist Remove
	not sure what to make of that		Copy link Message link Add to gist Remove
jnthn	Me either		Copy link Message link Add to gist Remove
12:17 AlexDaniel joined 12:41 AlexDaniel joined 12:42 zakharyas joined, AlexDaniel joined 13:01 Geth_ joined 13:08 zakharyas joined
jnthn	eek, language class time...making progress on call opts though:)	13:26	Copy link Message link Add to gist Remove
13:37 brrt joined 13:49 brrt1 joined 14:08 brrt joined 14:41 zakharyas joined 14:50 zakharyas joined
	jnthn back	14:58	Copy link Message link Add to gist Remove
dogbert17	Vítej zp?t	15:00	Copy link Message link Add to gist Remove
jnthn	Diky		Copy link Message link Add to gist Remove
lizmat	proost!		Copy link Message link Add to gist Remove
jnthn	Je mi horko a musim pracovat na deopt a inlining... :P	15:01	Copy link Message link Add to gist Remove
dogbert17	your language classes seems to be working		Copy link Message link Add to gist Remove
jnthn	(It's hot and a have to work on deopt and inlining)		Copy link Message link Add to gist Remove
	dogbert17 had to use google translate myself :)		Copy link Message link Add to gist Remove
jnthn	So it turns out that yes I can sneak extra guards in for args based on observed type tuples	15:02	Copy link Message link Add to gist Remove
	And it'll net us more spesh calls and inlines (good)		Copy link Message link Add to gist Remove
	But sneaking them in without an entry for them in the deopt table goes rotten when we do inlining		Copy link Message link Add to gist Remove
	So I guess I can't cheat quite that hard.		Copy link Message link Add to gist Remove
	The good news is that growing that table is probably completely safe	15:03	Copy link Message link Add to gist Remove
15:12 vendethiel joined 15:18 vendethiel- joined
markmont	timotimo: A quick recap to be sure we're working on the same problem:	15:25	Copy link Message link Add to gist Remove
	On Fedora 26, I cloned MoarVM, updated dyncall to the latest head, added a call to strerror() to src/jit/compile.c, and then ran examples/hello_world.nqp from the NQP repo:		Copy link Message link Add to gist Remove
	sudo setsebool selinuxuser_execstack=off deny_execmem=on		Copy link Message link Add to gist Remove
	env MVM_JIT_LOG=/dev/tty nqp-m hello_world.nqp		Copy link Message link Add to gist Remove
	This resulted in: "Setting jit page executable failed or was denied: Permission denied. Deactivating jit." together with an SELinux denial for execmem.		Copy link Message link Add to gist Remove
	Is this the same problem you've been looking at?		Copy link Message link Add to gist Remove
timotimo	yeah	15:26	Copy link Message link Add to gist Remove
markmont	Excellent. And please let me know if I should move discussion to a mailing list to make it easier to follow. :)		Copy link Message link Add to gist Remove
	For dyncall, the mmap()/mprotect() code is only called by src/core/nativecall_dyncall.c:unmarshal_callback(). This code is not exercised in the hello_world.nqp example, so we don't know if the dyncall code works when execmem is prohibited (I suspect it does not work -- see below).		Copy link Message link Add to gist Remove
	I've looked at the JIT in PCRE2 and the one in Google Chrome V8. I've found that they both require execmem. PCRE2 does not attempt W^X at all, while V8 does use mmap()/mprotect() with the same results we're seeing in MoarVM.		Copy link Message link Add to gist Remove
	I've verified that the mmap()/mprotect() approach will not work with SELinux and deny_execmem. I tested under both Fedora 26 as well as under RHEL 6 (roughly equivalent to Fedora 12). Note that deny_execmem is off by default, so mmap()/mprotect() will work on most Linux machines, including those which disallow executable heaps and executable stacks. I've taken a quick look at the Linux kernel source, but haven't found an obvious place with the EPERM		Copy link Message link Add to gist Remove
	we're getting is being generated; I can look into this more, if desired.		Copy link Message link Add to gist Remove
timotimo	you should be able to "make test" in rakudo to get the nativecall tests running		Copy link Message link Add to gist Remove
15:27 AlexDaniel joined
markmont	Yes, I've done that. But for the specific case above, I'm trying to keep things as low-level as possible.	15:27	Copy link Message link Add to gist Remove
	Here is a test case showing that there is no problem with MoarVM's implementation of the JIT: gist.github.com/markmont/dcd20d632...b3bb3711ec		Copy link Message link Add to gist Remove
	The temporary file approach to W^X does work in all cases, see the test case at gist.github.com/markmont/b5f1ee6b4...3aae96e3b9		Copy link Message link Add to gist Remove
	libffi takes the temporary file approach. It tries a number of different locations for the temporary file, searches for filesystems that are not noexec, and also attempts to create the file using O_TMPFILE so that it never appears in the filesystem and does not need to be unlinked. See github.com/libffi/libffi/blob/6e2e...res.c#L615		Copy link Message link Add to gist Remove
	Based on this, I think we should decide if making MoarVM work on systems that enforce W^X in all situations is something we want to do. I think it is worth doing and am willing to work on this, but a lot of other software projects seem to be fine with the current mmap()/mprotect() approach. If we do want to force this, I think re-implementing the approach that libffi uses (and crediting them for it) would be a good way to proceed. Thoughts? (Whew,	15:28	Copy link Message link Add to gist Remove
	that was a lot of stuff!)		Copy link Message link Add to gist Remove
timotimo	hm, we could potentially expose the more complicated W^X thing through a configure flag and have ifdefs for that and some .c files tat are only in there if that flag is set or something	15:29	Copy link Message link Add to gist Remove
	then we don't have to do the whole searching filesystems song&dance on all platforms	15:30	Copy link Message link Add to gist Remove
	anyway, i've gotta go AFK for a bit before i can continue discussing		Copy link Message link Add to gist Remove
markmont	Sure thing! Thanks for your time on this.		Copy link Message link Add to gist Remove
timotimo	no, thank you!		Copy link Message link Add to gist Remove
markmont	I'll hold off on proceeding until we've discussed it more.	15:31	Copy link Message link Add to gist Remove
Geth	MoarVM: 0f94262d7a \| (Jonathan Worthington)++ \| 2 files Widen scope of deopt annotations function. And make it a tad more generic.	15:39	Copy link Message link Add to gist Remove
15:49 robertle joined
Geth	MoarVM/spesh-invoke: 290e2545be \| (Jonathan Worthington)++ \| 2 files Use callsite info to insert guards before prepargs These will allow us to use the type tuple to both pick a spesh candidate that already exists as well as do multi-dispatch, even in the presence of args passedin Scalar containers. This commit only adds the guards, and doesn't yet make use of them.	16:02	Copy link Message link Add to gist Remove
16:03 brrt joined
jnthn	Phew :)	16:04	Copy link Message link Add to gist Remove
16:07 zakharyas joined
brrt	\o	16:12	Copy link Message link Add to gist Remove
Zoffix	\|	16:13	Copy link Message link Add to gist Remove
	/\		Copy link Message link Add to gist Remove
brrt	i've tracked down the errors in the JIT to the use of too-small variants for the bitmap ints	16:14	Copy link Message link Add to gist Remove
16:15 travis-ci joined
jnthn	haha, spot the bug in the above commit :)	16:15	Copy link Message link Add to gist Remove
travis-ci	MoarVM build errored. Jonathan Worthington 'Widen scope of deopt annotations function.		Copy link Message link Add to gist Remove
	travis-ci.org/MoarVM/MoarVM/builds/260255405 github.com/MoarVM/MoarVM/compare/0...94262d7a29		Copy link Message link Add to gist Remove
16:15 travis-ci left
jnthn	heh, not you travis :P	16:15	Copy link Message link Add to gist Remove
Zoffix	:D	16:16	Copy link Message link Add to gist Remove
brrt	it's not totally obvious initally	16:17	Copy link Message link Add to gist Remove
	markmont: if you do want to do this, please be my guest		Copy link Message link Add to gist Remove
Zoffix	there's too much code for bug spotting :)	16:18	Copy link Message link Add to gist Remove
markmont	brrt: I'd like to look into this more. I'm sure timotimo will have more input when he gets back.	16:19	Copy link Message link Add to gist Remove
jnthn	Turns out I didn't bump the usage count for the register it deconts into, so the guard was against an empty register and so always failed		Copy link Message link Add to gist Remove
brrt	my personal opinion (which is of no importance) is that W^X that can be circumvented by a filesystem hack is security theater	16:20	Copy link Message link Add to gist Remove
	and that people who insist on employing can enjoy the fulll benefits of doing so		Copy link Message link Add to gist Remove
	amongst which there are, no functional JIT		Copy link Message link Add to gist Remove
markmont	Yeah, the Mozilla devs have a bug report on this from back in 2006 that says much the same thing.		Copy link Message link Add to gist Remove
brrt	now, if we were to go so far as make a full ELF and DLL, then load that via dlopen, then use that, well, i'd be more appreciative, cause i can see that having some real benefits	16:22	Copy link Message link Add to gist Remove
markmont	It's not clear to me why SELinux denies the mprotect() call. The description of the deny_execmem boolean says "Deny user domains applications to map a memory region as both executable and writable", but we're not trying to do that.		Copy link Message link Add to gist Remove
16:23 zakharyas joined
brrt	hmm, i think the question is if the currrent sequence of call adequatly makes the page unwritable	16:23	Copy link Message link Add to gist Remove
	and, of course, if the selinux check is correct		Copy link Message link Add to gist Remove
markmont	I agree that that question is still open. I've looked at several other JITs, though, and none of them do any better.	16:25	Copy link Message link Add to gist Remove
	In other words, I'm not seeing anything obvious that is missing in terms of making the page unwritable.		Copy link Message link Add to gist Remove
brrt	anyway. i'm personally not super-fond of 'hacks' like searching for non-noexec filesystems. that feels to me like introducing less security (because the filesystem can be overwritten), not more	16:27	Copy link Message link Add to gist Remove
	but like i said, you're free to try it out		Copy link Message link Add to gist Remove
	and i'll be interested to see what the result will be :-)	16:28	Copy link Message link Add to gist Remove
Geth	MoarVM/spesh-invoke: ddb3e65596 \| (Jonathan Worthington)++ \| src/spesh/optimize.c Make sure to mark decont for guard used. Otherwise, dead instruction elimination throws it out and we guard it against an empty register, which will always fail.	16:29	Copy link Message link Add to gist Remove
	MoarVM/spesh-invoke: 87fb83fc9a \| (Jonathan Worthington)++ \| src/spesh/optimize.c Guards need STable, not type object.		Copy link Message link Add to gist Remove
	MoarVM/spesh-invoke: c47281fb40 \| (Jonathan Worthington)++ \| src/spesh/optimize.c Use type tuple to find spesh candidate if present. If not, we fall back to using the facts inferred using the facts established thus far.	16:30	Copy link Message link Add to gist Remove
markmont	brrt: OK, thanks. I agree with you about hacks leading to less security. I don't know if there is really anything to fix here, but I'd still like to look into exactly why the mprotect() returns with EPERM since we're asking to remove PROT_WRITE and add PROT_EXEC.	16:35	Copy link Message link Add to gist Remove
Geth	MoarVM/even-moar-jit: 8 commits pushed by (Bart Wiegmans)++ - Update plan and todo lists - Add sp_decont op - A bunch of comparison ops - Make jit debug tools use blocking spesh - Document root cause of inc_i/dec_i bugs - Use appropriate-sized values - Add special-case logic for single operand inc_i - Add atpos_i template	16:37	Copy link Message link Add to gist Remove
brrt	yeah, thanks for looking into it	16:38	Copy link Message link Add to gist Remove
	okay, afk		Copy link Message link Add to gist Remove
jnthn	Good news is the above pass spectest :)	16:52	Copy link Message link Add to gist Remove
	Uh, my ones :)	16:53	Copy link Message link Add to gist Remove
	Hopefully brrt++'s too :)		Copy link Message link Add to gist Remove
Geth	MoarVM/spesh-invoke: e7f7a18e2e \| (Jonathan Worthington)++ \| 3 files Enable type tuple use in multi-dispatch. Should get Perl 6 multi-dispatch resolution at spesh time mostly working again.	17:23	Copy link Message link Add to gist Remove
jnthn	Enough!	17:26	Copy link Message link Add to gist Remove
	bbl		Copy link Message link Add to gist Remove
18:36 AlexDaniel joined
nwc10	jnthn: ASAN doesn't find your branch at all interesting	19:27	Copy link Message link Add to gist Remove
jnthn	Nice :)	19:43	Copy link Message link Add to gist Remove
	Hm, I think somewhere there's an RT about a spesh bug involving native args		Copy link Message link Add to gist Remove
	Think I just found what's going on after running into it in some other code	19:44	Copy link Message link Add to gist Remove
	In an inline, this:		Copy link Message link Add to gist Remove
	arg_i liti16(0), r9(5)		Copy link Message link Add to gist Remove
	arg_i liti16(1), r15(1)		Copy link Message link Add to gist Remove
	Is rewritten to:		Copy link Message link Add to gist Remove
	set r30(1), r9(5)		Copy link Message link Add to gist Remove
	set r30(2), r15(1)		Copy link Message link Add to gist Remove
	It puts two different args into the same register o.O		Copy link Message link Add to gist Remove
	ooh, shop closing soon; brb	19:45	Copy link Message link Add to gist Remove
nwc10	.tell brrt ASAN finds even-moar-jit very exciting: paste.scsys.co.uk/564761		Copy link Message link Add to gist Remove
yoleaux	nwc10: I'll pass your message to brrt.		Copy link Message link Add to gist Remove
jnthn	OK, figured out what it is. Sometimes the thing we're inlining grabs the parameters and puts them into different registers	20:17	Copy link Message link Add to gist Remove
	uh, into the same registe		Copy link Message link Add to gist Remove
	*register		Copy link Message link Add to gist Remove
	But does something (like boxing them) inbetween.		Copy link Message link Add to gist Remove
	Which is fine enough		Copy link Message link Add to gist Remove
	Until we inline		Copy link Message link Add to gist Remove
nwc10	.tell brrt first bad commit is 2dbb62f9cade2, "Add sp_decont op" - as the only non comment change in that is to add a template, I guess it reveals a latent bug	20:28	Copy link Message link Add to gist Remove
yoleaux	nwc10: I'll pass your message to brrt.		Copy link Message link Add to gist Remove
Geth	MoarVM: 3f33a8419b \| (Jonathan Worthington)++ \| src/spesh/inline.c When inlining, replace receive instruction. If two parameters are received into the same register (because, for example, they are then boxed and only the target of the box matters), then the arg passing code would be rewritten in such a way that the second argument overwrote the value of the first. This resolves the issue by flipping the rewriting to the receiving side, where this issue cannot happen.	20:38	Copy link Message link Add to gist Remove
20:58 DBeepBeep joined
lizmat	jnthn: reality check: sub a(Any:U $a) { say nqp::isconcrete($a) } # can that ever say 1 ?? Don't think so, right ?	21:00	Copy link Message link Add to gist Remove
jnthn	No	21:02	Copy link Message link Add to gist Remove
	:U is defiend in terms of isconcrete		Copy link Message link Add to gist Remove
lizmat	ok, figured, thanks!		Copy link Message link Add to gist Remove
21:03 https_GK1wmSU joined 21:04 https_GK1wmSU left
lizmat	argh, I thought I could simplify code, but I can't after all... will add comment just someone doesn't step into that trap again later	21:05	Copy link Message link Add to gist Remove
21:26 travis-ci joined
travis-ci	MoarVM build passed. Jonathan Worthington 'When inlining, replace receive instruction.	21:26	Copy link Message link Add to gist Remove
	travis-ci.org/MoarVM/MoarVM/builds/260362737 github.com/MoarVM/MoarVM/compare/0...33a8419bfd		Copy link Message link Add to gist Remove
21:26 travis-ci left
markmont	brrt, timotimo: A W^X update:	21:29	Copy link Message link Add to gist Remove
	SELinux not allowing mprotect() to change from PROT_WRITE to PROT_EXEC turns out to be deliberate in order to avoid exploiting the writable memory with a timing attack in a multithreaded program as described in	21:30	Copy link Message link Add to gist Remove
	www.internetsociety.org/sites/defa...09_2_2.pdf		Copy link Message link Add to gist Remove
	It turns out that the libffi implementation (trying different locations, searching for noexec filesystems, using O_TMPFILE) originated in Mozilla Firefox nanojit,		Copy link Message link Add to gist Remove
timotimo	mhm mhm		Copy link Message link Add to gist Remove
markmont	see bugzilla.mozilla.org/show_bug.cgi?id=506693 (lots of good argument about temporary files there).		Copy link Message link Add to gist Remove
	see bugzilla.mozilla.org/show_bug.cgi?id=506693 (lots of good argument about temporary files this). However, OpenBSD 6.1 will allow it.	21:31	Copy link Message link Add to gist Remove
	Whoops, that didn't come out right, let's try again:		Copy link Message link Add to gist Remove
	NetBSD 8.0_BETA won't allow mprotect() to change from PROT_WRITE to PROT_EXEC by default (tested and verified). But OpenBSD 6.1 will.	21:32	Copy link Message link Add to gist Remove
	Under NetBSD 8.0 it's possible to mark individual executables specially to allow them to use mprotect() this way, overriding the default behavior.	21:33	Copy link Message link Add to gist Remove
timotimo	i'm going to read that paper	21:34	Copy link Message link Add to gist Remove
markmont	So the bottom line is that people on systems that allow it say "use mmap() and mprotect()" while NetBSD people and Linux people who use deny_execmem say "map a temporary file twice".		Copy link Message link Add to gist Remove
MasterDuke	timotimo: do you have a fix for the profiler segvs in the works? or do you need any more info?	21:37	Copy link Message link Add to gist Remove
timotimo	MasterDuke: haven't looked at it yet, tbh. i was only really aware of the "can't call defined on a null method" thing	21:40	Copy link Message link Add to gist Remove
MasterDuke	i don't know if there are multiple reasons		Copy link Message link Add to gist Remove
timotimo	"on a null object"*	21:42	Copy link Message link Add to gist Remove
	that one's a spesh-related problem (i.e. likely optimizer trouble)		Copy link Message link Add to gist Remove
MasterDuke	ah, this line?: STABLE(code)->invoke(tc, code, cur_callsite, args);	21:43	Copy link Message link Add to gist Remove
	that's what valgrind pointed to		Copy link Message link Add to gist Remove
timotimo	markmont: our jit differs from what is described in that paper because we never make pages writable after making them executable	21:45	Copy link Message link Add to gist Remove
markmont	indeed, but it's disallowed anyway. There's some more information about this in the Mozilla bug above.	21:47	Copy link Message link Add to gist Remove
timotimo	OK, i shall read that after i've finished with the paper	21:49	Copy link Message link Add to gist Remove
MasterDuke	jnthn: seems like --profile breaking might be due to your recent spesh work, any ideas?	21:50	Copy link Message link Add to gist Remove
timotimo	markmont: the drawback to what we're doing is that we're wasting at least one whole page for even the smallest pieces of jitted code	21:53	Copy link Message link Add to gist Remove
	so at some point we might want to re-use memory that was once execute-only and add some more jitcode at the end	21:54	Copy link Message link Add to gist Remove
markmont	Calling mprotect() again to switch back should work on platforms that support it, if we stick with the current approach. Or if we go with the map-a-temp-file-twice approach, that can be reused without doing anything (but we'll consume a file descriptor for each one we have).	21:56	Copy link Message link Add to gist Remove
timotimo	ah, yes, we'd reach the max number of open files in no time at all		Copy link Message link Add to gist Remove
markmont	I imagine it might be possible to keep many pieces of jitted code in a single mapped-but-unlinked tempfile. I'd have to try it.	21:58	Copy link Message link Add to gist Remove
	In other words, map per interpreter or once per thread rather than once per piece of jitted code.	21:59	Copy link Message link Add to gist Remove
timotimo	if we have per-thread code pages we'll be regenerating many of the same jitted blobs on all threads	22:01	Copy link Message link Add to gist Remove
markmont	right -- one per interpreter/VM then, or whatever the correct number turns out to be. I'm very new to MoarVM and NQP and not familiar with how they work.	22:04	Copy link Message link Add to gist Remove
timotimo	we don't do any of the jit-related mitigation techniques discussed in that paper	22:07	Copy link Message link Add to gist Remove
	like making the jitted code slightly random and unpredictable		Copy link Message link Add to gist Remove
jnthn	MasterDuke: No immediate guesses, sorry. If nobody else manages to hunt it down, I'll get there, just got quite a queue of spesh things to do.	22:17	Copy link Message link Add to gist Remove
timotimo	at some point hopefully a Real Security Researcher™ will try to attack moar	22:19	Copy link Message link Add to gist Remove
	i imagine if you can reach the nativecall functions, you'll be able to wreak absolute havoc		Copy link Message link Add to gist Remove
	i'm already unreasonably tired ... but maybe i'll go to bed rather soon today	22:21	Copy link Message link Add to gist Remove
22:24 jsimonet joined 22:47 https_GK1wmSU joined 22:49 https_GK1wmSU left 22:53 markmont joined
jnthn	Oh heck. It turns out that we don't get UV_EOF in the socket callback when the socket is closed	23:17	Copy link Message link Add to gist Remove
	So never pass on the done signal		Copy link Message link Add to gist Remove
	And so leak		Copy link Message link Add to gist Remove
23:23 Geth joined
MasterDuke	jnthn: the --profile problem is interp.c:973, but that's because of github.com/MoarVM/MoarVM/blob/mast...me.c#L1764	23:56	Copy link Message link Add to gist Remove
	if i add a `if (!code) fprintf(stderr, "here!\n")` right before then it gets printed	23:57	Copy link Message link Add to gist Remove

Please report any issues / comments / feature requests as an issue on App::Raku::Log.

Thank you!