[urbit] Just-in-Time Compiler for u3

Discussion:

Paul Driver

2016-05-30 03:25:11 UTC

I've used libjit to implement a jit for u3. It compiles the nock for
fast-hinted core formulas that don't have registered jets.

https://github.com/frodwith/urbit/tree/jit

Feedback and performance testing is welcome. Hopefully this is useful
enough to get integrated into master.

I did have to change the road structure, so this is probably a breaching
change(?), unless someone knows how to use those mysterious future-proof
fields to make it not...

--
You received this message because you are subscribed to the Google Groups "urbit" group.
To unsubscribe from this group and stop receiving emails from it, send an email to urbit-dev+***@googlegroups.com.
To post to this group, send email to urbit-***@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Curtis Yarvin

2016-05-30 03:45:28 UTC

Permalink

I have no comment, I just like saying "holy kamoly!" I feel ambushed!
In a good way! I mean, there are pull requests, and there are pull
requests...

What's the performance like? Or rather, since this is a hard
question, what are seat-of-pants numbers? Also, have you turned the
garbage collector on to do leak testing?

Post by Paul Driver
I've used libjit to implement a jit for u3. It compiles the nock for
fast-hinted core formulas that don't have registered jets.
https://github.com/frodwith/urbit/tree/jit
Feedback and performance testing is welcome. Hopefully this is useful enough
to get integrated into master.
I did have to change the road structure, so this is probably a breaching
change(?), unless someone knows how to use those mysterious future-proof
fields to make it not...
--
You received this message because you are subscribed to the Google Groups "urbit" group.
To unsubscribe from this group and stop receiving emails from it, send an
For more options, visit https://groups.google.com/d/optout.

Paul Driver

2016-05-30 05:48:57 UTC

Permalink

I don't know! It computes nock correctly. A fakezod comes up and things
work at the dojo. These things I know :)

I don't know how the profiler or memory leak tester work (yet). We can talk
about it over the next couple of days. Certain things are bound to be
faster, and maybe we can find some easy performance gains. Up to this point
I've been focused on getting it to *work*.

I want to discuss the architecture, too, but it's nice to have some running
code to talk about. So this is definitely *not* a pull request yet. But
look, it runs!

Sorry about the ambush. I've mentioned some things cryptically in :talk,
but, you guys have been focused on other important things lately. Can't
wait to see the video from lamdaconf!

Post by Curtis Yarvin
I have no comment, I just like saying "holy kamoly!" I feel ambushed!
In a good way! I mean, there are pull requests, and there are pull
requests...
What's the performance like? Or rather, since this is a hard
question, what are seat-of-pants numbers? Also, have you turned the
garbage collector on to do leak testing?

enough

Post by Paul Driver
to get integrated into master.
I did have to change the road structure, so this is probably a breaching
change(?), unless someone knows how to use those mysterious future-proof
fields to make it not...
--
You received this message because you are subscribed to the Google

Groups

Post by Paul Driver
"urbit" group.
To unsubscribe from this group and stop receiving emails from it, send

an
<javascript:>.

Post by Paul Driver
For more options, visit https://groups.google.com/d/optout.

Curtis Yarvin

2016-05-30 17:53:45 UTC

Permalink

A profiler (I almost wrote "brofiler," which is what a brogrammer uses
to optimize his brograms) is not in my opinion to be confused with a
benchmark. All benchmarks are lies, but I'd just write some simple
algorithm and time it for a seat-of-the-pants estimate of how big the
win is -- in orders of magnitude, really. Surely already you have
some code that's getting jitted, or you would have spent longer
plotting your ambush!

Our profiler, which really is rather good (just run with -P), will be
used to find the inner loops to target. That's normally automatic in
a JIT, of course, but for us it's manual. Which I for one much
prefer.

To activate the garbage collector, compile with U3_MEMORY_DEBUG and
U3_CELLOC_TOGGLE in include/noun/allocate.h, and run urbit -g. This
will run a tracing garbage collection after every event. The only use
for this is debugging (and possibly it should run occasionally at
runtime just in case).

I don't know! It computes nock correctly. A fakezod comes up and things work
at the dojo. These things I know :)
I don't know how the profiler or memory leak tester work (yet). We can talk
about it over the next couple of days. Certain things are bound to be
faster, and maybe we can find some easy performance gains. Up to this point
I've been focused on getting it to *work*.
I want to discuss the architecture, too, but it's nice to have some running
code to talk about. So this is definitely *not* a pull request yet. But
look, it runs!
Sorry about the ambush. I've mentioned some things cryptically in :talk,
but, you guys have been focused on other important things lately. Can't wait
to see the video from lamdaconf!

Post by Paul Driver
I've used libjit to implement a jit for u3. It compiles the nock for
fast-hinted core formulas that don't have registered jets.
https://github.com/frodwith/urbit/tree/jit
Feedback and performance testing is welcome. Hopefully this is useful enough
to get integrated into master.
I did have to change the road structure, so this is probably a breaching
change(?), unless someone knows how to use those mysterious future-proof
fields to make it not...
--
You received this message because you are subscribed to the Google
Groups
"urbit" group.
To unsubscribe from this group and stop receiving emails from it, send an
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "urbit" group.
To unsubscribe from this group and stop receiving emails from it, send an
For more options, visit https://groups.google.com/d/optout.

Paul Driver

2016-05-30 18:19:23 UTC

Permalink

Really, no timing estimates yet. I am actually trying *not* to ambush, you
see. Code bombs are a headache. Also it would be nice to have some code
review in case I'm doing something boneheaded. I've never written a
compiler before, and I'm only *starting* to get a good handle on u3
internals.

I'll play with the profiler and memory debugger soon and get back to you.
Yes, the benchmark is separate, but related, in that there is probably some
opportunity for performance enhancements that I can find with the profiler.
Profilers often tell you interesting things.

Do you have any suggestions though for some code to benchmark?

Post by Curtis Yarvin
A profiler (I almost wrote "brofiler," which is what a brogrammer uses
to optimize his brograms) is not in my opinion to be confused with a
benchmark. All benchmarks are lies, but I'd just write some simple
algorithm and time it for a seat-of-the-pants estimate of how big the
win is -- in orders of magnitude, really. Surely already you have
some code that's getting jitted, or you would have spent longer
plotting your ambush!
Our profiler, which really is rather good (just run with -P), will be
used to find the inner loops to target. That's normally automatic in
a JIT, of course, but for us it's manual. Which I for one much
prefer.
To activate the garbage collector, compile with U3_MEMORY_DEBUG and
U3_CELLOC_TOGGLE in include/noun/allocate.h, and run urbit -g. This
will run a tracing garbage collection after every event. The only use
for this is debugging (and possibly it should run occasionally at
runtime just in case).

Post by Paul Driver
I don't know! It computes nock correctly. A fakezod comes up and things

work

Post by Paul Driver
at the dojo. These things I know :)
I don't know how the profiler or memory leak tester work (yet). We can

talk

Post by Paul Driver
about it over the next couple of days. Certain things are bound to be
faster, and maybe we can find some easy performance gains. Up to this

point

Post by Paul Driver
I've been focused on getting it to *work*.
I want to discuss the architecture, too, but it's nice to have some

running

Post by Paul Driver
code to talk about. So this is definitely *not* a pull request yet. But
look, it runs!
Sorry about the ambush. I've mentioned some things cryptically in :talk,
but, you guys have been focused on other important things lately. Can't

wait

Post by Paul Driver
to see the video from lamdaconf!

breaching