1
0
Fork 0
mirror of https://github.com/ruby/ruby.git synced 2022-11-09 12:17:21 -05:00
The Ruby Programming Language [mirror]
Find a file
eileencodes d2e8b99b5b Implement tostring instruction for yjit
Co-authored-by: Aaron Patterson <tenderlove@ruby-lang.org>
2021-10-20 18:19:38 -04:00
.github Try running with more YJIT options in CI to surface more bugs 2021-10-20 18:19:36 -04:00
basictest
benchmark Use faster any_hash logic in rb_hash 2021-09-30 13:06:53 -07:00
bin Remove probably unintented symlinks 2021-07-27 18:49:28 +09:00
bootstraptest Save PC and SP before accessing globals 2021-10-20 18:19:38 -04:00
ccan
coroutine Ignore dead threads in coroutine_join. 2021-07-02 12:36:14 +12:00
coverage
cygwin Suppress maybe-uninitialized warning on mingw 2021-10-02 08:02:57 +09:00
defs Fix rubyspec_capiext dependency and flags 2021-08-30 12:03:44 +09:00
doc Rename YJIT readme to README.md 2021-10-20 18:19:34 -04:00
enc ruby tool/update-deps --fix 2021-10-05 14:18:23 +09:00
ext Tie lifetime of uJIT blocks to iseqs 2021-10-20 18:19:29 -04:00
gems Update bundled_gems at 2021-10-20 2021-10-20 07:07:22 +00:00
include Remove duplicate type qualifiers 2021-10-06 14:01:43 +09:00
internal Restore Hash#compare_by_identity mode [Bug #18171] 2021-10-02 11:43:35 +09:00
lib [ruby/racc] Bump up racc version to 1.6.0 2021-10-19 21:13:29 +09:00
libexec [ruby/irb] Move IRB::TOPLEVEL_BINDING from exe/irb to lib/irb/workspace.rb 2021-09-10 06:37:07 +09:00
man [ruby/irb] Add help about extra doc dir option 2021-10-11 21:12:58 +09:00
misc Add an lldb script to print YJIT comments 2021-10-20 18:19:37 -04:00
missing Use C99-defined signbit macro 2021-08-27 12:42:23 +09:00
sample Clean up lambda output from exyacc.rb 2021-08-25 18:23:56 -07:00
spec Update to ruby/spec@254c380 2021-10-20 21:57:05 +02:00
template Remove the scraper 2021-10-20 18:19:38 -04:00
test Implement tostring instruction for yjit 2021-10-20 18:19:38 -04:00
tool Remove the scraper 2021-10-20 18:19:38 -04:00
win32 Remove the scraper 2021-10-20 18:19:38 -04:00
.appveyor.yml Dot-prefix appveyor.yml 2021-09-25 15:44:14 +09:00
.cirrus.yml Set the locale encoding to UTF-8 2021-09-28 00:52:18 +09:00
.dir-locals.el
.document marshal.c Marshal.load accepts a freeze: true option. 2021-10-05 18:34:56 +02:00
.editorconfig
.gdbinit
.gitattributes
.gitignore Remove the scraper 2021-10-20 18:19:38 -04:00
.rspec_parallel
.travis.yml Revert "Shallow clone on TravisCI" 2021-10-13 19:04:46 +09:00
aclocal.m4
addr2line.c Cast to void pointer for %p in commented out code [ci skip] 2021-10-20 11:22:33 +09:00
addr2line.h
array.c Remove repeated 'the' (#4966) 2021-10-13 23:05:44 -07:00
array.rb [DOC] Additional to Array#sample [ci skip] 2021-09-22 14:17:09 +09:00
ast.c ast.c: AST.of against C method should return nil (as Ruby 2.6--3.0) 2021-09-18 21:52:18 +09:00
ast.rb ast.c: Rename "save_script_lines" to "keep_script_lines" 2021-08-20 16:18:36 +09:00
autogen.sh
bignum.c bary_mul_balance_with_mulfunc: move working buffer allocation 2021-10-12 00:45:14 +09:00
BSDL
builtin.c
builtin.h
class.c Use the flag for uninitialized module [Bug #18185] 2021-09-24 08:29:00 +09:00
common.mk Add getglobal to yjit 2021-10-20 18:19:38 -04:00
compar.c Refactor and Using RBOOL macro 2021-09-15 08:11:05 +09:00
compile.c Simpler fix for -DUSE_EMBED_CI=0 2021-10-20 18:19:38 -04:00
complex.c Remove declarations unused since a95262356e 2021-10-18 23:46:58 +09:00
configure.ac conditionally add libcapstone 2021-10-20 18:19:27 -04:00
constant.h
cont.c rb_fiber_raise(): add doxygen 2021-09-30 20:55:23 +13:00
CONTRIBUTING.md [DOC] Fix broken links [ci skip] 2021-09-15 14:16:14 +09:00
COPYING
COPYING.ja
darray.h Malloc branch entries (#112) 2021-10-20 18:19:33 -04:00
debug.c lldb: Show encoding of String [ci skip] 2021-09-28 20:03:54 +09:00
debug_counter.c
debug_counter.h Add a cache for class variables 2021-06-18 10:02:44 -07:00
dir.c Using NIL_P macro instead of == Qnil 2021-10-03 22:34:45 +09:00
dir.rb Links for What's Here sections (#4827) 2021-09-13 10:37:15 -05:00
dln.c dln.c: add missing dependency 2021-09-10 20:00:06 +09:00
dln.h
dln_find.c
dmydln.c
dmyenc.c
dmyext.c
encindex.h
encoding.c include/ruby/encoding.h: convert macros into inline functions 2021-10-05 14:18:23 +09:00
enum.c Accommondate earlier reviews of RDoc for Enumerable (#4943) 2021-10-06 19:06:17 -05:00
enumerator.c Introduce RBIMPL_NONNULL_ARG macro 2021-09-27 14:47:52 +09:00
error.c Using NIL_P macro instead of == Qnil 2021-10-03 22:34:45 +09:00
eval.c Consider modified modules initialized [Bug #18185] 2021-09-24 08:29:00 +09:00
eval_error.c Trim extra whitespace in inaccesible error message 2021-09-01 13:18:55 +09:00
eval_intern.h Remove root_jmpbuf in rb_thread_struct 2021-08-10 19:08:38 +09:00
eval_jump.c
file.c rb_encoding is already const 2021-10-14 10:23:33 +09:00
gc.c Partial revert of ceebc7fc98 2021-10-20 18:19:36 -04:00
gc.h Enable inline assembly of SET_MACHINE_STACK_END on AIX 2021-10-04 23:06:43 -05:00
gc.rb Enhance documentation on GC.stat (#4843) 2021-09-20 14:40:55 -04:00
gem_prelude.rb Silence LoadError only if it is for rubygems itself 2021-08-18 21:37:31 +09:00
golf_prelude.rb
goruby.c goruby.c: include golf_prelude.c to get rid of overwriting EXTOBJS 2021-07-03 22:39:14 +09:00
GPL
hash.c Restore Hash#compare_by_identity mode [Bug #18171] 2021-10-02 11:43:35 +09:00
hrtime.h
id_table.c Qundef is not for ID 2021-10-13 12:07:57 +09:00
id_table.h
inits.c Yet Another Ruby JIT! 2021-10-20 18:19:31 -04:00
insns.def Eliminate some redundant checks on num in newhash 2021-10-18 17:41:38 +09:00
internal.h Using RB_BIGNUM_TYPE_P macro 2021-09-11 09:13:24 +09:00
io.c Reuse ractor stderr 2021-10-07 19:57:09 +09:00
io.rb
iseq.c Clear JIT code when tracepoints get enabled 2021-10-20 18:19:38 -04:00
iseq.h Yet Another Ruby JIT! 2021-10-20 18:19:31 -04:00
kernel.rb
KNOWNBUGS.rb
LEGAL [DOC] Separate paragraph in LEGAL instead of break lines [ci skip] 2021-09-11 19:09:37 +09:00
lex.c.blt
load.c Pass the VM pointer as an argument 2021-10-10 23:34:16 +09:00
loadpath.c
localeinit.c
main.c
marshal.c marshal.c Marshal.load accepts a freeze: true option. 2021-10-05 18:34:56 +02:00
marshal.rb marshal.c Marshal.load accepts a freeze: true option. 2021-10-05 18:34:56 +02:00
math.c Using RB_FLOAT_TYPE_P macro 2021-09-12 11:16:31 +09:00
memory_view.c memory_view.c: Rename private to private_data for C++ (#4812) 2021-09-06 16:01:52 +09:00
method.h
mini_builtin.c
miniinit.c
mjit.c Revert "Pause an MJIT worker when JIT is cancelled" 2021-08-13 09:13:09 -07:00
mjit.h Always use ret to return to the interpreter 2021-10-20 18:19:37 -04:00
mjit_compile.c Extract precompile_inlinable_child_iseq to separate alloca for each iseq 2021-10-12 17:12:01 +09:00
mjit_worker.c
NEWS.md Added entries about default gems and bundled gems 2021-10-20 20:13:09 +09:00
nilclass.rb
node.c Show node IDs in dump 2021-07-12 12:10:16 +09:00
node.h
numeric.c Enhanced RDoc for Numeric (#4991) 2021-10-19 12:00:22 -05:00
numeric.rb Replace RBOOL macro 2021-09-05 23:01:27 +09:00
object.c Fix a typo [Bug #17048] 2021-09-19 22:39:18 +09:00
pack.c Moved exported symbols in internal/util.h to ruby/util.h 2021-08-24 10:37:41 +09:00
pack.rb
parse.y Fix evaluation order of hash values for duplicate keys 2021-10-18 09:09:07 -07:00
prelude.rb
probes.d
probes_helper.h
proc.c Collect statistics about binding allocations / local variable set 2021-10-20 18:19:32 -04:00
process.c Fill pid and error in the status 2021-10-19 16:41:09 +09:00
ractor.c Yet Another Ruby JIT! 2021-10-20 18:19:31 -04:00
ractor.rb [DOC] Fix broken links [ci skip] 2021-09-15 14:16:14 +09:00
ractor_core.h Move rb_ractor_p definition 2021-10-04 08:21:40 +09:00
random.c [Win32] Prefer Cryptography Next Generation API 2021-10-02 21:22:33 +09:00
range.c Unify iteration arguments 2021-10-10 11:45:50 +09:00
rational.c Using RB_BIGNUM_TYPE_P macro 2021-09-11 09:13:24 +09:00
re.c Using NIL_P macro instead of == Qnil 2021-10-03 22:34:45 +09:00
README.EXT
README.EXT.ja
README.ja.md [DOC] Move mailing-list URLs to footnotes [ci skip] 2021-07-11 20:12:45 +09:00
README.md It's RUN_OPTS, not RUNOPTS!!! via @tenderlove 2021-10-20 18:19:37 -04:00
README_ruby.md Rename Ruby README to README_ruby.md 2021-10-20 18:19:33 -04:00
regcomp.c Cast up to get rid of the potential overflow posibility 2021-10-14 18:43:32 +09:00
regenc.c
regenc.h
regerror.c Add printf-style format attribute to oniguruma functions 2021-09-27 19:02:45 +09:00
regexec.c
regint.h Add printf-style format attribute to oniguruma functions 2021-09-27 19:02:45 +09:00
regparse.c Add printf-style format attribute to oniguruma functions 2021-09-27 19:02:45 +09:00
regparse.h
regsyntax.c
ruby-runner.c
ruby.c Try running with more YJIT options in CI to surface more bugs 2021-10-20 18:19:36 -04:00
ruby_assert.h internal/*.h: skip doxygen 2021-09-10 20:00:06 +09:00
ruby_atomic.h
rubystub.c
scheduler.c Add gvl and fiber assertions to scheduler interface to catch invalid usage. 2021-09-20 23:16:45 +12:00
signal.c Use free instead of xfree to free altstack 2021-09-06 14:22:24 +09:00
siphash.c
siphash.h
sparc.c
sprintf.c Use C99-defined macros to classify a floating-point number 2021-08-27 12:41:30 +09:00
st.c
strftime.c Using RB_BIGNUM_TYPE_P macro 2021-09-11 09:13:24 +09:00
string.c Update documentation for String and Symbol to discuss differences 2021-10-15 13:54:03 -07:00
struct.c Using NIL_P macro instead of == Qnil 2021-10-03 22:34:45 +09:00
symbol.c Fix STATIC_SYM2ID for large ID on IL32LLP64 platforms 2021-10-14 01:11:31 +09:00
symbol.h
test_asm.sh Create GitHub workflow to run ASM tests (#8) 2021-10-20 18:19:34 -04:00
thread.c Using NIL_P macro instead of == Qnil 2021-10-03 22:34:45 +09:00
thread_pthread.c include/ruby/internal/interpreter.h: add doxygen 2021-09-10 20:00:06 +09:00
thread_pthread.h
thread_sync.c Add WB_PROTECTED to mutexes 2021-09-17 10:03:39 +09:00
thread_win32.c Function w32_error does not return 2021-10-04 08:15:52 +09:00
thread_win32.h Get rid of duplicate typedef 2021-09-03 22:47:58 +09:00
time.c Prefer the reentrant versions of gmtime and localtime 2021-10-14 23:44:15 +09:00
timev.h
timev.rb Links for What's Here sections (#4827) 2021-09-13 10:37:15 -05:00
trace_point.rb Tweak description of TracePoint :line event 2021-06-19 13:55:28 +12:00
transcode.c Using NIL_P macro instead of == Qnil 2021-10-03 22:34:45 +09:00
transcode_data.h
transient_heap.c Supress warning: data argument not used by format string [-Wformat-extra-args] 2021-10-20 07:48:30 +09:00
transient_heap.h
util.c include/ruby/internal/ctype.h: add doxygen 2021-09-10 20:00:06 +09:00
variable.c Remove autoload for constant if the autoload fails 2021-10-08 14:54:26 -07:00
variable.h internal/*.h: skip doxygen 2021-09-10 20:00:06 +09:00
version.c Yet Another Ruby JIT! 2021-10-20 18:19:31 -04:00
version.h * 2021-10-21 [ci skip] 2021-10-21 04:43:39 +09:00
vm.c Yet Another Ruby JIT! 2021-10-20 18:19:31 -04:00
vm_args.c Using NIL_P macro instead of == Qnil 2021-10-03 22:34:45 +09:00
vm_backtrace.c Using NIL_P macro instead of == Qnil 2021-10-03 22:34:45 +09:00
vm_callinfo.h Partial revert of ceebc7fc98 2021-10-20 18:19:36 -04:00
vm_core.h YJIT: Fancier opt_getinlinecache 2021-10-20 18:19:33 -04:00
vm_debug.h Use RUBY_FUNCTION_NAME_STRING for old Visual C++ 2021-10-20 11:22:33 +09:00
vm_dump.c Shorten register names on arm64-darwin not to be truncated 2021-10-10 21:58:56 +09:00
vm_eval.c Introduce rb_vm_call_with_refinements to DRY up a few calls 2021-10-01 08:12:46 -09:00
vm_exec.c Try to convince get better code out of Clang 2021-10-20 18:19:22 -04:00
vm_exec.h Remove the scraper 2021-10-20 18:19:38 -04:00
vm_insnhelper.c Implement splatarray 2021-10-20 18:19:37 -04:00
vm_insnhelper.h Yet Another Ruby JIT! 2021-10-20 18:19:31 -04:00
vm_method.c Partial revert of ceebc7fc98 2021-10-20 18:19:36 -04:00
vm_opts.h oops Direct threading should be default 2021-10-20 18:19:25 -04:00
vm_sync.c Supress warning: data argument not used by format string [-Wformat-extra-args] 2021-10-20 07:48:30 +09:00
vm_sync.h
vm_trace.c Refactor rb_add_event_hook function 2021-09-29 09:16:53 +13:00
vsnprintf.c
warning.rb
yjit.h Make sure we can still compile with the JIT disabled 2021-10-20 18:19:38 -04:00
yjit.rb Add YJIT.enabled? 2021-10-20 18:19:37 -04:00
yjit_asm.c First pass at code page GC object. 2021-10-20 18:19:37 -04:00
yjit_asm.h Add (void) for no arg functions 2021-10-20 18:19:37 -04:00
yjit_asm_tests.c Add some YJIT runtime tests 2021-10-20 18:19:38 -04:00
yjit_codegen.c Implement tostring instruction for yjit 2021-10-20 18:19:38 -04:00
yjit_codegen.h Add a guard that we start executing on the first PC 2021-10-20 18:19:37 -04:00
yjit_core.c Allow upgrading first N types when stack is large 2021-10-20 18:19:38 -04:00
yjit_core.h Introduce ctx_{get,set}_opnd_mapping 2021-10-20 18:19:38 -04:00
yjit_iface.c If codeblock is NULL because YJIT is disabled, YJIT.runtime_stats should return Qnil 2021-10-20 18:19:38 -04:00
yjit_iface.h Remove the scraper 2021-10-20 18:19:38 -04:00
yjit_utils.c fix alignment 2021-10-20 18:19:37 -04:00
yjit_utils.h Yet Another Ruby JIT! 2021-10-20 18:19:31 -04:00

YJIT - Yet Another Ruby JIT

DISCLAIMER: Please note that this project is in early stages of development. It is very much a work in progress, it may cause your software to crash, and current performance results are likely to leave you feeling underwhelmed.

YJIT is a lightweight, minimalistic Ruby JIT built inside the CRuby/MRI binary. It lazily compiles code using a Basic Block Versioning (BBV) architecture. The target use case is that of servers running Ruby on Rails, an area where CRuby's MJIT has not yet managed to deliver speedups. To simplify development, we currently support only macOS and Linux on x86-64, but an ARM64 backend is part of future plans. This project is open source and falls under the same license as CRuby.

If you wish to learn more about the approach taken, here are some written resources and conference talks:

To cite this repository in your publications, please use this bibtex snippet:

@misc{yjit_ruby_jit,
  author = {Chevalier-Boisvert, Maxime and Wu, Alan and Patterson, Aaron},
  title = {YJIT - Yet Another Ruby JIT},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/Shopify/ruby/tree/yjit}},
}

Current Limitations

YJIT is a work in progress and as such may not yet be mature enough for mission-critical software. Below is a list of known limitations, all of which we plan to eventually address:

  • No support for the TracePoint API (see #54).
  • No garbage collection for generated code.

Because there is no GC for generated code yet, your software could run out of executable memory if it is large enough. You can change how much executable memory is allocated using YJIT's command-line options.

Installation

Start by cloning the yjit branch of the Shopify/ruby repository:

git clone https://github.com/Shopify/ruby.git yjit
cd yjit

The YJIT ruby binary can be built with either GCC or Clang. We recommend enabling debug symbols so that assertions are enabled during development as this makes debugging easier. Enabling debug mode will also make it possible for you to disassemble code generated by YJIT, and get access to stat counters. More detailed build instructions are provided in the Ruby README.

To support disassembly of the generated code, libcapstone is also required (brew install capstone on MacOS, sudo apt-get install -y libcapstone-dev on Ubuntu/Debian).

./autogen.sh
./configure cppflags=-DRUBY_DEBUG --prefix=$HOME/.rubies/ruby-yjit
make -j16 install

You can test that YJIT works correctly by running:

# Quick tests found in /bootstraptest
make btest

# Complete set of tests
make -j16 test-all

Usage

Examples

Once YJIT is built, you can either use ./miniruby from within your build directory, or switch to the YJIT version of ruby by using the chruby tool:

chruby ruby-yjit
ruby myscript.rb

You can dump statistics about compilation and execution by running YJIT with the --yjit-stats command-line option:

./miniruby --yjit-stats myscript.rb

The machine code generated for a given method can be printed by adding puts YJIT.disasm(method(:method_name)) to a Ruby script. Note that no code will be generated if the method is not compiled.

Command-Line Options

YJIT supports all command-line options supported by upstream CRuby, but also adds a few YJIT-specific options:

  • --disable-yjit: turn off YJIT (enabled by default)
  • --yjit-stats: produce statistics after the execution of a program (must compile with cppflags=-DRUBY_DEBUG to use this)
  • --yjit-exec-mem-size=N: size of the executable memory block to allocate (default 256 MiB)
  • --yjit-call-threshold=N: number of calls after which YJIT begins to compile a function (default 2)
  • --yjit-max-versions=N: maximum number of versions to generate per basic block (default 4)
  • --yjit-greedy-versioning: greedy versioning mode (disabled by default, may increase code size)

Benchmarking

We have collected a set of benchmarks and implemented a simple benchmarking harness in the yjit-bench repository. This benchmarking harness is designed to disable CPU frequency scaling, set process affinity and disable address space randomization so that the variance between benchmarking runs will be as small as possible. Please kindly note that we are at an early stage in this project.

Performance Tips

This section contains tips on writing Ruby code that will run as fast as possible on YJIT. Some of this advice is based on current limitations of YJIT, while other advice is broadly applicable. It probably won't be practical to apply these tips everywhere in your codebase, but you can profile your code using a tool such as stackprof and refactor the specific methods that make up the largest fractions of the execution time.

  • Use exceptions for error recovery only, not as part of normal control-flow
  • Avoid redefining basic integer operations (i.e. +, -, <, >, etc.)
  • Avoid redefining the meaning of nil, equality, etc.
  • Avoid allocating objects in the hot parts of your code
  • Use while loops if you can, instead of integer.times
  • Minimize layers of indirection
    • Avoid classes that wrap objects if you can
    • Avoid methods that just call another method, trivial one liner methods
  • CRuby method calls are costly. Favor larger methods over smaller methods.
  • Try to write code so that the same variables always have the same type

You can also compile YJIT in debug mode and use the --yjit-stats command-line option to see which bytecodes cause YJIT to exit, and refactor your code to avoid using these instructions in the hottest methods of your code.

Contributing

We welcome open source contributors. You should feel free to open new issues to report bugs or just to ask questions. Suggestions on how to make this readme file more helpful for new contributors are most welcome.

Bug fixes and bug reports are very valuable to us. If you find a bug in YJIT, it's very possible be that nobody has reported it before, or that we don't have a good reproduction for it, so please open an issue and provide as much information as you can about your configuration and a description of how you encountered the problem. List the commands you used to run YJIT so that we can easily reproduce the issue on our end and investigate it. If you are able to produce a small program reproducing the error to help us track it down, that is very much appreciated as well.

If you would like to contribute a large patch to YJIT, we suggest opening an issue or a discussion on this repository so that we can have an active discussion. A common problem is that sometimes people submit large pull requests to open source projects without prior communication, and we have to reject them because the work they implemented does not fit within the design of the project. We want to save you time and frustration, so please reach out and we can have a productive discussion as to how you can contribute things we will want to merge into YJIT.

Source Code Organization

The YJIT source code is divided between:

  • yjit_asm.c: x86 in-memory assembler we use to generate machine code
  • yjit_asm_tests.c: tests for the in-memory assembler
  • yjit_codegen.c: logic for translating Ruby bytecode to machine code
  • yjit_core.c: basic block versioning logic, core structure of YJIT
  • yjit_iface.c: code YJIT uses to interface with the rest of CRuby
  • yjit.h: C definitions YJIT exposes to the rest of the CRuby
  • yjit.rb: YJIT Ruby module that is exposed to Ruby
  • test_asm.sh: script to compile and run the in-memory assembler tests
  • tool/ruby_vm/views/vm.inc.erb: template instruction handler used to hook into the interpreter

The core of CRuby's interpreter logic is found in:

  • insns.def: defines Ruby's bytecode instructions (gets compiled into vm.inc)
  • vm_insnshelper.c: logic used by Ruby's bytecode instructions
  • vm_exec.c: Ruby interpreter loop

Coding & Debugging Protips

There are 3 test suites:

  • make btest (see /bootstraptest)
  • make test-all
  • make test-spec
  • make check runs all of the above

The tests can be run in parallel like this:

make -j16 test-all RUN_OPTS="--yjit-call-threshold=1"

Or single-threaded like this, to more easily identify which specific test is failing:

make test-all TESTOPTS=--verbose RUN_OPTS="--yjit-call-threshold=1"

To debug a single test in test-all:

make test-all TESTS='test/-ext-/marshal/test_usrmarshal.rb' RUNRUBYOPT=--debugger=lldb RUN_OPTS="--yjit-call-threshold=1"

You can also run one specific test in btest:

make btest BTESTS=bootstraptest/test_ractor.rb RUN_OPTS="--yjit-call-threshold=1"

There are shortcuts to run/debug your own test/repro in test.rb:

make run  # runs ./miniruby test.rb
make lldb # launches ./miniruby test.rb in lldb

You can use the Intel syntax for disassembly in LLDB, keeping it consistent with YJIT's disassembly:

echo "settings set target.x86-disassembly-flavor intel" >> ~/.lldbinit