1
0
Fork 0
mirror of https://github.com/ruby/ruby.git synced 2022-11-09 12:17:21 -05:00
The Ruby Programming Language [mirror]
Find a file
Aaron Patterson e4e054e3ce Speed up setting the backref match object
This patch speeds up setting the backref match object by avoiding some
memcopies.  Take the following code for example:

```ruby
"hello world" =~ /hello/
p $~
```

When the RE matches the string, we have to set the Match object in the
backref global.  So we would allocate a match object[^1] and use
`rb_reg_region_copy`[^2] to make a deep copy of the stack allocated
`re_registers` struct[^3] in to the newly created Ruby object.  This
could possibly trigger GC[^4], and would allocate new memory.

This patch makes a shallow copy of the `re_registers` struct on to the
Match object allowing the match object to manage the `re_registers`
pointer and also avoiding some calls to `xmalloc` and some manual
memcopy.

Benchmark looks like this:

```ruby

require "benchmark/ips"

def test_re thing
  thing =~ /hello/
end

Benchmark.ips do |x|
  x.report("re hit") do
    test_re "hello world"
  end

  x.report("re miss") do
    test_re "world"
  end
end
```

Before this patch:

```
$ ruby -v test.rb
ruby 3.2.0dev (2022-07-27T22:29:00Z master 4ad69899b7) [arm64-darwin21]
Ignoring bcrypt-3.1.16 because its extensions are not built. Try: gem pristine bcrypt --version 3.1.16
Warming up --------------------------------------
              re hit   345.401k i/100ms
             re miss   673.584k i/100ms
Calculating -------------------------------------
              re hit      3.452M (± 0.5%) i/s -     17.270M in   5.002535s
             re miss      6.736M (± 0.4%) i/s -     34.353M in   5.099593s
```

After this patch:

```
$ ./ruby -v test.rb
ruby 3.2.0dev (2022-08-01T21:24:12Z less-memcpy 0ff2a56606) [arm64-darwin21]
Warming up --------------------------------------
              re hit   419.578k i/100ms
             re miss   673.251k i/100ms
Calculating -------------------------------------
              re hit      4.201M (± 0.7%) i/s -     21.398M in   5.093593s
             re miss      6.716M (± 0.4%) i/s -     33.663M in   5.012756s
```

Matches get faster and misses maintain the same speed

[^1]: 24204d54ab/re.c (L1737)
[^2]: 24204d54ab/re.c (L1738)
[^3]: 24204d54ab/re.c (L1686)
[^4]: 24204d54ab/re.c (L981)
2022-08-02 09:04:04 -07:00
.github Revert "Try reproducing the MinGW hang on time command (#6168)" 2022-07-28 16:12:46 -07:00
basictest
benchmark rb_str_buf_append: add a fast path for ENC_CODERANGE_VALID 2022-07-25 14:18:52 +02:00
bin [rubygems/rubygems] rubygems.rb is required by gem_runner.rb 2022-07-22 16:24:29 +09:00
bootstraptest YJIT: Teach getblockparamproxy to handle the no-block case without exiting (#6191) 2022-07-28 11:38:07 -04:00
ccan Prefix ccan headers (#4568) 2022-03-30 20:36:31 +13:00
coroutine Add support for address sanitizer for amd64 and arm64. 2022-05-25 15:24:24 +12:00
coverage
cygwin Suppress msys2 pathname conversion also at single test runs [ci skip] 2022-07-06 00:22:32 +09:00
defs Gem.unpack extracts gems so able to execute 2022-07-17 19:57:48 +09:00
doc [DOC] Specify ways to run bootstrap tests 2022-08-02 09:40:53 -04:00
enc Rename ENCINDEX_ASCII to ENCINDEX_ASCII_8BIT 2022-07-19 08:48:56 +02:00
ext respect current frame of rb_eval_string 2022-08-01 17:48:05 +09:00
gems Try the tag without "v" prefix to checkout upstream repositories 2022-07-26 21:12:58 +09:00
include Adjust styles [ci skip] 2022-07-27 18:42:27 +09:00
internal Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
lib [rubygems/rubygems] Array is already uniq, no need to deduplicate it 2022-08-02 21:57:52 +09:00
libexec Merge rubygems master 1e4eda741d732ca1bd7031aef0a16c7348adf7a5 2022-04-28 19:08:49 +09:00
man [ci skip] Improve man page docs around --dump options 2022-06-28 10:10:26 -04:00
misc Get the insns_address_table from the vm_exec_core module table... 2022-07-14 08:25:37 -07:00
missing Adjust styles [ci skip] 2022-07-27 18:42:27 +09:00
sample Fix typo in README (#5925) 2022-05-20 14:45:46 -07:00
spec Implement Queue#pop(timeout: sec) 2022-08-02 11:04:28 +02:00
template Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
test Implement Queue#pop(timeout: sec) 2022-08-02 11:04:28 +02:00
tool Keep gitignore for libyaml source with psych 2022-07-29 19:10:10 +09:00
wasm [wasm] get rid of workaround use of older binaryen and update to latest 2022-07-06 11:59:38 +09:00
win32 Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
yjit Add --enable-yjit=dev_nodebug configure option 2022-07-29 16:32:14 -07:00
.appveyor.yml Skip CIs if the head commit message contains '[DOC]' 2022-06-19 11:05:31 +09:00
.cirrus.yml [DOC] Fix ghcr link 2022-03-31 08:35:39 +09:00
.dir-locals.el
.document Implement Queue#pop(timeout: sec) 2022-08-02 11:04:28 +02:00
.editorconfig
.gdbinit
.git-blame-ignore-revs Update .git-blame-ignore-revs [ci skip] 2022-07-30 17:32:57 +09:00
.gitattributes
.gitignore Ignore rubyspec_temp fot Git 2022-05-09 07:29:37 +09:00
.indent.pro Update .indent.pro [ci skip] 2022-07-22 21:59:58 +09:00
.rdoc_options
.rspec_parallel
.travis.yml Skip CIs if the head commit message contains '[DOC]' 2022-06-19 11:05:31 +09:00
aclocal.m4
addr2line.c Fix warnings by old gcc 2022-06-23 22:52:45 +09:00
addr2line.h
array.c Make array slices views rather than copies 2022-07-28 10:02:12 -04:00
array.rb
ast.c Add ISEQ_BODY macro 2022-03-24 10:03:51 -04:00
ast.rb
autogen.sh
bignum.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
BSDL
builtin.c
builtin.h Typedef built-in function types 2022-06-02 16:05:35 +09:00
class.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
common.mk Implement Queue#pop(timeout: sec) 2022-08-02 11:04:28 +02:00
compar.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
compile.c Adjust styles [ci skip] 2022-07-27 18:42:27 +09:00
complex.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
configure.ac Add --enable-yjit=dev_nodebug configure option 2022-07-29 16:32:14 -07:00
constant.h
cont.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
CONTRIBUTING.md Improve documentation on contributing to Ruby 2022-05-11 10:59:24 -04:00
COPYING
COPYING.ja
darray.h Remove _with_gc functions in darray 2022-05-03 09:07:39 -04:00
debug.c Adjust styles [ci skip] 2022-07-27 18:42:27 +09:00
debug_counter.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
debug_counter.h
dir.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
dir.rb fix typo in dir documentation (#6002) 2022-06-10 22:22:16 -07:00
dln.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
dln.h
dln_find.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
dmydln.c
dmyenc.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
dmyext.c
encindex.h Rename ENCINDEX_ASCII to ENCINDEX_ASCII_8BIT 2022-07-19 08:48:56 +02:00
encoding.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
enum.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
enumerator.c Implement Enumerator::Product and Enumerator.product [Feature #18685] 2022-07-30 20:05:14 +09:00
error.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
eval.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
eval_error.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
eval_intern.h Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
eval_jump.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
file.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
gc.c Lock the VM for rb_gc_writebarrier_unprotect 2022-07-28 10:02:12 -04:00
gc.h Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
gc.rb Add expand_heap option to GC.verify_compaction_references 2022-07-11 09:00:03 -04:00
gem_prelude.rb
golf_prelude.rb
goruby.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
GPL
hash.c [Bug #17767] Now ENV.clone raises TypeError as well as ENV.dup 2022-08-02 16:40:12 +09:00
hrtime.h Implement Queue#pop(timeout: sec) 2022-08-02 11:04:28 +02:00
id_table.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
id_table.h
inits.c Implement Queue#pop(timeout: sec) 2022-08-02 11:04:28 +02:00
insns.def Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
internal.h Fix macro redefinition warning for MacOS 2022-07-08 01:07:19 +09:00
io.c [DOC] Cross references for ARGF 2022-07-28 09:02:23 +09:00
io.rb
io_buffer.c Adjust styles [ci skip] 2022-07-27 18:42:27 +09:00
iseq.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
iseq.h Add "rb_" prefixes to toplevel enum definitions 2022-07-22 23:10:24 +09:00
kernel.rb
KNOWNBUGS.rb
LEGAL
lex.c.blt
load.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
loadpath.c
localeinit.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
main.c Check only whether RUBY_DEVEL is defined 2022-07-12 17:13:57 +09:00
marshal.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
marshal.rb
math.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
memory_view.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
method.h Allow method caching of protected FCALLs 2022-06-21 18:33:51 -07:00
mini_builtin.c
miniinit.c
mjit.c Adjust styles [ci skip] 2022-07-27 18:42:27 +09:00
mjit.h Remove MJIT worker thread (#6006) 2022-06-15 09:40:54 -07:00
mjit.rb Move RubyVM::MJIT to builtin Ruby 2022-06-15 10:52:37 -07:00
mjit_compile.c Implement Objects on VWA 2022-07-15 09:21:07 -04:00
mjit_unit.h MJIT: Share rb_mjit_unit through mjit_unit.h 2022-07-14 22:54:20 -07:00
NEWS.md Fix a link [ci skip] 2022-08-01 12:34:03 +09:00
nilclass.rb
node.c Initialize node_id 2022-08-01 10:36:36 +09:00
node.h Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
numeric.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
numeric.rb
object.c Adjust styles [ci skip] 2022-07-27 18:42:27 +09:00
pack.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
pack.rb [DOC] Repair format and links in What's Here sections (#5711) 2022-03-25 10:52:06 -05:00
parse.y Fix some UBSAN false positives (#6115) 2022-07-12 11:48:10 -07:00
prelude.rb
probes.d
probes_helper.h Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
proc.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
process.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
ractor.c Add "rb_" prefixes to toplevel enum definitions 2022-07-22 23:10:24 +09:00
ractor.rb Fix conversion of rb_ractor_id() 2022-07-28 23:46:07 +09:00
ractor_core.h Fix format-pedantic warnings 2022-07-28 23:46:07 +09:00
random.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
range.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
rational.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
re.c Speed up setting the backref match object 2022-08-02 09:04:04 -07:00
README.EXT
README.EXT.ja
README.ja.md
README.md Update "Reporting Issues" link in the README 2022-06-08 17:49:56 +09:00
regcomp.c Just free compiled pattern if no space is used 2022-04-12 20:24:14 +09:00
regenc.c
regenc.h
regerror.c
regexec.c re.c: Add Regexp.timeout= and Regexp.timeout 2022-03-30 16:50:46 +09:00
regint.h re.c: Add Regexp.timeout= and Regexp.timeout 2022-03-30 16:50:46 +09:00
regparse.c Fix some UBSAN false positives (#6115) 2022-07-12 11:48:10 -07:00
regparse.h
regsyntax.c
ruby-runner.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
ruby.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
ruby_assert.h
ruby_atomic.h
rubystub.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
scheduler.c
signal.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
siphash.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
siphash.h
sparc.c [DOC]Some link prefix replace 2022-04-09 17:43:46 +09:00
sprintf.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
st.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
strftime.c
string.c Adjust indent [ci skip] 2022-07-26 18:33:21 +09:00
string.rb [DOC] Fix markup for String (#5984) 2022-06-09 13:40:21 -05:00
struct.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
symbol.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
symbol.h Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
thread.c Implement Queue#pop(timeout: sec) 2022-08-02 11:04:28 +02:00
thread_none.c GVL Instrumentation: remove the EXITED count assertion 2022-07-13 19:39:31 +02:00
thread_none.h introduce struct rb_native_thread 2022-04-23 03:08:27 +09:00
thread_pthread.c Adjust styles [ci skip] 2022-07-27 18:42:27 +09:00
thread_pthread.h altstack is native thread's attr 2022-05-24 17:50:49 +09:00
thread_sync.c Implement Queue#pop(timeout: sec) 2022-08-02 11:04:28 +02:00
thread_sync.rb Implement Queue#pop(timeout: sec) 2022-08-02 11:04:28 +02:00
thread_win32.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
thread_win32.h native_tls_get()' should not check results 2022-05-24 10:06:51 +09:00
time.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
timev.h
timev.rb
trace_point.rb Fix comment 2022-03-29 18:14:33 -07:00
transcode.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
transcode_data.h Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
transient_heap.c Refactor macros of array.c 2022-07-21 09:02:45 -04:00
transient_heap.h
util.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
variable.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
variable.h
version.c Include JIT information in crash reports 2022-06-20 17:18:29 -04:00
version.h * 2022-08-02 [ci skip] 2022-08-02 02:38:08 +09:00
vm.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
vm_args.c Rename rb_ary_tmp_new to rb_ary_hidden_new 2022-07-26 09:12:09 -04:00
vm_backtrace.c Fix rb_profile_frames output includes dummy main thread frame 2022-07-26 10:43:44 +09:00
vm_callinfo.h Extract vm_ic_entry API to mimic vm_cc behavior 2022-07-18 12:44:01 -07:00
vm_core.h Add "rb_" prefixes to toplevel enum definitions 2022-07-22 23:10:24 +09:00
vm_debug.h RUBY_DEBUG_LOG2 should filter against the given file 2022-07-28 16:05:48 +09:00
vm_dump.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
vm_eval.c respect current frame of rb_eval_string 2022-08-01 17:48:05 +09:00
vm_exec.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
vm_exec.h Add ISEQ_BODY macro 2022-03-24 10:03:51 -04:00
vm_insnhelper.c Adjust styles [ci skip] 2022-07-27 18:42:27 +09:00
vm_insnhelper.h Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
vm_method.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
vm_opts.h
vm_sync.c Fix format specifier for rb_ractor_id() 2022-07-28 23:46:06 +09:00
vm_sync.h
vm_trace.c Expand tabs [ci skip] 2022-07-21 09:42:04 -07:00
vsnprintf.c
warning.rb
yjit.c YJIT: Teach getblockparamproxy to handle the no-block case without exiting (#6191) 2022-07-28 11:38:07 -04:00
yjit.h YJIT: Undef YJIT_SUPPORTED_P for hygiene 2022-06-26 08:36:10 -04:00
yjit.rb Speed up --yjit-trace-exits code (#6106) 2022-07-12 16:40:49 -04:00

Actions Status: MinGW Actions Status: MJIT Actions Status: Ubuntu Actions Status: Windows AppVeyor status Travis Status Cirrus Status

What is Ruby?

Ruby is an interpreted object-oriented programming language often used for web development. It also offers many scripting features to process plain text and serialized files, or manage system tasks. It is simple, straightforward, and extensible.

Features of Ruby

  • Simple Syntax
  • Normal Object-oriented Features (e.g. class, method calls)
  • Advanced Object-oriented Features (e.g. mix-in, singleton-method)
  • Operator Overloading
  • Exception Handling
  • Iterators and Closures
  • Garbage Collection
  • Dynamic Loading of Object Files (on some architectures)
  • Highly Portable (works on many Unix-like/POSIX compatible platforms as well as Windows, macOS, etc.) cf. https://github.com/ruby/ruby/blob/master/doc/maintainers.rdoc#label-Platform+Maintainers

How to get Ruby with Git

For a complete list of ways to install Ruby, including using third-party tools like rvm, see:

https://www.ruby-lang.org/en/downloads/

The mirror of the Ruby source tree can be checked out with the following command:

$ git clone https://github.com/ruby/ruby.git

There are some other branches under development. Try the following command to see the list of branches:

$ git ls-remote https://github.com/ruby/ruby.git

You may also want to use https://git.ruby-lang.org/ruby.git (actual master of Ruby source) if you are a committer.

Ruby home page

https://www.ruby-lang.org/

Documentation

Mailing list

There is a mailing list to discuss Ruby. To subscribe to this list, please send the following phrase:

subscribe

in the mail body (not subject) to the address ruby-talk-request@ruby-lang.org.

Copying

See the file COPYING.

Feedback

Questions about the Ruby language can be asked on the Ruby-Talk mailing list or on websites like https://stackoverflow.com.

Bugs should be reported at https://bugs.ruby-lang.org. Read "Reporting Issues" for more information.

Contributing

See "Contributing to Ruby", which includes setup and build instructions.

The Author

Ruby was originally designed and developed by Yukihiro Matsumoto (Matz) in 1995.

matz@ruby-lang.org