1
0
Fork 0
mirror of https://github.com/ruby/ruby.git synced 2022-11-09 12:17:21 -05:00

Web Mar 5 17:43:43 2008 Martin Duerst <duerst@it.aoyama.ac.jp>

* transcode.c (transcode_loop): Adjusted detection of invalid
	  (ill-formed) UTF-8 sequences. Fixing potential security issue, see
	  http://www.unicode.org/versions/Unicode5.1.0/#Notable_Changes.

	* test/ruby/test_transcode.rb: Added two tests for above fix.



git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@15692 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
This commit is contained in:
duerst 2008-03-05 08:45:51 +00:00
parent 39787ea14d
commit 08631278ad
3 changed files with 29 additions and 10 deletions

View file

@ -242,6 +242,11 @@ class TestTranscode < Test::Unit::TestCase
def test_invalid_ignore
# arguments only
'abc'.encode('utf-8', invalid: :ignore)
assert_nothing_raised { 'abc'.encode('utf-8', invalid: :ignore) }
# check handling of UTF-8 ill-formed subsequences
assert_equal("\x00\x41\x00\x3E\x00\x42".force_encoding('UTF-16BE'),
"\x41\xC2\x3E\x42".encode('UTF-16BE', 'UTF-8', invalid: :ignore))
assert_equal("\x00\x41\x00\xF1\x00\x42".force_encoding('UTF-16BE'),
"\x41\xC2\xC3\xB1\x42".encode('UTF-16BE', 'UTF-8', invalid: :ignore))
end
end