mirror of
https://github.com/ruby/ruby.git
synced 2022-11-09 12:17:21 -05:00
fa4bfa6af5
Fixes ticket:68. NOTE that this involves an API change! Entity declarations in the doctype now generate events that carry two, not one, arguments.

Implements ticket:15, using gwrite's suggestion. This allows Element to be subclassed.

Two unrelated changes, committed together because Subversion doesn't do block-level commits:

1) Fixed a typo bug in the previous change for ticket:15
2) Fixed namespace handling in XPath and element.
   ***** Note that this is an API change!!! *****
   Element.namespaces() now returns a hash of namespace mappings which are relevant for that node.

Fixes a bug in multiple decodings.

The changeset 1230:1231 was bad. The default behavior is *not* to use the native REXML encodings, but rather to use ICONV by default. I know that this will piss some people off, but defaulting to the pure Ruby version isn't the correct solution, and it breaks other encodings, so I've reverted it.

* Fixes ticket:61 (xpath_parser)
* Fixes ticket:63 (UTF-16; UNILE decoding was bad)
* Cleans up some tests, removing opportunities for test corruption
* Improves parsing error messages a little
* Adds the ability to override the encoding detection in Source construction
* Fixes an edge case in Functions::string, where document nodes weren't correctly converted
* Fixes Functions::string() for Element and Document nodes
* Fixes some problems in entity handling

Addresses ticket:66

Fixes ticket:71

Addresses ticket:78

NOTE that this also fixes what is technically another bug in REXML. REXML's XPath parser used to allow exponential notation in numbers. The XPath spec is specific about what a number is, and scientific notation is not included. Therefore, this has been fixed.

Cross-ported a fix for ticket:88 from CVS.

Fixes ticket:80

Documentation cleanup. Ticket:84

Applied Kou's fix for an un-trac'ed bug.

git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@11548 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
97 lines
3.4 KiB
Ruby
module REXML
  # A template for stream parser listeners.
  # Note that the declarations (attlistdecl, elementdecl, etc) are trivially
  # processed; REXML doesn't yet handle doctype entity declarations, so you
  # have to parse them out yourself.
  # === Missing methods from SAX2
  #  ignorable_whitespace
  # === Methods extending SAX2
  # +WARNING+
  # These methods are certainly going to change, until DTDs are fully
  # supported. Be aware of this.
  #  start_document
  #  end_document
  #  doctype
  #  elementdecl
  #  attlistdecl
  #  entitydecl
  #  notationdecl
  #  cdata
  #  xmldecl
  #  comment
  module SAX2Listener
    def start_document
    end
    def end_document
    end
    def start_prefix_mapping prefix, uri
    end
    def end_prefix_mapping prefix
    end
    def start_element uri, localname, qname, attributes
    end
    def end_element uri, localname, qname
    end
    def characters text
    end
    def processing_instruction target, data
    end
    # Handles a doctype declaration. Any attributes of the doctype which are
    # not supplied will be nil.
    # EG, <!DOCTYPE me PUBLIC "foo" "bar">
    # @p name the name of the doctype; EG, "me"
    # @p pub_sys "PUBLIC", "SYSTEM", or nil. EG, "PUBLIC"
    # @p long_name the supplied long name, or nil. EG, "foo"
    # @p uri the uri of the doctype, or nil. EG, "bar"
    def doctype name, pub_sys, long_name, uri
    end
    # If a doctype includes an ATTLIST declaration, it will cause this
    # method to be called. The content is the declaration itself, unparsed.
    # EG, <!ATTLIST el attr CDATA #REQUIRED> will come to this method as "el
    # attr CDATA #REQUIRED". This is the same for all of the .*decl
    # methods.
    def attlistdecl(element, pairs, contents)
    end
    # <!ELEMENT ...>
    def elementdecl content
    end
    # <!ENTITY ...>
    # The argument passed to this method is an array of the entity
    # declaration. It can be in a number of formats; each example below
    # shows a declaration followed by the resulting array:
    #  <!ENTITY % YN '"Yes"'>
    #  ["%", "YN", "'\"Yes\"'", "\""]
    #  <!ENTITY % YN 'Yes'>
    #  ["%", "YN", "'Yes'", "s"]
    #  <!ENTITY WhatHeSaid "He said %YN;">
    #  ["WhatHeSaid", "\"He said %YN;\"", "YN"]
    #  <!ENTITY open-hatch SYSTEM "http://www.textuality.com/boilerplate/OpenHatch.xml">
    #  ["open-hatch", "SYSTEM", "\"http://www.textuality.com/boilerplate/OpenHatch.xml\""]
    #  <!ENTITY open-hatch PUBLIC "-//Textuality//TEXT Standard open-hatch boilerplate//EN" "http://www.textuality.com/boilerplate/OpenHatch.xml">
    #  ["open-hatch", "PUBLIC", "\"-//Textuality//TEXT Standard open-hatch boilerplate//EN\"", "\"http://www.textuality.com/boilerplate/OpenHatch.xml\""]
    #  <!ENTITY hatch-pic SYSTEM "../grafix/OpenHatch.gif" NDATA gif>
    #  ["hatch-pic", "SYSTEM", "\"../grafix/OpenHatch.gif\"", "\n\t\t\t\t\t\t\tNDATA gif", "gif"]
    def entitydecl name, decl
    end
    # <!NOTATION ...>
    def notationdecl content
    end
    # Called when <![CDATA[ ... ]]> is encountered in a document.
    # @p content "..."
    def cdata content
    end
    # Called when an XML declaration is encountered in the document.
    # EG: <?xml version="1.0" encoding="utf"?>
    # @p version the version attribute value. EG, "1.0"
    # @p encoding the encoding attribute value, or nil. EG, "utf"
    # @p standalone the standalone attribute value, or nil. EG, nil
    def xmldecl version, encoding, standalone
    end
    # Called when a comment is encountered.
    # @p comment The content of the comment
    def comment comment
    end
    def progress position
    end
  end
end
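
Since every callback in SAX2Listener is an empty stub, a listener only needs to override the events it cares about. The sketch below is one possible way to use the module, not code from this repository: `OutlineListener` and the sample XML are hypothetical, and it assumes the `rexml` library is installed (it ships as a bundled gem with recent Ruby releases). It registers the listener with `REXML::Parsers::SAX2Parser#listen` and records the element and text events it receives.

```ruby
require 'rexml/parsers/sax2parser'
require 'rexml/sax2listener'

# Hypothetical listener (not part of REXML): records element and text events.
# Callbacks that are not overridden fall back to the empty stubs in
# REXML::SAX2Listener.
class OutlineListener
  include REXML::SAX2Listener

  attr_reader :events

  def initialize
    @events = []
  end

  # attributes is the attribute collection handed over by the parser.
  def start_element(uri, localname, qname, attributes)
    @events << [:start, qname, attributes]
  end

  # Whitespace-only text is skipped to keep the recorded outline short.
  def characters(text)
    stripped = text.strip
    @events << [:text, stripped] unless stripped.empty?
  end

  def end_element(uri, localname, qname)
    @events << [:end, qname]
  end
end

# Sample input, made up for this example.
xml = '<list kind="todo"><item>write</item><item>test</item></list>'

listener = OutlineListener.new
parser = REXML::Parsers::SAX2Parser.new(xml)
parser.listen(listener) # a listener passed without filters receives all events
parser.parse

listener.events.each { |event| puts event.inspect }
```

Including the module rather than defining every method by hand keeps the listener valid even when the parser emits events the class does not handle, such as `start_document` or `progress`.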