gitlab-org--gitlab-foss/lib/banzai/reference_extractor.rb
Yorick Peterse 86166d2802
Split Markdown rendering & reference gathering
This splits the Markdown rendering and reference extraction phases into
two distinct code bases. The reference extraction phase no longer relies
on the html-pipeline Gem (and any related code) and allows for
extracting of references from multiple HTML nodes in a single pass. This
means that if you want to extract user references from 200 comments you
no longer need to run 200 times N number of queries, instead only a
handful of queries may be needed.
2016-05-26 17:14:00 +02:00

29 lines
725 B
Ruby

module Banzai
# Extract possible GFM references from an arbitrary String for further processing.
class ReferenceExtractor
def initialize
@texts = []
end
def analyze(text, context = {})
@texts << Renderer.render(text, context)
end
def references(type, project, current_user = nil)
processor = Banzai::ReferenceParser[type].
new(project, current_user)
processor.process(html_documents)
end
private
def html_documents
# This ensures that we don't memoize anything until we have a number of
# text blobs to parse.
return [] if @texts.empty?
@html_documents ||= @texts.map { |html| Nokogiri::HTML.fragment(html) }
end
end
end