[Pkg-javascript-commits] [node-htmlparser2] annotated tag v3.5.1 created (now 714cddc)

Wolfgang Borgert debacle at moszumanska.debian.org
Mon Oct 6 21:08:56 UTC 2014


This is an automated email from the git hooks/post-receive script.

debacle pushed a change to annotated tag v3.5.1
in repository node-htmlparser2.

        at  714cddc   (tag)
   tagging  8006c5bd0b2c33a084ca81ff7d0e59b7d1c30da3 (commit)
 tagged by  fb55
        on  Tue Feb 25 22:06:22 2014 +0100

- Log -----------------------------------------------------------------
3.5.1

Andreas Lind Petersen (1):
      Added missing void elements.

Andreas Madsen (6):
      [tokenizer] text in special tags there looks like a tag ending
      [tokenizer] don't reset comment state in case of long endings
      [Tokenizer] don't reset CDATA state in case of long endings
      [Parser] open tags before close if never opened
      [Parser] implicit open only p and br tags
      [Tokenizer] move if context to methods allowing .write to be optimized

Chris Winberry (60):
      first commit
      Initial push
      Fixed some parsing problems found while creating tests
      Added automated tests in preparation for chunk parsing
      Added license
      Added some more tests
      Added comments to the code
      Beginning of change to streamable parser
      Fixed test (missing attributes)
      Refactored with stream parsing and support for custom handlers
      Fixed chunking bug and made it browser-compatible
      Added browser example
      Create browser test suite
      Fixed bug in test result
      Added email to notice
      Updated tests to be browser-compatible
      Updated info on tests
      Finished callback handling
      Fixed DefaultHandler to pass dom to callback
      Changed tagname detection for a 40% reduction in parse time
      Added profiler.js for profiling and comparison to libxmljs
      Updated profile.js to load a live html page
      Updated casing of method names and restructured a few things
      Updated profile.js for configurability
      Added some comments for future performance work
      Added options parameter to DefaultHandler
      Updated README
      Updated README
      Updated README
      Updated README
      Updated README
      Updated README
      Fixed bug and removed deprecated methods
      Added DomUtils
      Fixed IE8 bug with parsing tag attribs
      Fixed Opera bug
      Added mising JSON2 support include
      Fixed IE 6/7 bug
      Updated selectors to support function qualifiers
      Added minified version
      Added test for XML namespaces
      Added DefaultHandler option "enforceEmptyTags" so that XML can be parsed correctly
      Added support for RSS/Atom feeds
      Fixed README.md
      Optimized DomUtils by up to 2-3x
      Removed to debugging lines
      Updated to be added to npm
      Added npm instructions
      Added npm instructions
      Updated tests to use npm-compatible module location
      Fixed DomUtils.testElement() and added new, related projects to the README
      Fixed 'prevElement' error pointed out by Swizec
      Added options parameter to parser contructor
      Added test case for document position data. Test revealed that feature is broken for chunked parsing to feature is being disabled until a fix is available
      Forgot to include minified library
      Updated version
      Updated lib filename to exclude 'node-' prefix and included backwards compatible shim for those still expecting 'node-htmlparser.*'
      Updated docs
      Preparing for 1.8.0
      Updated license text for 2011

David Rousselie (1):
      Add startIndex and endIndex positional attributes to the parser

Felix Böhm (386):
      Restructured code, improved readability, deleted junk
      didn't export constructors
      Now all tests pass
      moved runtests.js to tests-directory
      fixed tests
      moved "last()"-method from _callStack to DefaultHandlers prototype
      removed repeating code in RssHandler
      again some cleanup
      Added EventedHandler, using an interface like sax.js
      Some improvements inside the parser. Still very ugly.
      Export the evented handler
      Fixed a bug & restructured some code
      Wrong number of arguments was passed in EventedHandler
      Removed repeating code
      renamed module, added it to npm (as "htmlparser2")
      Replaced indexOf(a)!==0 with substring(a.length)!==a, charAt(length-1) with substr(-1) + many other improvements
      1.1.0
      Restructured DomUtils
      Some small improvements
      npm bugfix
      use util.inherits (function was taken from node anyway)
      Code cleanup, preparation for big changes
      Create element object later
      Prepared big changes
      Some steps preparing for the upcoming changes
      Removed regexp for comment trimming
      Deleted browser related content in tests
      Added a test
      Fixed test 23
      Removed trash in runtests.js
      Replaced _reTags with indexOf
      Added doctype to first test
      Removed char loop from Parser#getLocation
      Renamed htmlparser.js to index.js
      Don't allow <tagname <tagname
      corrected reference in runtests
      Small changes to Parser#getLocation
      Updated readme
      readme, again
      Added lines to readme
      removed backticks from readme
      Removed last try-catch-blocks from RssHandler
      parseAttributes now returns an object
      parseAttributes now accepts an elements data
      minor improvements
      Restructured some code
      bugfix
      Small adjustments
      1.2.0
      Added RDF feed support to RssHandler
      restructured code
      Removed duplicated code
      Splited writeTag in both handlers to smaller functions + added a test for events
      added a limit to the elements fetched inside RssHandler
      improved DomUtils
      some small adjustments
      Replaced _tagStack with _contentFlags, tweaked DefaultHandler
      Made openTag & closeTag public methods
      Improved _addDomElement
      Set feeds to xmlMode
      minor changes
      Removed _elements, removed support for writeTag inside handlers
      RssHandler now calls a given callback
      updated readme
      fixed links in readme
      Updated readme
      Minor changes
      Added options to EventedHandler
      fixed a potential bug
      adjustments
      commented sax.js related part
      1.5
      restructured tests, rewrote 00-runtests.js
      Minor improvements
      Renamed RssHandler to FeedHandler
      Removed .raw and .data from elements, removed position data (no more verbose)
      Removed the verbose property from DefaultHandler
      Removed Parser#_current
      Moved assignment of current, started to use continue
      Fixed some tests
      Fixed two bugs
      All tests now match the new pattern
      Use the new name inside FeedHandler
      Updated readme
      updated readme
      Restructured some code, added 2x continue
      Don't call validateHandler by default
      Moved default callbacks for event tests to the processing part
      removed type property
      Added the template html-test as a event test
      Now all tests pass
      fix
      Moved empty tags to ClosingTags, improved closeTag logic
      Moved the interface of EventedHandler to the parser
      removed require EventedHandler
      fixed getElementsByTagType
      Added tests for DomUtils
      Added force option to parseTags, removed logic from done
      Added option to convert tag names to lower case
      Added a prototype for a new FeedHandler (not finished yet!)
      Removed EventedHandler from index.js
      Added callbacks to prototype
      Restructured some code in FeedHandler
      Updated readme concerning verbose output
      removed unused vars
      Restructured some code
      Updated readme
      Throw if there was an error
      Added .travis.yml
      Fixed tests dir in package.json
      Added Travis status to readme
      Updated readme, moved options to wiki
      Updated example code
      Changed formatting
      Feedhandler should always export the feed tree
      Rewrote parts of DomUtils
      use Parser#write instead of parseChunk
      Ensure that there is always an attires object
      2.0.0
      Added syntax highlighting to readme
      Fixed a bug
      cosmetic changes
      Added oncommentend event, fixed #2
      Added Parser#end as an alias for #done
      Accept malformed directives
      Added getInnerHTML & getOuterHTML methods to DomUtils
      Make empty tags self-closing, handle attributes better in DomUtils
      Added a better regexp to test for unquoted attributes
      Restructured Parser#_parseTags
      Shortened Parser#_parseTagName, removed elementName var
      Replaced Parser#_parseState with Parser#_tagSep instead
      Removed Parser#_isEmptyTag
      Fixed a bug, introduced Parser#_wroteSpecial
      2.1.1
      Added test for upper case case tags
      Dynamically load files when requested in index.js
      Moved self-closing tags to parser, removed ClosingTags.js
      Moved self-closing tags to parser
      Made the callback optional
      2.1.2
      Some fixes & improvements
      Added another test for #6
      Added support for CDATA, optimized regexps
      Switched to ES6 proxies inside events test
      Improved attrib parsing (again)
      Shortened Parser#_parseTagName
      Moved cb names to tests/test-helper, added additional tests to benchmark
      Fixed a bug with boolean attributes
      Use Array#lastIndexOf inside Parser#_processCloseTag
      2.1.3
      Removed FeedHandler prototype
      Added a Stream interface
      Minor changes
      Added two new events, `onopentagname` and `on attribut`
      Moved feeds to /tests/Documents
      Added missing `end` event to Stream
      Added a test for Streams, made runtests async
      Changed comments
      Introduced TagValues in Parser
      Removed proxy flag from tests
      Fix for `stream` module in node 0.4.x
      Renamed DefaultHandler to DomHandler
      Use the new name inside the readme
      Updated readme, changed examples from DomHandler to just the parser
      Removed information about running test
      Fixed bug in example
      Changed compare
      Added Parser#pause and Parser#resume
      Updated package.json
      Renamed Parser#_paused to _running, added check in Parser#continue, added logic to write ">" inside text fields
      Remonde DomHandler#_inSpecialTag
      Fixed tests
      2.2.1
      Minor changes
      2.2.2
      Removed unnecessary code inside FeedHandler
      Removed newlines, items var from FeedHandler
      Cache item.children inside FeedHandler
      Removed `type` attribute from cbs.onopentag
      Moved event names to index.js as `EVENTS
      Added WritableStream interface
      Renamed Parser#_processComment to _writeComment, moved logic for CDATA to #_writeCDATA
      Singular attributes should just return an empty string, not their names
      Cosmetic changes
      Renamed HTML dir to `DOM`
      Renamed `01-html.js` to `01-dom.js
      Fixed a bug
      2.2.3
      Sort properties before stringify in runtests
      RDF feeds now have a type of `rdf`
      Removed clutter from tests
      Set prototype of emptyTags to null
      Added bench.js
      Added ProxyHandler
      Parser#_processOpenTag now takes a single argument
      Use onopentagname and onattribute events in DomHandler
      Use JSON instead of JS files for describing tests
      Don't require `fs` inside the feed tests, just specify a path
      Use the assert library for equality checks, + some minor improvements
      Fixed names of stream tests
      2.2.4
      Also test with node 0.7.x
      Fixed ProxyHandler, use Object.defineProperty instead of __define[Getter/Setter]
      Use bitwise operators for flags, fixed a bug
      Removed Object.keys calls and the switch statement, fixed a bug in DomUtils
      2.2.5
      Fixed some errors in readme
      Removed ifs from constructors (no more polymorphic objects)
      2.5.6
      Fixed reset functions
      2.2.7
      Breaking change: XML-comments inside special tags don't get special treatment.
      DomHandler: Added an option to add a reference to each elements parent
      2.2.8
      added support for the universal selector ("*")
      2.2.0
      removed switch in Stream.js
      fixed whitespace
      quick fix for #19
      added lowerCaseAttributeNames option
      2.3.0
      Added a `onopentagend` event
      moved DomHandler & DomUtils to their own module
      Updated readme
      2.3.1
      publish the element types from DomHandler
      use numeric element types
      don't expose HandlerModule
      stylistic changes
      use the new dom modules, 2.5.0
      domelementtype must be version 1.x (not 1.0)
      2.5.1
      domelementtype must be version 1.x (not 1.0)
      2.5.1
      test on node 0.6, 0.8 & 0.9
      FeedHandler should return an error when nothing's found
      added missing semicolon in test-helper.js
      improved how tests are run
      don't run 03-rdf.js test
      renamed tests
      added semicolons & use EE#on in 02-stream.js
      changed how the end of all tests is shown
      allow `>` at the beginning of a document
      2.5.2
      2.6.0
      landed first version of FSM based tokenizer
      added logic for special tags
      [tokenizer] don't fail on `< >` and `< / >`
      [tokenizer] fixed ordering in cleanup
      [tokenizer] overwrite WritableStream#end, emit everything that's left
      [tokenizer] take care of this._index in cleanup, emit all text
      [tokenizer] set _sectionStart to 0 when text was emitted
      [tokenizer] call WritableStream#end after emitting the remaining data
      [tokenizer] call .write instead of ._write
      [parser] use the tokenizer
      removed WritableStream.js and ElementType.js
      [parser] made Parser#reset work again
      fall back to the readable-stream module
      [travis] removed 0.6 & 0.9, added 0.10 and 0.11
      minor changes
      [index.js] removed redundant code
      [stream] use a named function
      3.0.0
      [tokenizer] always call WritableStream#end
      [parser] call Tokenizer#end, clear the stack
      [index.js] added `createDomStream()` convenience method
      [tokenizer] added `opentagend` event
      [parser] use `opentagend` event
      3.0.1
      [tokenizer] emit opentagend on selfclosing tags, fixed handling of < in special tags
      [index.js] added tokenizer
      [tests] text events now contain more data
      [tokenizer] don't inherit from stream.Writable, fixed several bugs
      [tests/events] concat text events
      [tests/events] fixed order of attribute/opentag events, merged text events
      [tokenizer] use strings instead of buffers
      [parser] don't implement stream.Writable, use new tokenizer interface
      [tests/stream] fixed order of events
      [tokenizer] simplified logic
      [parser] fixed handling of implied closing and empty tags
      [tests/events] accidentally removed part of the document
      added a WritableStream interface again
      3.0.0 (finally!)
      [tokenizer] changed internal name to `Tokenizer`
      [proxyhandler] don't use getters/setters
      added CollectingHandler
      [tests] use the new CollectingHandler
      [tests] removed unused `f` var
      3.0.1
      3.0.2
      [bench] use setImmediate instead of process.nextTick
      [bench] try to test all available modules
      [bench] removed unused functions, improved output
      [readme] updated benchmarks
      [doc] call `end`, use single quotes
      [doc] updated section about node-htmlparser
      renamed repository, 3.0.3
      [tokenizer] name states consistently
      [readme] small updates
      [tokenizer] don't emit an "onopentagend" event for self-closing tags
      [parser] fixed handling of self-closing tags
      [tests] stream tests are run again
      [tests/feeds] run rdf test again
      [tests/stream] enabled xmlMode for RSS test
      [tests/stream] create a new handler for the second run
      [tests/stream] added tests for the files in tests/Documents
      3.0.4
      [parser] lowercase instruction names if lowerCaseTags option is set
      3.0.5
      [tests/events] added test case for jsdom#368
      changed behavior for non-xml mode
      [tests/events] updated tests to reflect latest changes
      3.1.0
      [bench] added the hubbub & html-parser modules
      3.1.2
      3.1.3
      [parser] renamed emptyTags to voidElements, sorted them
      [parser] improved consistency & simplified
      [tokenizer] simplified `end` logic
      [tokenizer] removed noop blocks in AFTER_{COMMENT,CDATA}_2
      [tokenizer] use `continue` instead of decreasing the index
      [bench] removed unnecessary noop functions
      [tokenizer] improved handling of remaining data
      3.1.4
      3.1.5
      [tokenizer] don't save the options object
      [tokenizer] use ternary expressions for simple states
      [tokenizer] added variables for states of _special
      [tokenizer] fixed whitespace
      [tokenizer] more ternaries
      [tokenizer] simplified _cleanup a bit
      [tokenizer] united some branches
      [tokenizer] get rid of _reconsume
      [tokenizer] even more ternaries
      [tokenizer] added abstractions for common state types, fixed previous regression
      [tokenizer] added _getSection, completely inlined _emitIfToken, partly inlined _emitToken
      [tokenizer] simplified _stateInTagName
      [tokenizer] simplified _stateInAttributeValueNoQuotes, reordered _stateBeforeAttributeName
      3.1.6
      [tests] added test for second occurance of same attribute
      [tokenizer] started adding support for HTML entities
      [tokenizer] corrected decoding of numeric entities
      [tokenizer] numeric entities are now decoded
      [tests] added test case for numeric entities
      [tokenizer] renamed the self-closing tags state, moved it to its own function
      [tokenizer] commented out support for entities in attributes
      [readme] updated benchmark results
      [bench] removed internal benchmarks
      [parser] fixed whitespace
      [parser] moved common logic to _updatePosition function
      [tokenizer] renamed IN_ATTRIBUTE_NAME_* states, improved formatting
      [tokenizer] re-added the carriage return as whitespace
      [tokenizer] fixed handling of unparsed data in end(), added support for several states
      [entities] added maps for normal & legacy entities
      [tokenizer] added support for decoding HTML entities in `ontext` events
      [tests] added test cases for decoding legacy & named entities
      [entities] added map for XML entities
      [tokenizer] added support for XML entities
      [tests] also test trailing data support in the numeric entity test
      [tokenizer] fixed handling non-existent entities
      [tests] added test case for XML entities
      [tokenizer] added _emitEntity
      3.2.0
      [tokenizer] moved decodeMap to entities/decode.json
      [tokenizer] renamed _emitEntity to _emitPartial
      [index] statically export Parser, Tokenizer and DomHandler
      [parser] use String#search and String#substr instead of String#split
      [parser] added onattribdata and onattribend events, dropped onattribvalue
      [tokenizer] enable support for decoding entities in attributes, added onattribend and onattribdata events, removed onattribvalue
      [tests] added test case for entities in attributes
      3.2.1
      [tokenizer] don't decode entities in special tags
      3.2.2
      [tokenizer] reintroduced _special, removed IN_SCRIPT and IN_STYLE
      3.2.3
      [parser] properly removed self-closing tag support
      [tests] read files in the tests file, improved os interoperability of stream test
      [tests] added helper.getCallback method
      [tests] converted tests to mocha
      [tests] renamed tests dir to `test`
      [package] run mocha as the test script
      [tests] simplifications
      3.2.4
      [tests] added test case for MatthewMueller/cheerio#247
      update to DomHandler at 2.1, updated FeedHandler accordingly, bump
      [tests] write only single characters for testing chunked data

Felix Böhm (59):
      Merge pull request #7 from siddMahen/master
      Some minor changes
      Merge pull request #12 from siddMahen/master
      Merge pull request #21 from lahmatiy/master
      fixed travis badge
      Revert "stylistic changes"
      Revert "Revert "stylistic changes""
      added missing comma in benchmark script
      added missing comma in benchmark script
      Merge pull request #31 from jugglinmike/text-after-cdata
      Merge pull request #28 from myndzi/master
      Update README.md
      Merge pull request #32 from jugglinmike/cdata-inside-special
      Merge pull request #37 from eonlepapillon/Add-test-for-Issue-#36
      Merge pull request #38 from burl/master
      use DomUtils.getText in fetch, split getElements
      [feedhandler] recursively walk the tree
      Merge pull request #46 from One-com/missing_void_elements
      Merge pull request #48 from AndreasMadsen/script-in-script
      [tokenizer] consume token again
      [parser] still recognize other options in non-xml-mode
      3.1.1
      Merge pull request #49 from AndreasMadsen/long-comment
      Merge pull request #50 from AndreasMadsen/long-cdata-ending
      readme: added version badge
      [readme] added yet another badge (dependency versions)
      Merge pull request #54 from abarre/master
      Merge pull request #52 from AndreasMadsen/implicit-open
      [readme] it~~'~~s
      Merge pull request #55 from ForbesLindesay/patch-1
      Merge pull request #56 from ForbesLindesay/patch-1
      [parser] default options & cbs to empty objects
      Merge pull request #58 from xcoderzach/master
      [parser] don't overwrite attribute values on second occurence
      [readme] behavior of example changed due to #58
      [tokenizer] recognize the form field (U+0C), drop the carriage return (U+0D)
      Merge pull request #61 from AndreasMadsen/optimize
      Merge pull request #63 from fasterize/parser_positions
      only respect self-closing tags in XML mode
      Delete .DS_Store
      [tokenizer] emit `onattribdata` in `_handleTrailingData`
      [readme] updated performance characteristics
      [tokenizer] handle `<<` correctly
      3.2.5
      [package] require domutils at 1.2
      package: update readable-stream
      package: use simple `license` field
      replace non-breaking space with regular space
      index: pass `options` argument to constructors
      tests: remove unused `cb` argument
      feedhandler: wrap assignments
      tests: changed indentation to tabs
      package: updated dom module versions, 3.4.0
      Merge pull request #75 from superdweebie/master
      Merge pull request #74 from patrick-steele-idem/master
      3.4.1
      parser: adjusted whitespace, fixed _updatePosition
      3.5.0
      Delete .DS_Store

Felix Gnass (5):
      Check for empty tags is now case-insensitive. Makes jsdom pass the W3C test-suite which use upper-case tags.
      Added support for extracting the source location, i.e. line- and column-number. The feature can be enabled by setting `includeLocation` to true on the parser instance.
      Merge branch 'master' of https://github.com/tautologistics/node-htmlparser
      Line numbers now start at 1. Optimized the loop so that it starts where it off.
      Fixed position calculation in chunked mode

Forbes Lindesay (3):
      Add parseDOM and parseFeed helper methods
      Add link to live demo
      Update link to demo

ForbesLindesay (3):
      Add .gitignore
      Add .gitattributes so tests still work on windows
      Normalize line endings

Kris Reeves (9):
      Made the attribute regular expression more correct with regards to unquoted attribute values.
      I didn't understand how RegExps worked in this way, and was desynching the attributes count. Here's a different way to accomplish the same thing.
      Merge branch 'master' of https://github.com/fb55/node-htmlparser
      Better handling of implied close tags. A list is given of tags whose close is implied by other tags being opened, and these are closed when those tags are opened. This helps correctly parse things like lists and tables with unterminated LI or TD tags.
      spaces -> tabs, thought the merge would update my local files to the correct spacing (and tried to match that)
      Derp.
      Merge remote-tracking branch 'upstream/master'
      Tests for changes.
      Fixes discussed in https://github.com/fb55/node-htmlparser/pull/28

Mike Pennisi (2):
      Recognize closing CDATA tags as end of "special"
      Do not parse CDATA-like text inside special tags

Patrick Steele-Idem (6):
      #73 Added support for recognizing self-closing tags and CDATA in non-XML mode
      Fix option to disable lower case tags and attars in non-XML mode
      Added this._lowerCaseTagNames and this._lowerCaseAttributeNames
      Handle case where options is null and allow truthy values
      Switched to using "in" operator for options
      Merged options initialization into a single line

Siddharth Mahendraker (2):
      Makes sure recursion works properly
      Cleaned up DomUtils

Tim Roediger (1):
      Add self-closeing svg tags

Zach Smith (1):
      [tokenizer] fix case where `<` followed by whitespace doesn't parse correctly

abarre (1):
      Fix perf regression in the Tokenizer : avoid a concatenation

burl (1):
      [tokenizer] fix for script tags causing following nodes to be interpreted as TEXT

eonlepapillon (1):
      Add a new test for issue #36

fb55 (2):
      tokenizer: Fixed handling of text containing `&` when decoding entities
      3.5.1

lahmatiy (1):
      Fix getOuterHTML for directives

-----------------------------------------------------------------------

No new revisions were added by this update.

-- 
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/pkg-javascript/node-htmlparser2.git



More information about the Pkg-javascript-commits mailing list