[Pkg-javascript-commits] [node-htmlparser2] annotated tag v3.5.1 created (now 714cddc)
Wolfgang Borgert
debacle at moszumanska.debian.org
Mon Oct 6 21:08:56 UTC 2014
This is an automated email from the git hooks/post-receive script.
debacle pushed a change to annotated tag v3.5.1
in repository node-htmlparser2.
at 714cddc (tag)
tagging 8006c5bd0b2c33a084ca81ff7d0e59b7d1c30da3 (commit)
tagged by fb55
on Tue Feb 25 22:06:22 2014 +0100
- Log -----------------------------------------------------------------
3.5.1
Andreas Lind Petersen (1):
Added missing void elements.
Andreas Madsen (6):
[tokenizer] text in special tags that looks like a tag ending
[tokenizer] don't reset comment state in case of long endings
[Tokenizer] don't reset CDATA state in case of long endings
[Parser] open tags before close if never opened
[Parser] implicit open only p and br tags
[Tokenizer] move if context to methods allowing .write to be optimized
Chris Winberry (60):
first commit
Initial push
Fixed some parsing problems found while creating tests
Added automated tests in preparation for chunk parsing
Added license
Added some more tests
Added comments to the code
Beginning of change to streamable parser
Fixed test (missing attributes)
Refactored with stream parsing and support for custom handlers
Fixed chunking bug and made it browser-compatible
Added browser example
Create browser test suite
Fixed bug in test result
Added email to notice
Updated tests to be browser-compatible
Updated info on tests
Finished callback handling
Fixed DefaultHandler to pass dom to callback
Changed tagname detection for a 40% reduction in parse time
Added profiler.js for profiling and comparison to libxmljs
Updated profile.js to load a live html page
Updated casing of method names and restructured a few things
Updated profile.js for configurability
Added some comments for future performance work
Added options parameter to DefaultHandler
Updated README
Updated README
Updated README
Updated README
Updated README
Updated README
Fixed bug and removed deprecated methods
Added DomUtils
Fixed IE8 bug with parsing tag attribs
Fixed Opera bug
Added missing JSON2 support include
Fixed IE 6/7 bug
Updated selectors to support function qualifiers
Added minified version
Added test for XML namespaces
Added DefaultHandler option "enforceEmptyTags" so that XML can be parsed correctly
Added support for RSS/Atom feeds
Fixed README.md
Optimized DomUtils by up to 2-3x
Removed two debugging lines
Updated to be added to npm
Added npm instructions
Added npm instructions
Updated tests to use npm-compatible module location
Fixed DomUtils.testElement() and added new, related projects to the README
Fixed 'prevElement' error pointed out by Swizec
Added options parameter to parser constructor
Added test case for document position data. Test revealed that the feature is broken for chunked parsing, so the feature is being disabled until a fix is available
Forgot to include minified library
Updated version
Updated lib filename to exclude 'node-' prefix and included backwards compatible shim for those still expecting 'node-htmlparser.*'
Updated docs
Preparing for 1.8.0
Updated license text for 2011
David Rousselie (1):
Add startIndex and endIndex positional attributes to the parser
Felix Böhm (386):
Restructured code, improved readability, deleted junk
didn't export constructors
Now all tests pass
moved runtests.js to tests-directory
fixed tests
moved "last()"-method from _callStack to DefaultHandlers prototype
removed repeating code in RssHandler
again some cleanup
Added EventedHandler, using an interface like sax.js
Some improvements inside the parser. Still very ugly.
Export the evented handler
Fixed a bug & restructured some code
Wrong number of arguments was passed in EventedHandler
Removed repeating code
renamed module, added it to npm (as "htmlparser2")
Replaced indexOf(a)!==0 with substring(a.length)!==a, charAt(length-1) with substr(-1) + many other improvements
1.1.0
Restructured DomUtils
Some small improvements
npm bugfix
use util.inherits (function was taken from node anyway)
Code cleanup, preparation for big changes
Create element object later
Prepared big changes
Some steps preparing for the upcoming changes
Removed regexp for comment trimming
Deleted browser related content in tests
Added a test
Fixed test 23
Removed trash in runtests.js
Replaced _reTags with indexOf
Added doctype to first test
Removed char loop from Parser#getLocation
Renamed htmlparser.js to index.js
Don't allow <tagname <tagname
corrected reference in runtests
Small changes to Parser#getLocation
Updated readme
readme, again
Added lines to readme
removed backticks from readme
Removed last try-catch-blocks from RssHandler
parseAttributes now returns an object
parseAttributes now accepts an elements data
minor improvements
Restructured some code
bugfix
Small adjustments
1.2.0
Added RDF feed support to RssHandler
restructured code
Removed duplicated code
Split writeTag in both handlers to smaller functions + added a test for events
added a limit to the elements fetched inside RssHandler
improved DomUtils
some small adjustments
Replaced _tagStack with _contentFlags, tweaked DefaultHandler
Made openTag & closeTag public methods
Improved _addDomElement
Set feeds to xmlMode
minor changes
Removed _elements, removed support for writeTag inside handlers
RssHandler now calls a given callback
updated readme
fixed links in readme
Updated readme
Minor changes
Added options to EventedHandler
fixed a potential bug
adjustments
commented sax.js related part
1.5
restructured tests, rewrote 00-runtests.js
Minor improvements
Renamed RssHandler to FeedHandler
Removed .raw and .data from elements, removed position data (no more verbose)
Removed the verbose property from DefaultHandler
Removed Parser#_current
Moved assignment of current, started to use continue
Fixed some tests
Fixed two bugs
All tests now match the new pattern
Use the new name inside FeedHandler
Updated readme
updated readme
Restructured some code, added 2x continue
Don't call validateHandler by default
Moved default callbacks for event tests to the processing part
removed type property
Added the template html-test as a event test
Now all tests pass
fix
Moved empty tags to ClosingTags, improved closeTag logic
Moved the interface of EventedHandler to the parser
removed require EventedHandler
fixed getElementsByTagType
Added tests for DomUtils
Added force option to parseTags, removed logic from done
Added option to convert tag names to lower case
Added a prototype for a new FeedHandler (not finished yet!)
Removed EventedHandler from index.js
Added callbacks to prototype
Restructured some code in FeedHandler
Updated readme concerning verbose output
removed unused vars
Restructured some code
Updated readme
Throw if there was an error
Added .travis.yml
Fixed tests dir in package.json
Added Travis status to readme
Updated readme, moved options to wiki
Updated example code
Changed formatting
Feedhandler should always export the feed tree
Rewrote parts of DomUtils
use Parser#write instead of parseChunk
Ensure that there is always an attribs object
2.0.0
Added syntax highlighting to readme
Fixed a bug
cosmetic changes
Added oncommentend event, fixed #2
Added Parser#end as an alias for #done
Accept malformed directives
Added getInnerHTML & getOuterHTML methods to DomUtils
Make empty tags self-closing, handle attributes better in DomUtils
Added a better regexp to test for unquoted attributes
Restructured Parser#_parseTags
Shortened Parser#_parseTagName, removed elementName var
Replaced Parser#_parseState with Parser#_tagSep instead
Removed Parser#_isEmptyTag
Fixed a bug, introduced Parser#_wroteSpecial
2.1.1
Added test for upper case tags
Dynamically load files when requested in index.js
Moved self-closing tags to parser, removed ClosingTags.js
Moved self-closing tags to parser
Made the callback optional
2.1.2
Some fixes & improvements
Added another test for #6
Added support for CDATA, optimized regexps
Switched to ES6 proxies inside events test
Improved attrib parsing (again)
Shortened Parser#_parseTagName
Moved cb names to tests/test-helper, added additional tests to benchmark
Fixed a bug with boolean attributes
Use Array#lastIndexOf inside Parser#_processCloseTag
2.1.3
Removed FeedHandler prototype
Added a Stream interface
Minor changes
Added two new events, `onopentagname` and `onattribute`
Moved feeds to /tests/Documents
Added missing `end` event to Stream
Added a test for Streams, made runtests async
Changed comments
Introduced TagValues in Parser
Removed proxy flag from tests
Fix for `stream` module in node 0.4.x
Renamed DefaultHandler to DomHandler
Use the new name inside the readme
Updated readme, changed examples from DomHandler to just the parser
Removed information about running test
Fixed bug in example
Changed compare
Added Parser#pause and Parser#resume
Updated package.json
Renamed Parser#_paused to _running, added check in Parser#continue, added logic to write ">" inside text fields
Removed DomHandler#_inSpecialTag
Fixed tests
2.2.1
Minor changes
2.2.2
Removed unnecessary code inside FeedHandler
Removed newlines, items var from FeedHandler
Cache item.children inside FeedHandler
Removed `type` attribute from cbs.onopentag
Moved event names to index.js as `EVENTS`
Added WritableStream interface
Renamed Parser#_processComment to _writeComment, moved logic for CDATA to #_writeCDATA
Singular attributes should just return an empty string, not their names
Cosmetic changes
Renamed HTML dir to `DOM`
Renamed `01-html.js` to `01-dom.js`
Fixed a bug
2.2.3
Sort properties before stringify in runtests
RDF feeds now have a type of `rdf`
Removed clutter from tests
Set prototype of emptyTags to null
Added bench.js
Added ProxyHandler
Parser#_processOpenTag now takes a single argument
Use onopentagname and onattribute events in DomHandler
Use JSON instead of JS files for describing tests
Don't require `fs` inside the feed tests, just specify a path
Use the assert library for equality checks, + some minor improvements
Fixed names of stream tests
2.2.4
Also test with node 0.7.x
Fixed ProxyHandler, use Object.defineProperty instead of __define[Getter/Setter]
Use bitwise operators for flags, fixed a bug
Removed Object.keys calls and the switch statement, fixed a bug in DomUtils
2.2.5
Fixed some errors in readme
Removed ifs from constructors (no more polymorphic objects)
2.5.6
Fixed reset functions
2.2.7
Breaking change: XML-comments inside special tags don't get special treatment.
DomHandler: Added an option to add a reference to each elements parent
2.2.8
added support for the universal selector ("*")
2.2.0
removed switch in Stream.js
fixed whitespace
quick fix for #19
added lowerCaseAttributeNames option
2.3.0
Added a `onopentagend` event
moved DomHandler & DomUtils to their own module
Updated readme
2.3.1
publish the element types from DomHandler
use numeric element types
don't expose HandlerModule
stylistic changes
use the new dom modules, 2.5.0
domelementtype must be version 1.x (not 1.0)
2.5.1
domelementtype must be version 1.x (not 1.0)
2.5.1
test on node 0.6, 0.8 & 0.9
FeedHandler should return an error when nothing's found
added missing semicolon in test-helper.js
improved how tests are run
don't run 03-rdf.js test
renamed tests
added semicolons & use EE#on in 02-stream.js
changed how the end of all tests is shown
allow `>` at the beginning of a document
2.5.2
2.6.0
landed first version of FSM based tokenizer
added logic for special tags
[tokenizer] don't fail on `< >` and `< / >`
[tokenizer] fixed ordering in cleanup
[tokenizer] overwrite WritableStream#end, emit everything that's left
[tokenizer] take care of this._index in cleanup, emit all text
[tokenizer] set _sectionStart to 0 when text was emitted
[tokenizer] call WritableStream#end after emitting the remaining data
[tokenizer] call .write instead of ._write
[parser] use the tokenizer
removed WritableStream.js and ElementType.js
[parser] made Parser#reset work again
fall back to the readable-stream module
[travis] removed 0.6 & 0.9, added 0.10 and 0.11
minor changes
[index.js] removed redundant code
[stream] use a named function
3.0.0
[tokenizer] always call WritableStream#end
[parser] call Tokenizer#end, clear the stack
[index.js] added `createDomStream()` convenience method
[tokenizer] added `opentagend` event
[parser] use `opentagend` event
3.0.1
[tokenizer] emit opentagend on selfclosing tags, fixed handling of < in special tags
[index.js] added tokenizer
[tests] text events now contain more data
[tokenizer] don't inherit from stream.Writable, fixed several bugs
[tests/events] concat text events
[tests/events] fixed order of attribute/opentag events, merged text events
[tokenizer] use strings instead of buffers
[parser] don't implement stream.Writable, use new tokenizer interface
[tests/stream] fixed order of events
[tokenizer] simplified logic
[parser] fixed handling of implied closing and empty tags
[tests/events] accidentally removed part of the document
added a WritableStream interface again
3.0.0 (finally!)
[tokenizer] changed internal name to `Tokenizer`
[proxyhandler] don't use getters/setters
added CollectingHandler
[tests] use the new CollectingHandler
[tests] removed unused `f` var
3.0.1
3.0.2
[bench] use setImmediate instead of process.nextTick
[bench] try to test all available modules
[bench] removed unused functions, improved output
[readme] updated benchmarks
[doc] call `end`, use single quotes
[doc] updated section about node-htmlparser
renamed repository, 3.0.3
[tokenizer] name states consistently
[readme] small updates
[tokenizer] don't emit an "onopentagend" event for self-closing tags
[parser] fixed handling of self-closing tags
[tests] stream tests are run again
[tests/feeds] run rdf test again
[tests/stream] enabled xmlMode for RSS test
[tests/stream] create a new handler for the second run
[tests/stream] added tests for the files in tests/Documents
3.0.4
[parser] lowercase instruction names if lowerCaseTags option is set
3.0.5
[tests/events] added test case for jsdom#368
changed behavior for non-xml mode
[tests/events] updated tests to reflect latest changes
3.1.0
[bench] added the hubbub & html-parser modules
3.1.2
3.1.3
[parser] renamed emptyTags to voidElements, sorted them
[parser] improved consistency & simplified
[tokenizer] simplified `end` logic
[tokenizer] removed noop blocks in AFTER_{COMMENT,CDATA}_2
[tokenizer] use `continue` instead of decreasing the index
[bench] removed unnecessary noop functions
[tokenizer] improved handling of remaining data
3.1.4
3.1.5
[tokenizer] don't save the options object
[tokenizer] use ternary expressions for simple states
[tokenizer] added variables for states of _special
[tokenizer] fixed whitespace
[tokenizer] more ternaries
[tokenizer] simplified _cleanup a bit
[tokenizer] united some branches
[tokenizer] get rid of _reconsume
[tokenizer] even more ternaries
[tokenizer] added abstractions for common state types, fixed previous regression
[tokenizer] added _getSection, completely inlined _emitIfToken, partly inlined _emitToken
[tokenizer] simplified _stateInTagName
[tokenizer] simplified _stateInAttributeValueNoQuotes, reordered _stateBeforeAttributeName
3.1.6
[tests] added test for second occurrence of same attribute
[tokenizer] started adding support for HTML entities
[tokenizer] corrected decoding of numeric entities
[tokenizer] numeric entities are now decoded
[tests] added test case for numeric entities
[tokenizer] renamed the self-closing tags state, moved it to its own function
[tokenizer] commented out support for entities in attributes
[readme] updated benchmark results
[bench] removed internal benchmarks
[parser] fixed whitespace
[parser] moved common logic to _updatePosition function
[tokenizer] renamed IN_ATTRIBUTE_NAME_* states, improved formatting
[tokenizer] re-added the carriage return as whitespace
[tokenizer] fixed handling of unparsed data in end(), added support for several states
[entities] added maps for normal & legacy entities
[tokenizer] added support for decoding HTML entities in `ontext` events
[tests] added test cases for decoding legacy & named entities
[entities] added map for XML entities
[tokenizer] added support for XML entities
[tests] also test trailing data support in the numeric entity test
[tokenizer] fixed handling non-existent entities
[tests] added test case for XML entities
[tokenizer] added _emitEntity
3.2.0
[tokenizer] moved decodeMap to entities/decode.json
[tokenizer] renamed _emitEntity to _emitPartial
[index] statically export Parser, Tokenizer and DomHandler
[parser] use String#search and String#substr instead of String#split
[parser] added onattribdata and onattribend events, dropped onattribvalue
[tokenizer] enable support for decoding entities in attributes, added onattribend and onattribdata events, removed onattribvalue
[tests] added test case for entities in attributes
3.2.1
[tokenizer] don't decode entities in special tags
3.2.2
[tokenizer] reintroduced _special, removed IN_SCRIPT and IN_STYLE
3.2.3
[parser] properly removed self-closing tag support
[tests] read files in the tests file, improved os interoperability of stream test
[tests] added helper.getCallback method
[tests] converted tests to mocha
[tests] renamed tests dir to `test`
[package] run mocha as the test script
[tests] simplifications
3.2.4
[tests] added test case for MatthewMueller/cheerio#247
update to DomHandler at 2.1, updated FeedHandler accordingly, bump
[tests] write only single characters for testing chunked data
Felix Böhm (59):
Merge pull request #7 from siddMahen/master
Some minor changes
Merge pull request #12 from siddMahen/master
Merge pull request #21 from lahmatiy/master
fixed travis badge
Revert "stylistic changes"
Revert "Revert "stylistic changes""
added missing comma in benchmark script
added missing comma in benchmark script
Merge pull request #31 from jugglinmike/text-after-cdata
Merge pull request #28 from myndzi/master
Update README.md
Merge pull request #32 from jugglinmike/cdata-inside-special
Merge pull request #37 from eonlepapillon/Add-test-for-Issue-#36
Merge pull request #38 from burl/master
use DomUtils.getText in fetch, split getElements
[feedhandler] recursively walk the tree
Merge pull request #46 from One-com/missing_void_elements
Merge pull request #48 from AndreasMadsen/script-in-script
[tokenizer] consume token again
[parser] still recognize other options in non-xml-mode
3.1.1
Merge pull request #49 from AndreasMadsen/long-comment
Merge pull request #50 from AndreasMadsen/long-cdata-ending
readme: added version badge
[readme] added yet another badge (dependency versions)
Merge pull request #54 from abarre/master
Merge pull request #52 from AndreasMadsen/implicit-open
[readme] it~~'~~s
Merge pull request #55 from ForbesLindesay/patch-1
Merge pull request #56 from ForbesLindesay/patch-1
[parser] default options & cbs to empty objects
Merge pull request #58 from xcoderzach/master
[parser] don't overwrite attribute values on second occurrence
[readme] behavior of example changed due to #58
[tokenizer] recognize the form feed (U+0C), drop the carriage return (U+0D)
Merge pull request #61 from AndreasMadsen/optimize
Merge pull request #63 from fasterize/parser_positions
only respect self-closing tags in XML mode
Delete .DS_Store
[tokenizer] emit `onattribdata` in `_handleTrailingData`
[readme] updated performance characteristics
[tokenizer] handle `<<` correctly
3.2.5
[package] require domutils at 1.2
package: update readable-stream
package: use simple `license` field
replace non-breaking space with regular space
index: pass `options` argument to constructors
tests: remove unused `cb` argument
feedhandler: wrap assignments
tests: changed indentation to tabs
package: updated dom module versions, 3.4.0
Merge pull request #75 from superdweebie/master
Merge pull request #74 from patrick-steele-idem/master
3.4.1
parser: adjusted whitespace, fixed _updatePosition
3.5.0
Delete .DS_Store
Felix Gnass (5):
Check for empty tags is now case-insensitive. Makes jsdom pass the W3C test-suite which use upper-case tags.
Added support for extracting the source location, i.e. line- and column-number. The feature can be enabled by setting `includeLocation` to true on the parser instance.
Merge branch 'master' of https://github.com/tautologistics/node-htmlparser
Line numbers now start at 1. Optimized the loop so that it starts where it left off.
Fixed position calculation in chunked mode
Forbes Lindesay (3):
Add parseDOM and parseFeed helper methods
Add link to live demo
Update link to demo
ForbesLindesay (3):
Add .gitignore
Add .gitattributes so tests still work on windows
Normalize line endings
Kris Reeves (9):
Made the attribute regular expression more correct with regards to unquoted attribute values.
I didn't understand how RegExps worked in this way, and was desynching the attributes count. Here's a different way to accomplish the same thing.
Merge branch 'master' of https://github.com/fb55/node-htmlparser
Better handling of implied close tags. A list is given of tags whose close is implied by other tags being opened, and these are closed when those tags are opened. This helps correctly parse things like lists and tables with unterminated LI or TD tags.
spaces -> tabs, thought the merge would update my local files to the correct spacing (and tried to match that)
Derp.
Merge remote-tracking branch 'upstream/master'
Tests for changes.
Fixes discussed in https://github.com/fb55/node-htmlparser/pull/28
Mike Pennisi (2):
Recognize closing CDATA tags as end of "special"
Do not parse CDATA-like text inside special tags
Patrick Steele-Idem (6):
#73 Added support for recognizing self-closing tags and CDATA in non-XML mode
Fix option to disable lower case tags and attrs in non-XML mode
Added this._lowerCaseTagNames and this._lowerCaseAttributeNames
Handle case where options is null and allow truthy values
Switched to using "in" operator for options
Merged options initialization into a single line
Siddharth Mahendraker (2):
Makes sure recursion works properly
Cleaned up DomUtils
Tim Roediger (1):
Add self-closing svg tags
Zach Smith (1):
[tokenizer] fix case where `<` followed by whitespace doesn't parse correctly
abarre (1):
Fix perf regression in the Tokenizer : avoid a concatenation
burl (1):
[tokenizer] fix for script tags causing following nodes to be interpreted as TEXT
eonlepapillon (1):
Add a new test for issue #36
fb55 (2):
tokenizer: Fixed handling of text containing `&` when decoding entities
3.5.1
lahmatiy (1):
Fix getOuterHTML for directives
-----------------------------------------------------------------------
No new revisions were added by this update.
--
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/pkg-javascript/node-htmlparser2.git