Streaming HTML parser with scripting support. A writable stream.

Example

const ParserStream = require('parse5-parser-stream');
const http = require('http');
const { finished } = require('node:stream');

// Fetch the page content and obtain it's <head> node
http.get('http://inikulin.github.io/parse5/', res => {
const parser = new ParserStream();

finished(parser, () => {
console.log(parser.document.childNodes[1].childNodes[0].tagName); //> 'head'
});

res.pipe(parser);
});

Type Parameters

Hierarchy

  • Writable
    • ParserStream

    Constructors

    Properties

    closed: boolean

    Is true after 'close' has been emitted.

    Since

    v18.0.0

    destroyed: boolean

    Is true after writable.destroy() has been called.

    Since

    v8.0.0

    errored: null | Error

    Returns error if the stream has been destroyed with an error.

    Since

    v18.0.0

    parser: Parser<T> = ...
    writable: boolean

    Is true if it is safe to call writable.write(), which means the stream has not been destroyed, errored or ended.

    Since

    v11.4.0

    writableCorked: number

    Number of times writable.uncork() needs to be called in order to fully uncork the stream.

    Since

    v13.2.0, v12.16.0

    writableEnded: boolean

    Is true after writable.end() has been called. This property does not indicate whether the data has been flushed, for this use writable.writableFinished instead.

    Since

    v12.9.0

    writableFinished: boolean

    Is set to true immediately before the 'finish' event is emitted.

    Since

    v12.6.0

    writableHighWaterMark: number

    Return the value of highWaterMark passed when creating this Writable.

    Since

    v9.3.0

    writableLength: number

    This property contains the number of bytes (or objects) in the queue ready to be written. The value provides introspection data regarding the status of the highWaterMark.

    Since

    v9.4.0

    writableNeedDrain: boolean

    Is true if the stream's buffer has been full and stream will emit 'drain'.

    Since

    v15.2.0, v14.17.0

    writableObjectMode: boolean

    Getter for the property objectMode of a given Writable stream.

    Since

    v12.3.0

    captureRejectionSymbol: typeof captureRejectionSymbol
    captureRejections: boolean

    Sets or gets the default captureRejection value for all emitters.

    defaultMaxListeners: number
    errorMonitor: typeof errorMonitor

    This symbol shall be used to install a listener for only monitoring 'error' events. Listeners installed using this symbol are called before the regular 'error' listeners are called.

    Installing a listener using this symbol does not change the behavior once an 'error' event is emitted, therefore the process will still crash if no regular 'error' listener is installed.

    Accessors

    Methods

    • Parameters

      • callback: ((error?) => void)
          • (error?): void
          • Parameters

            • Optional error: null | Error

            Returns void

      Returns void

    • Parameters

      • error: null | Error
      • callback: ((error?) => void)
          • (error?): void
          • Parameters

            • Optional error: null | Error

            Returns void

      Returns void

    • Parameters

      • callback: ((error?) => void)
          • (error?): void
          • Parameters

            • Optional error: null | Error

            Returns void

      Returns void

    • Parameters

      • chunks: {
            chunk: any;
            encoding: BufferEncoding;
        }[]
      • callback: ((error?) => void)
          • (error?): void
          • Parameters

            • Optional error: null | Error

            Returns void

      Returns void

    • Event emitter The defined events on documents including:

      1. close
      2. drain
      3. error
      4. finish
      5. pipe
      6. unpipe

      Parameters

      • event: "close"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "drain"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "error"
      • listener: ((err) => void)
          • (err): void
          • Parameters

            • err: Error

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "finish"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "pipe"
      • listener: ((src) => void)
          • (src): void
          • Parameters

            • src: Readable

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "unpipe"
      • listener: ((src) => void)
          • (src): void
          • Parameters

            • src: Readable

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: string | symbol
      • listener: ((...args) => void)
          • (...args): void
          • Parameters

            • Rest ...args: any[]

            Returns void

      Returns ParserStream<T>

    • The writable.cork() method forces all written data to be buffered in memory. The buffered data will be flushed when either the uncork or end methods are called.

      The primary intent of writable.cork() is to accommodate a situation in which several small chunks are written to the stream in rapid succession. Instead of immediately forwarding them to the underlying destination, writable.cork()buffers all the chunks until writable.uncork() is called, which will pass them all to writable._writev(), if present. This prevents a head-of-line blocking situation where data is being buffered while waiting for the first small chunk to be processed. However, use of writable.cork() without implementingwritable._writev() may have an adverse effect on throughput.

      See also: writable.uncork(), writable._writev().

      Returns void

      Since

      v0.11.2

    • Destroy the stream. Optionally emit an 'error' event, and emit a 'close'event (unless emitClose is set to false). After this call, the writable stream has ended and subsequent calls to write() or end() will result in an ERR_STREAM_DESTROYED error. This is a destructive and immediate way to destroy a stream. Previous calls towrite() may not have drained, and may trigger an ERR_STREAM_DESTROYED error. Use end() instead of destroy if data should flush before close, or wait for the 'drain' event before destroying the stream.

      Once destroy() has been called any further calls will be a no-op and no further errors except from _destroy() may be emitted as 'error'.

      Implementors should not override this method, but instead implement writable._destroy().

      Parameters

      • Optional error: Error

        Optional, an error to emit with 'error' event.

      Returns ParserStream<T>

      Since

      v8.0.0

    • Parameters

      • event: "close"

      Returns boolean

    • Parameters

      • event: "drain"

      Returns boolean

    • Parameters

      • event: "error"
      • err: Error

      Returns boolean

    • Parameters

      • event: "finish"

      Returns boolean

    • Parameters

      • event: "pipe"
      • src: Readable

      Returns boolean

    • Parameters

      • event: "unpipe"
      • src: Readable

      Returns boolean

    • Parameters

      • event: string | symbol
      • Rest ...args: any[]

      Returns boolean

    • Returns an array listing the events for which the emitter has registered listeners. The values in the array are strings or Symbols.

      const EventEmitter = require('events');
      const myEE = new EventEmitter();
      myEE.on('foo', () => {});
      myEE.on('bar', () => {});

      const sym = Symbol('symbol');
      myEE.on(sym, () => {});

      console.log(myEE.eventNames());
      // Prints: [ 'foo', 'bar', Symbol(symbol) ]

      Returns (string | symbol)[]

      Since

      v6.0.0

    • Returns the current max listener value for the EventEmitter which is either set by emitter.setMaxListeners(n) or defaults to defaultMaxListeners.

      Returns number

      Since

      v1.0.0

    • Returns the number of listeners listening to the event named eventName.

      Parameters

      • eventName: string | symbol

        The name of the event being listened for

      Returns number

      Since

      v3.2.0

    • Returns a copy of the array of listeners for the event named eventName.

      server.on('connection', (stream) => {
      console.log('someone connected!');
      });
      console.log(util.inspect(server.listeners('connection')));
      // Prints: [ [Function] ]

      Parameters

      • eventName: string | symbol

      Returns Function[]

      Since

      v0.1.26

    • Alias for emitter.removeListener().

      Parameters

      • eventName: string | symbol
      • listener: ((...args) => void)
          • (...args): void
          • Parameters

            • Rest ...args: any[]

            Returns void

      Returns ParserStream<T>

      Since

      v10.0.0

    • Raised when parser encounters a <script> element. If this event has listeners, parsing will be suspended once it is emitted. So, if <script> has the src attribute, you can fetch it, execute and then resume parsing just like browsers do.

      Parameters

      • event: "script"

        Name of the event

      • handler: ((scriptElement, documentWrite, resume) => void)
          • (scriptElement, documentWrite, resume): void
          • Parameters

            • scriptElement: T["element"]
            • documentWrite: ((html) => void)
                • (html): void
                • Parameters

                  • html: string

                  Returns void

            • resume: (() => void)
                • (): void
                • Returns void

            Returns void

      Returns void

      Example

      const ParserStream = require('parse5-parser-stream');
      const http = require('http');

      const parser = new ParserStream();

      parser.on('script', (scriptElement, documentWrite, resume) => {
      const src = scriptElement.attrs.find(({ name }) => name === 'src').value;

      http.get(src, res => {
      // Fetch the script content, execute it with DOM built around `parser.document` and
      // `document.write` implemented using `documentWrite`.
      ...
      // Then resume parsing.
      resume();
      });
      });

      parser.end('<script src="example.com/script.js"></script>');
    • Base event handler.

      Parameters

      • event: string

        Name of the event

      • handler: ((...args) => void)

        Event handler

          • (...args): void
          • Parameters

            • Rest ...args: any[]

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "close"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "drain"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "error"
      • listener: ((err) => void)
          • (err): void
          • Parameters

            • err: Error

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "finish"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "pipe"
      • listener: ((src) => void)
          • (src): void
          • Parameters

            • src: Readable

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "unpipe"
      • listener: ((src) => void)
          • (src): void
          • Parameters

            • src: Readable

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: string | symbol
      • listener: ((...args) => void)
          • (...args): void
          • Parameters

            • Rest ...args: any[]

            Returns void

      Returns ParserStream<T>

    • Type Parameters

      • T extends WritableStream

      Parameters

      • destination: T
      • Optional options: {
            end?: boolean;
        }
        • Optional end?: boolean

      Returns T

    • Parameters

      • event: "close"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "drain"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "error"
      • listener: ((err) => void)
          • (err): void
          • Parameters

            • err: Error

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "finish"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "pipe"
      • listener: ((src) => void)
          • (src): void
          • Parameters

            • src: Readable

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "unpipe"
      • listener: ((src) => void)
          • (src): void
          • Parameters

            • src: Readable

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: string | symbol
      • listener: ((...args) => void)
          • (...args): void
          • Parameters

            • Rest ...args: any[]

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "close"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "drain"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "error"
      • listener: ((err) => void)
          • (err): void
          • Parameters

            • err: Error

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "finish"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "pipe"
      • listener: ((src) => void)
          • (src): void
          • Parameters

            • src: Readable

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "unpipe"
      • listener: ((src) => void)
          • (src): void
          • Parameters

            • src: Readable

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: string | symbol
      • listener: ((...args) => void)
          • (...args): void
          • Parameters

            • Rest ...args: any[]

            Returns void

      Returns ParserStream<T>

    • Returns a copy of the array of listeners for the event named eventName, including any wrappers (such as those created by .once()).

      const emitter = new EventEmitter();
      emitter.once('log', () => console.log('log once'));

      // Returns a new Array with a function `onceWrapper` which has a property
      // `listener` which contains the original listener bound above
      const listeners = emitter.rawListeners('log');
      const logFnWrapper = listeners[0];

      // Logs "log once" to the console and does not unbind the `once` event
      logFnWrapper.listener();

      // Logs "log once" to the console and removes the listener
      logFnWrapper();

      emitter.on('log', () => console.log('log persistently'));
      // Will return a new Array with a single function bound by `.on()` above
      const newListeners = emitter.rawListeners('log');

      // Logs "log persistently" twice
      newListeners[0]();
      emitter.emit('log');

      Parameters

      • eventName: string | symbol

      Returns Function[]

      Since

      v9.4.0

    • Removes all listeners, or those of the specified eventName.

      It is bad practice to remove listeners added elsewhere in the code, particularly when the EventEmitter instance was created by some other component or module (e.g. sockets or file streams).

      Returns a reference to the EventEmitter, so that calls can be chained.

      Parameters

      • Optional event: string | symbol

      Returns ParserStream<T>

      Since

      v0.1.26

    • Parameters

      • event: "close"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "drain"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "error"
      • listener: ((err) => void)
          • (err): void
          • Parameters

            • err: Error

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "finish"
      • listener: (() => void)
          • (): void
          • Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "pipe"
      • listener: ((src) => void)
          • (src): void
          • Parameters

            • src: Readable

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: "unpipe"
      • listener: ((src) => void)
          • (src): void
          • Parameters

            • src: Readable

            Returns void

      Returns ParserStream<T>

    • Parameters

      • event: string | symbol
      • listener: ((...args) => void)
          • (...args): void
          • Parameters

            • Rest ...args: any[]

            Returns void

      Returns ParserStream<T>

    • The writable.setDefaultEncoding() method sets the default encoding for a Writable stream.

      Parameters

      • encoding: BufferEncoding

        The new default encoding

      Returns ParserStream<T>

      Since

      v0.11.15

    • By default EventEmitters will print a warning if more than 10 listeners are added for a particular event. This is a useful default that helps finding memory leaks. The emitter.setMaxListeners() method allows the limit to be modified for this specific EventEmitter instance. The value can be set toInfinity (or 0) to indicate an unlimited number of listeners.

      Returns a reference to the EventEmitter, so that calls can be chained.

      Parameters

      • n: number

      Returns ParserStream<T>

      Since

      v0.3.5

    • The writable.uncork() method flushes all data buffered since cork was called.

      When using writable.cork() and writable.uncork() to manage the buffering of writes to a stream, defer calls to writable.uncork() usingprocess.nextTick(). Doing so allows batching of allwritable.write() calls that occur within a given Node.js event loop phase.

      stream.cork();
      stream.write('some ');
      stream.write('data ');
      process.nextTick(() => stream.uncork());

      If the writable.cork() method is called multiple times on a stream, the same number of calls to writable.uncork() must be called to flush the buffered data.

      stream.cork();
      stream.write('some ');
      stream.cork();
      stream.write('data ');
      process.nextTick(() => {
      stream.uncork();
      // The data will not be flushed until uncork() is called a second time.
      stream.uncork();
      });

      See also: writable.cork().

      Returns void

      Since

      v0.11.2

    • The writable.write() method writes some data to the stream, and calls the supplied callback once the data has been fully handled. If an error occurs, the callback will be called with the error as its first argument. The callback is called asynchronously and before 'error' is emitted.

      The return value is true if the internal buffer is less than thehighWaterMark configured when the stream was created after admitting chunk. If false is returned, further attempts to write data to the stream should stop until the 'drain' event is emitted.

      While a stream is not draining, calls to write() will buffer chunk, and return false. Once all currently buffered chunks are drained (accepted for delivery by the operating system), the 'drain' event will be emitted. Once write() returns false, do not write more chunks until the 'drain' event is emitted. While calling write() on a stream that is not draining is allowed, Node.js will buffer all written chunks until maximum memory usage occurs, at which point it will abort unconditionally. Even before it aborts, high memory usage will cause poor garbage collector performance and high RSS (which is not typically released back to the system, even after the memory is no longer required). Since TCP sockets may never drain if the remote peer does not read the data, writing a socket that is not draining may lead to a remotely exploitable vulnerability.

      Writing data while the stream is not draining is particularly problematic for a Transform, because the Transform streams are paused by default until they are piped or a 'data' or 'readable' event handler is added.

      If the data to be written can be generated or fetched on demand, it is recommended to encapsulate the logic into a Readable and use pipe. However, if calling write() is preferred, it is possible to respect backpressure and avoid memory issues using the 'drain' event:

      function write(data, cb) {
      if (!stream.write(data)) {
      stream.once('drain', cb);
      } else {
      process.nextTick(cb);
      }
      }

      // Wait for cb to be called before doing any other write.
      write('hello', () => {
      console.log('Write completed, do more writes now.');
      });

      A Writable stream in object mode will always ignore the encoding argument.

      Parameters

      • chunk: any

        Optional data to write. For streams not operating in object mode, chunk must be a string, Buffer or Uint8Array. For object mode streams, chunk may be any JavaScript value other than null.

      • Optional callback: ((error) => void)

        Callback for when this chunk of data is flushed.

          • (error): void
          • Parameters

            • error: undefined | null | Error

            Returns void

      Returns boolean

      false if the stream wishes for the calling code to wait for the 'drain' event to be emitted before continuing to write additional data; otherwise true.

      Since

      v0.9.4

    • Parameters

      • chunk: any
      • encoding: BufferEncoding
      • Optional callback: ((error) => void)
          • (error): void
          • Parameters

            • error: undefined | null | Error

            Returns void

      Returns boolean

    • Experimental

      A utility method for creating a Writable from a web WritableStream.

      Parameters

      • writableStream: WritableStream<any>
      • Optional options: Pick<WritableOptions, "signal" | "decodeStrings" | "highWaterMark" | "objectMode">

      Returns Writable

      Since

      v17.0.0

    • Returns a copy of the array of listeners for the event named eventName.

      For EventEmitters this behaves exactly the same as calling .listeners on the emitter.

      For EventTargets this is the only way to get the event listeners for the event target. This is useful for debugging and diagnostic purposes.

      const { getEventListeners, EventEmitter } = require('events');

      {
      const ee = new EventEmitter();
      const listener = () => console.log('Events are fun');
      ee.on('foo', listener);
      getEventListeners(ee, 'foo'); // [listener]
      }
      {
      const et = new EventTarget();
      const listener = () => console.log('Events are fun');
      et.addEventListener('foo', listener);
      getEventListeners(et, 'foo'); // [listener]
      }

      Parameters

      • emitter: EventEmitter | _DOMEventTarget
      • name: string | symbol

      Returns Function[]

      Since

      v15.2.0, v14.17.0

    • A class method that returns the number of listeners for the given eventNameregistered on the given emitter.

      const { EventEmitter, listenerCount } = require('events');
      const myEmitter = new EventEmitter();
      myEmitter.on('event', () => {});
      myEmitter.on('event', () => {});
      console.log(listenerCount(myEmitter, 'event'));
      // Prints: 2

      Parameters

      • emitter: EventEmitter

        The emitter to query

      • eventName: string | symbol

        The event name

      Returns number

      Since

      v0.9.12

      Deprecated

      Since v3.2.0 - Use listenerCount instead.

    • const { on, EventEmitter } = require('events');

      (async () => {
      const ee = new EventEmitter();

      // Emit later on
      process.nextTick(() => {
      ee.emit('foo', 'bar');
      ee.emit('foo', 42);
      });

      for await (const event of on(ee, 'foo')) {
      // The execution of this inner block is synchronous and it
      // processes one event at a time (even with await). Do not use
      // if concurrent execution is required.
      console.log(event); // prints ['bar'] [42]
      }
      // Unreachable here
      })();

      Returns an AsyncIterator that iterates eventName events. It will throw if the EventEmitter emits 'error'. It removes all listeners when exiting the loop. The value returned by each iteration is an array composed of the emitted event arguments.

      An AbortSignal can be used to cancel waiting on events:

      const { on, EventEmitter } = require('events');
      const ac = new AbortController();

      (async () => {
      const ee = new EventEmitter();

      // Emit later on
      process.nextTick(() => {
      ee.emit('foo', 'bar');
      ee.emit('foo', 42);
      });

      for await (const event of on(ee, 'foo', { signal: ac.signal })) {
      // The execution of this inner block is synchronous and it
      // processes one event at a time (even with await). Do not use
      // if concurrent execution is required.
      console.log(event); // prints ['bar'] [42]
      }
      // Unreachable here
      })();

      process.nextTick(() => ac.abort());

      Parameters

      • emitter: EventEmitter
      • eventName: string

        The name of the event being listened for

      • Optional options: StaticEventEmitterOptions

      Returns AsyncIterableIterator<any>

      that iterates eventName events emitted by the emitter

      Since

      v13.6.0, v12.16.0

    • Creates a Promise that is fulfilled when the EventEmitter emits the given event or that is rejected if the EventEmitter emits 'error' while waiting. The Promise will resolve with an array of all the arguments emitted to the given event.

      This method is intentionally generic and works with the web platform EventTarget interface, which has no special'error' event semantics and does not listen to the 'error' event.

      const { once, EventEmitter } = require('events');

      async function run() {
      const ee = new EventEmitter();

      process.nextTick(() => {
      ee.emit('myevent', 42);
      });

      const [value] = await once(ee, 'myevent');
      console.log(value);

      const err = new Error('kaboom');
      process.nextTick(() => {
      ee.emit('error', err);
      });

      try {
      await once(ee, 'myevent');
      } catch (err) {
      console.log('error happened', err);
      }
      }

      run();

      The special handling of the 'error' event is only used when events.once()is used to wait for another event. If events.once() is used to wait for the 'error' event itself, then it is treated as any other kind of event without special handling:

      const { EventEmitter, once } = require('events');

      const ee = new EventEmitter();

      once(ee, 'error')
      .then(([err]) => console.log('ok', err.message))
      .catch((err) => console.log('error', err.message));

      ee.emit('error', new Error('boom'));

      // Prints: ok boom

      An AbortSignal can be used to cancel waiting for the event:

      const { EventEmitter, once } = require('events');

      const ee = new EventEmitter();
      const ac = new AbortController();

      async function foo(emitter, event, signal) {
      try {
      await once(emitter, event, { signal });
      console.log('event emitted!');
      } catch (error) {
      if (error.name === 'AbortError') {
      console.error('Waiting for the event was canceled!');
      } else {
      console.error('There was an error', error.message);
      }
      }
      }

      foo(ee, 'foo', ac.signal);
      ac.abort(); // Abort waiting for the event
      ee.emit('foo'); // Prints: Waiting for the event was canceled!

      Parameters

      • emitter: _NodeEventTarget
      • eventName: string | symbol
      • Optional options: StaticEventEmitterOptions

      Returns Promise<any[]>

      Since

      v11.13.0, v10.16.0

    • Parameters

      • emitter: _DOMEventTarget
      • eventName: string
      • Optional options: StaticEventEmitterOptions

      Returns Promise<any[]>

    • const {
      setMaxListeners,
      EventEmitter
      } = require('events');

      const target = new EventTarget();
      const emitter = new EventEmitter();

      setMaxListeners(5, target, emitter);

      Parameters

      • Optional n: number

        A non-negative number. The maximum number of listeners per EventTarget event.

      • Rest ...eventTargets: (EventEmitter | _DOMEventTarget)[]

      Returns void

      Since

      v15.4.0

    • Experimental

      A utility method for creating a web WritableStream from a Writable.

      Parameters

      • streamWritable: Writable

      Returns WritableStream<any>

      Since

      v17.0.0

    Generated using TypeDoc