Back to Explore

html-to-text/node-html-to-text

GitHub
1 updates · last 90 days1 watchersOpen source

Last release: 1 month ago

Advanced converter that turns HTML into plain text, useful for formatting and extracting readable text from HTML content such as emails. Includes a CLI tool for running the conversion from the command line, and a monorepo with related packages.

Project status

  • The source appears actively maintained, with a very recent upstream push on 2026-05-02 and a recent update published as 10.0.0 on 2026-04-30.
  • The apparent update cadence looks bursty rather than frequent, with the last clearly described update before 10.0.0 dating back to 2023-03-23 (9.0.5), suggesting multi year gaps between updates.

AI summary generated Today

AI-generated from public sources. May be inaccurate. Report

Recent updates

  • 10.0.0

    1 month ago

    Release notes for version 10.0.0 are not provided. The code diff shows a broader modernization of the monorepo, including major dependency upgrades, a new centralized options composition module, and updated ESM import specifiers across packages.

    Features
  • 9.0.5

    3/23/2023

    Release 9.0.5 (html-to-text) and the related CLI (0.5.4) update core HTML parsing dependencies and adjust the build/test scripts to use npm workspaces. The included code changes and added tests indicate fixes to HTML entity handling after special tags and expanded support for escaped characters in selector strings.

    Features
  • 9.0.4

    2/10/2023

    Release 9.0.4 was published on 2023-02-10, but no release notes were provided by the publisher. As a result, this release cannot be analyzed for new features, breaking changes, bug fixes, security updates, or dependency changes based on the provided information.

  • 9.0.3

    12/14/2022

    Release 9.0.3 contains no published release notes. The diff mainly updates dependencies (notably adding dom-serializer and bumping mocha for dev), bumps package versions, and updates documentation to describe new support for a metadata object in html-to-text conversion.

    Features
  • 9.0.2

    12/7/2022

    This release (9.0.2) introduces changes to how the `encodeCharacters` option is interpreted when generating replacements during HTML to text/markdown conversion. It also adds a new escaping utility (`unicodeEscape`) and adjusts internal regex generation logic to handle unicode code points more robustly.

    Features
  • 9.0.1

    12/6/2022

    Release 9.0.1 contains no publisher release notes. The code diff shows a small but potentially breaking runtime signature change in html-to-text’s BlockTextBuilder.closeTable, plus several documentation/test updates and a CLI dependency fix.

  • 9.0.0

    12/2/2022

    Release 9.0.0 appears to be a major internal refactor, moving the core conversion logic into a new packages/base implementation with an ES module style codebase. It also introduces new output targets (for example, an html-to-md package) and expands the formatting pipeline with additional helpers and wrapping controls. No publisher release notes were provided, so the code changes below are effectively undocumented.

    BreakingFeatures
  • 8.2.1

    7/30/2022

    No release notes were provided for version 8.2.1, so there is no documented guidance on what changed. The code diff from 7.1.3 to 8.2.1 shows substantial runtime refactoring, including a selectors-based architecture, changes to link and image bracket formatting options, and updated whitespace handling for preserveNewlines.

  • 7.1.3

    7/30/2022

    Release 7.1.3 has no publisher-provided release notes. The actual diff shows the package version was updated to 7.1.3 and npm dependencies were bumped, with package-lock.json regenerated accordingly.

  • 8.2.0

    4/7/2022

    Release 8.2.0 was published on 2022-04-07, but the publisher did not provide any release notes or detailed change log. As a result, there is no actionable information here about new features, bug fixes, breaking changes, security updates, or dependency changes.