Releases: html-extract/hext

Hext v1.0.8

09 Nov 15:58

Install the htmlext command-line utility and Hext for Python (v3.12 or earlier):

pip install hext

Install Hext for Node (v21 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and macOS (≥10.11 for x86_64, ≥11.0 for ARM64/M1/M2).
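
As a rough pre-install check, the compatibility statement above can be encoded in a few lines of Python. This is only a sketch: the function name prebuilt_hext_expected is hypothetical, the minimum Python version is not stated here and so is not checked, and the minimum macOS versions are left out.

```python
import platform
import sys

# Platforms named in the release notes for the prebuilt packages.
# The machine strings are what platform.machine() typically reports.
SUPPORTED = {
    ("Linux", "x86_64"),
    ("Darwin", "x86_64"),
    ("Darwin", "arm64"),
}
MAX_PYTHON = (3, 12)  # "v3.12 or earlier"


def prebuilt_hext_expected(py_version, system, machine):
    """Return True if the notes suggest a prebuilt package for this setup."""
    return tuple(py_version[:2]) <= MAX_PYTHON and (system, machine) in SUPPORTED


if __name__ == "__main__":
    ok = prebuilt_hext_expected(
        sys.version_info, platform.system(), platform.machine()
    )
    print("prebuilt package expected" if ok else "see the download page")
```

If the check fails, the other installation methods on the download page are the place to look.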

Install Hext for JavaScript (Node/Browser on any platform):

npm install hext.js

For other installation methods please see the download page for Hext.

Changes

  • New releases for macOS on ARM64, built with the new M1 GitHub runner.
  • Node, Linux: Fixed accidental dependency on libgcc and libstdc++.
  • Node, Linux: Fixed accidental dependency on GLIBC 2.34. Only GLIBC 2.14 is required, as before.

Hext v1.0.6

17 Oct 22:40

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.12 or earlier):

pip install hext

Install Hext for Node (v21 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).

Hext for JavaScript

Hext.js v1.0.6
See html-extract/hext.js for more.

Changes

  • Added release for Node v21 and Python v3.12
  • Dropped support for Node v19 and v16
  • Dropped support for Python v3.7
  • Updated Boost from 1.82.0 to 1.83.0
  • Improved compatibility with libgumbo forks (specifically the libgumbo fork that is used in Arch) 07d08ce

Hext v1.0.5

21 Apr 16:05

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.11 or earlier):

pip install hext

Install Hext for Node (v20 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).

Hext for JavaScript

Hext.js v1.0.5
See html-extract/hext.js for more.

Changes

  • Added release for Node v20
  • Dropped support for Node v14
  • Updated Boost from 1.80.0 to 1.82.0
  • Removed hext-ruby and hext-php from continuous integration

Hext v1.0.4

29 Oct 09:00

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.11 or earlier):

pip install hext

Install Hext for Node (v19 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).

Hext for JavaScript

Hext.js v1.0.4
See html-extract/hext.js for more.

Changes

  • Added release for Node v19 and Python v3.11
  • Dropped support for Node v17 and Python v3.6
  • Updated Boost from 1.79.0 to 1.80.0

Hext v1.0.3

22 Apr 13:36

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):

pip install hext

Install Hext for Node (v18 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.

Hext for JavaScript

Hext.js v1.0.3
See html-extract/hext.js for more.

Changes

  • Added release for Node v18
  • Dropped support for Node v12
  • Updated Boost from 1.78.0 to 1.79.0

Hext v1.0.2

09 Jan 18:17

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):

pip install hext

Install Hext for Node (v17 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.

Hext for JavaScript

Hext.js v1.0.2
See html-extract/hext.js for more.

Changes

Hext v1.0.1

17 Dec 17:19

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):

pip install hext

Install Hext for Node (v17 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.

Hext for WebAssembly

Hext Emscripten v1.0.1
See html-extract/hext-emscripten for more.

Changes

  • Language bindings: rule.extract now supports the optional parameter max_searches, which is of type unsigned int. The search for matching elements is aborted after this limit is reached. The default is 0, which never aborts. If running untrusted hext templates, I recommend setting max_searches to some high value, like 10000, to protect against resource exhaustion.
  • Consistent versions: htmlext, libhext and the PyPI package now share the same version number, enforced by CI. The npm package has the same version but incremented by 10.
  • Refactored build and CI scripts

Hext v1.0.0

12 Dec 23:05

Changes

  • New syntax: Nested rules ( #22 )
    # match <div> elements that have a descendant <a> at any depth
    <div> { <a/> } </div>
    
  • Abort extraction after a specified number of searches (4dff797): Added a new parameter max_searches to Rule::extract and htmlext. It is disabled by default (value 0). If running untrusted hext templates, I recommend setting max_searches to some high value, like 10000, to protect against resource exhaustion. Nested rules can cause very poor runtime performance (see #22 for an example).
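
To illustrate the idea behind max_searches, here is a toy Python sketch of a descendant search with a budget. It is not Hext's implementation and does not use the Hext API: the Node class, the count_matches function and the SearchLimitExceeded exception are all made up for this illustration. Only the convention that 0 disables the limit is taken from the text above.

```python
from dataclasses import dataclass, field


class SearchLimitExceeded(RuntimeError):
    """Raised when the search budget is used up (hypothetical name)."""


@dataclass
class Node:
    tag: str
    children: list = field(default_factory=list)


def count_matches(root, tag, max_searches=0):
    """Count descendants of `root` named `tag`, visiting at most max_searches nodes.

    max_searches == 0 means "no limit", mirroring the default described above.
    """
    searches = 0
    matches = 0
    stack = list(root.children)
    while stack:
        node = stack.pop()
        searches += 1
        # Abort as soon as the number of visited nodes exceeds the budget.
        if max_searches and searches > max_searches:
            raise SearchLimitExceeded(f"aborted after {max_searches} searches")
        if node.tag == tag:
            matches += 1
        stack.extend(node.children)
    return matches


# A <div> with three descendants, two of which are <a>.
tree = Node("div", [Node("p", [Node("a")]), Node("a")])
print(count_matches(tree, "a"))  # no limit: all three descendants are visited
try:
    count_matches(tree, "a", max_searches=2)
except SearchLimitExceeded as err:
    print(err)
```

A hard budget like this trades completeness for a predictable upper bound on work, which is the point when the template comes from an untrusted source.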

Hext v0.8.3

09 Nov 15:32

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.8 or earlier):

pip install hext

Install Hext for Node (v12 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.

Hext for WebAssembly

See https://github.com/html-extract/hext-emscripten/releases

Changes

  • Rules can now match custom-tags
  • Custom-tags are matched in a case-insensitive manner

Releases can be verified with the public key at https://thomastrapp.com/public_key.asc, which has the key ID 086653AA8CC7270E and the fingerprint E6EA EFD0 2CBB 0EFF C010 1324 0866 53AA 8CC7 270E.
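
Before trusting an imported key, it can help to confirm that the key ID and fingerprint quoted above belong together. For OpenPGP v4 keys, the 64-bit key ID is the last 16 hex digits of the 160-bit fingerprint. The sketch below checks only that relationship, assuming a v4 key; the helper name is hypothetical, and this does not replace verifying a release signature with an OpenPGP tool.

```python
def key_id_matches_fingerprint(key_id, fingerprint):
    """True if key_id equals the last 16 hex digits of a v4 fingerprint."""
    fp = fingerprint.replace(" ", "").upper()
    if len(fp) != 40:
        raise ValueError("expected a 40-digit (v4) fingerprint")
    return fp[-16:] == key_id.replace(" ", "").upper()


KEY_ID = "086653AA8CC7270E"
FINGERPRINT = "E6EA EFD0 2CBB 0EFF C010 1324 0866 53AA 8CC7 270E"
print(key_id_matches_fingerprint(KEY_ID, FINGERPRINT))  # True
```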

Hext v0.8.2

23 Jul 20:40

Notable but minor changes:

  • make install now uses CMake's GNUInstallDirs. This allows for finer control of what gets installed where, when configuring the project (67afed7).
  • htmlext's Version.cpp and libhext's Version.cpp are generated out of the source tree (9e8125e).
  • Removed libhext's custom Doxygen theme (93dab11).

Releases can be verified with the public key at http://thomastrapp.com/public_key.asc, which has the key ID 086653AA8CC7270E and the fingerprint E6EA EFD0 2CBB 0EFF C010 1324 0866 53AA 8CC7 270E.