Releases: html-extract/hext
Hext v1.0.8
Install the htmlext command-line utility and Hext for Python (v3.12 or earlier):
pip install hext
Install Hext for Node (v21 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and macOS (≥10.11 for x86_64, ≥11.0 for ARM64/M1/M2).
Install Hext for JavaScript (Node/Browser on any platform):
npm install hext.js
For other installation methods please see the download page for Hext.
Changes
- New releases for macOS on ARM64, built with the new M1 GitHub runner.
- Node, Linux: Fixed accidental dependency on libgcc and libstdc++.
- Node, Linux: Fixed accidental dependency on GLIBC2.34. Only GLIBC2.14 is required, as before.
Hext v1.0.6
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.12 or earlier):
pip install hext
Install Hext for Node (v21 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).
Hext for JavaScript
Hext.js v1.0.6
See html-extract/hext.js for more.
Changes
- Added release for Node v21 and Python v3.12
- Dropped support for Node v19 and v16
- Dropped support for Python v3.7
- Updated boost from 1.82.0 to 1.83.0
- Improved compatibility with libgumbo forks, specifically the libgumbo fork that is used in Arch (07d08ce)
Hext v1.0.5
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.11 or earlier):
pip install hext
Install Hext for Node (v20 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).
Hext for JavaScript
Hext.js v1.0.5
See html-extract/hext.js for more.
Changes
- Added release for node v20
- Dropped support for node v14
- Updated boost from 1.80.0 to 1.82.0
- Removed hext-ruby and hext-php from continuous integration
Hext v1.0.4
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.11 or earlier):
pip install hext
Install Hext for Node (v19 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).
Hext for JavaScript
Hext.js v1.0.4
See html-extract/hext.js for more.
Changes
- Added release for node v19, python v3.11
- Dropped support for node v17, python v3.6
- Updated boost from 1.79.0 to 1.80.0
Hext v1.0.3
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):
pip install hext
Install Hext for Node (v18 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.
Hext for JavaScript
Hext.js v1.0.3
See html-extract/hext.js for more.
Changes
- Added release for node v18
- Dropped support for node v12
- Updated boost from 1.78.0 to 1.79.0
Hext v1.0.2
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):
pip install hext
Install Hext for Node (v17 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.
Hext for JavaScript
Hext.js v1.0.2
See html-extract/hext.js for more.
Changes
- Hext for Node: rewrite with node-addon-api (9aea455)
- Hext for JavaScript: Improved API (html-extract/hext.js#6)
- Hext website: Improved download page
- libhext: Fix warnings on newer compilers
Hext v1.0.1
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):
pip install hext
Install Hext for Node (v17 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.
Hext for WebAssembly
Hext Emscripten v1.0.1
See html-extract/hext-emscripten for more.
Changes
- Language bindings: rule.extract now supports the optional parameter max_searches, which is of type unsigned int. The search for matching elements is aborted after this limit is reached. The default is 0, which never aborts. If running untrusted hext templates, I recommend setting max_searches to some high value, like 10000, to protect against resource exhaustion.
- Consistent versions: htmlext, libhext and pypi now share the same version number, enforced by CI. The NPM package has the same version but incremented by 10.
- Refactored build and CI scripts
Hext v1.0.0
Changes
- New syntax: Nested rules (#22)
    # match <div> elements that have a descendant <a> at any depth
    <div>
      { <a/> }
    </div>
- Abort extraction after a specified number of searches (4dff797): Added a new parameter max_searches to Rule::extract and htmlext. It is disabled by default (value 0). If running untrusted hext templates, I recommend setting max_searches to some high value, like 10000, to protect against resource exhaustion. Nested rules can cause nasty runtime performance (see #22 for an example).
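To illustrate why a search limit helps with nested rules, here is a toy model in Python. It is not libhext's algorithm: the names Node, Budget, SearchBudgetExceeded and count_divs_with_link are hypothetical, and what counts as one "search" here (one node visit) is an assumption. The sketch only shows how a descendant match repeated for every nested element grows faster than the page itself, and how a limit with 0 meaning "never abort" stops it.

```python
from dataclasses import dataclass, field


class SearchBudgetExceeded(Exception):
    """Hypothetical error raised when the search budget is used up."""


@dataclass
class Node:
    tag: str
    children: list = field(default_factory=list)


class Budget:
    """Counts node visits; a limit of 0 never aborts (assumed to mirror max_searches)."""

    def __init__(self, max_searches: int = 0):
        self.max_searches = max_searches
        self.used = 0

    def spend(self) -> None:
        self.used += 1
        if self.max_searches and self.used > self.max_searches:
            raise SearchBudgetExceeded(f"gave up after {self.max_searches} searches")


def has_descendant(node: Node, tag: str, budget: Budget) -> bool:
    """Depth-first search for a descendant with the given tag."""
    for child in node.children:
        budget.spend()
        if child.tag == tag or has_descendant(child, tag, budget):
            return True
    return False


def count_divs_with_link(root: Node, budget: Budget) -> int:
    """Count <div> elements that contain an <a> at any depth."""
    budget.spend()
    hit = 1 if root.tag == "div" and has_descendant(root, "a", budget) else 0
    return hit + sum(count_divs_with_link(c, budget) for c in root.children)


def nested_divs(depth: int) -> Node:
    """Build <div><div>...<span/>...</div></div> with no <a> anywhere."""
    node = Node("span")
    for _ in range(depth):
        node = Node("div", [node])
    return node


page = nested_divs(50)  # 51 nodes in total

unlimited = Budget()  # limit 0: never abort
count_divs_with_link(page, unlimited)
print(unlimited.used, "node visits for 51 nodes")

try:
    count_divs_with_link(page, Budget(max_searches=100))
except SearchBudgetExceeded as err:
    print("aborted:", err)
```

Every nested div repeats the descendant search over its whole subtree, so the work grows roughly with the square of the nesting depth in this model. A limit turns that into a bounded cost at the price of an aborted extraction.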
Hext v0.8.3
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.8 or earlier):
pip install hext
Install Hext for Node (v12 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.
Hext for WebAssembly
See https://github.com/html-extract/hext-emscripten/releases
Changes
- Rules can now match custom-tags
- Custom-tags are matched in a case-insensitive manner
Releases can be verified with the public key at https://thomastrapp.com/public_key.asc, which has the key ID 086653AA8CC7270E and the fingerprint E6EA EFD0 2CBB 0EFF C010 1324 0866 53AA 8CC7 270E.
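As a quick sanity check before trusting a downloaded key, the key ID and fingerprint quoted above can be compared with each other. The sketch below assumes an OpenPGP v4 key, where the long key ID is the last 16 hex digits of the 40-digit fingerprint. The helper name long_key_id is hypothetical. This only checks that the two published values agree; it does not replace verifying a signature with an OpenPGP tool.

```python
def long_key_id(fingerprint: str) -> str:
    """Return the 64-bit key ID for a 40-hex-digit (v4) OpenPGP fingerprint."""
    hex_digits = "".join(fingerprint.split()).upper()
    if len(hex_digits) != 40 or any(c not in "0123456789ABCDEF" for c in hex_digits):
        raise ValueError("expected a fingerprint of 40 hex digits")
    # For v4 keys the key ID is the low-order 64 bits of the fingerprint.
    return hex_digits[-16:]


# Values copied from the release notes.
FINGERPRINT = "E6EA EFD0 2CBB 0EFF C010 1324 0866 53AA 8CC7 270E"
KEY_ID = "086653AA8CC7270E"

print("key ID matches fingerprint:", long_key_id(FINGERPRINT) == KEY_ID)
```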
Hext v0.8.2
Notable but minor changes:
- make install now uses CMake's GNUInstallDirs. This allows for finer control of what gets installed where, when configuring the project (67afed7).
- htmlext's Version.cpp and libhext's Version.cpp are generated out of the source tree (9e8125e).
- Removed libhext's custom Doxygen theme (93dab11).
Releases can be verified with the public key at http://thomastrapp.com/public_key.asc, which has the key ID 086653AA8CC7270E and the fingerprint E6EA EFD0 2CBB 0EFF C010 1324 0866 53AA 8CC7 270E.