Skip to content

html-extract/hext-on-websockets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hext on Websockets

Hext is a domain-specific language for extracting structured data from HTML documents.

This is the websocket server behind the "Try Hext in your Browser!" section.

Dependencies

  • CMake
  • OpenSSL
  • Boost
  • RapidJSON
  • Hext
  • Optional: libseccomp
  • Testing: websocat, jq

Build

cd build
cmake -DCMAKE_BUILD_TYPE=Release .. && make -j8
ls -lh ./hextws

Build with seccomp enabled (see the list of allowed system calls in SetupSeccomp.cpp, YMMV):

cd build
cmake -DCMAKE_BUILD_TYPE=Release -DHEXTWS_SECCOMP=On .. && make -j8
ls -lh ./hextws

Usage

./hextws <address> <port> <num-threads> <max-searches> \
    <path-to-cert> <path-to-key> <path-to-dhparam>

License

All source code of this project is released under the Apache License v2.0.

About

Websocket Server for Hext. Hext is a domain-specific language for extracting structured data from HTML documents.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published