WebKB Dataset
All Datasets: boost-starai/BoostSRL-Datasets
by: Nandini Ramanan, Alexander L. Hayes
<< "CiteSeer" | BoostSRL Wiki | "Toy Cancer" >>
The Alchemy WebKB Dataset was adapted from a dataset by the same name from Mark Craven's website (from the University of Wisconsin-Madison). WebKB consists of web pages and hyperlinks "from four computer science departments: Cornell University, The University of Texas, The University of Washington, and The University of Wisconsin."
This version contains the necessary background and train/test folders.
Target: faculty
The facts contain information on five labels: courseprof
, courseta
, project
, sameperson
, student
.
Table of Contents - BoostSRL Wiki
Download: WebKB.zip (41.1 KB)
-
md5sum
: 977e62fca51bfa7fe9c27bdf8af5d478 -
sha256sum
: 7b36e85cc99483a98c68fc868ba9890398339eaca20b48b80e4b56d16ddc1522
Table of Contents - BoostSRL Wiki
Linux/Mac:
- After downloading, unzip WebKB.zip
unzip WebKB.zip
- If you're using a jar file, move it into the WebKB directory:
mv (jar file) WebKB/
- Learning:
java -jar BoostSRL.jar -l -train train/ -target faculty -trees 10
- Inference:
java -jar BoostSRL.jar -i -test test/ -model train/models/ -target faculty -trees 10
Windows:
(Coming soon)
Table of Contents - BoostSRL Wiki
setParam: loadAllLibraries = false.
setParam: treeDepth=3.
setParam: nodeSize=3.
setParam: numOfClauses=8.
mode:courseprof(-Course, +Person).
mode:courseprof(+Course, -Person).
mode: courseta(+Course, -Person).
mode: courseta(-Course, +Person).
mode:faculty(+Person).
mode:project(-Proj, +Person).
mode:project(+Proj, -Person).
mode:sameperson(-Person, +Person).
mode:student(+Person).
Table of Contents - BoostSRL Wiki
BoostSRL Wiki
Home
BoostSRL Basics
- Getting Started
- File Structure
- Basic Usage Parameters
- Advanced Usage Parameters
- Basic Modes Guide
- Advanced Modes Guide
Deep dive into BoostSRL
- Default (RDN-Boost)
- MLN-Boost
- Regression
- Cost-sensitive SRL
- Learning with Advice
- Approximate Counting
- One-class Classification (coming soon)
- Discretization of Continuous Valued Attributes
- Lifted Relational Random Walks
- Grounded Relational Random Walks
Datasets
Applications of BoostSRL