Judger of Yao OJ

It's worth known that this project is still under development, thus everything is varying :).

（大概）是新一代 OJ 评测模块！

基于 kafel 的一个沙箱模块，支持资源用度报告。主要用于 OI/ACM 的代码评测。

项目核心：在限制条件下执行某一程序（可能带参数）并得到相应的运行结果分析。

目前只支持在 x86_64 架构上运行。

Features

使用 C 语言编写，直接调用系统 API
模块化架构，易于扩展，目前已支持：传统题、交互题、任意命令的执行
使用 kafel 配置更易读的 syscall 限制策略，并添加扩展语法以实现在运行时动态加载策略
rlimit + rusage + pthread 全面限制/监控进程的资源使用
可以实现大家梦寐以求（？）的 CCF 数组开爆的功能（即数组超过空间限制直接 MLE）

Getting Start

Download

右转 Github Release 获取构建好的二进制文件和静态链接库。

Build from source

在构建项目之前请确保您的 Linux 系统安装有

make（GNU Make）
flex（the fast lexical analyser generator）
ausyscall（a program that allows mapping syscall names and numbers）
clang 系列命令
gengetopt

命令。或者（前提有 make）手动在项目根目录执行 make check_buildenv 看是否报错。

接下在项目根目录执行：

make all

Usage

一个简单易懂的例子见 tests/stress/01_OK。

$ ./yaoj-judger --detailed-help
Usage: yaoj-judger [OPTION]... [arguments]...
judger for the future yaoj

A set of program runners with resource limitation, syscall limitation and final
status report.

  -h, --help                    Print help and exit
      --detailed-help           Print help, including all details and hidden
                                  options, and exit
  -V, --version                 Print version and exit
  -j, --judger=judgername       specify which judger to use  (possible
                                  values="interactive", "general")
                                  (required)
  -r, --result=string           predict judgement result  (possible
                                  values="OK", "RE", "MLE", "TLE",
                                  "OLE", "SE", "DSC", "ECE")
  
    Meanings of those shortname:
      OK: all correct
      RE: runtime error
      MLE: memory limitation exceed
      TLE: time limitation exceed
      OLE: output limitation exceed
      SE: system error, aka judger error
      DSC: dangerous system call
      ECE: exit code error
    
      --log=filename            specify judger result file (required)
      --log-color               whether display colorful log  (default=off)
  -p, --policy=filename         specify policy name (required)
  
    Note that if using builtin policy, add 'builtin:' prefix to policy's name.
    
  -P, --policy-dir=filename     specify policy search directory, depend on
                                  'policy' option  (default=`.')
  
    If using builtin policy, this option is meaningless.
    
      --json                    output judgement result to stdout in JSON
                                  format  (default=off)
      --fork                    fork a new process to perform  (default=off)

Resource Limitations:
  note that 'timeout' and 'memory' option can be override by their
  corresponding detailed options, such as realtime, stack-memory.
  -t, --timeout=integer         specify both time limits in milliseconds
      --realtime=integer        specify the runtime limit in milliseconds
      --cputime=integer         specify the cpu limit in milliseconds
  -m, --memory=integer          specify all three memory limits in MB
      --virtual-memory=integer  specify the virtual memory limit in MB
      --real-memory=integer     specify the actual memory limit in MB
      --stack-memory=integer    specify the stack memory limit in MB
  -g, --output-size=integer     specify the output limit in MB
  -f, --fileno=integer          specify the maximum number of opened fd

Resource Limitation

realtime 指实际的运行时间，例如包括 sleep 占用的时间。而 cputime 只考虑 cpu 使用时间。事实上如今大多数 OJ 都是限制 cputime，realtime 一般做为一个超时阈值。毕竟考虑到多个进程同时评测，realtime 很多时候是不准的。timeout 会同时设置 realtime 和 cputime 的限制。

virtual-memory 表示对进程内存地址空间的限制，而 real-memory 表示对实际使用的内存的限制。例如大多数 OI 选手所知，数组开了很大一般是用不完的，而大多数 OJ 显示的是实际被使用的内存，也是在这上面做出的限制。而 CCF 评测会出现数组开爆的情况，大概是因为他们对前者也做出了相同的限制。stack-memory 则是栈空间限制。memory 会同时设置上述三个限制。

output-size 则限制进程创建文件的大小，即输出限制。

Notes

另外如果想要安装为系统命令：

make all
sudo make install

删除安装的系统命令

sudo make uninstall

内建规则见 src/builtin_policy 目录。

对于更多使用方法，您可以去 tests/ 了解一下！

如果你在尝试链接库文件来自己写 main，那么在编译时需加上 -Lpath/to/libyjudger -lpthread -lyjudger，并且链接的顺序有时候会影响编译结果（Why does the order in which libraries are linked sometimes cause errors in GCC?）。解决方法就是穷举……一般来说 -lyjudger 放最前面就可以。

Development

首先你需要有 clang。

code coverage（需要 gcovr）：

make clean
make gcovr

可以在 local.cov 文件夹下查看代码覆盖情况。

文档生成（需要 doxygen）：

make docs

可以在 docs/web 里查看文档。

测试：

make test

会执行所有的测试。

Design

早期思路来源于 QingdaoU/Judger，在此鸣谢。在此基础上引入 kafel 以更友好的方式配置系统调用规则，并重新整理了执行逻辑。

首先分析一下 OJ 判题的流程。一个题目有若干个测试点，可能有子任务依赖关系，这些都是顶层逻辑。下一层就是在沙箱中执行程序，而执行程序是一个通用的模块，囊括编译、执行和检查等的步骤。执行单个程序的过程中不会有与外界的信息交换，但在执行两个程序之间可以进行信息交换（比如看看当前评测到哪个点了），这部分也可以交给顶层逻辑完成。

而执行一个程序要考虑三个要素：资源限制（时间、空间）、系统安全（系统调用的限制）、数据处理（执行程序获得输出）。

容易发现三个要素互不相关。其中资源限制是高度通用的模块，适用于几乎所有地方。系统调用的限制则需要考虑不同的执行过程以应用不同的 policy，也相对通用。而数据处理的部分则稍显复杂。不同程序的传参不同，并且 IO 的配置也有区别。

不过考虑到数据处理的情况是有限的，目前来看 OI 评测有以下几种代码：

标准输入输出型（std_io）: exec + input_file + output_file + error_file
文件 IO 型（file_io）: exec + input_file + output_file + error_file
交互型（interaction）: exec + interactor + input_file(both) + output_file(interactor) + error_file(both)
通用型（代码编译、执行一段命令、testlib）（general）: execve() + input_file + output_file + error_file

Hooks

为了处理资源限制和系统安全限制，我们引入 hooks 框架（其实也可以理解为一个 event 模式），一定程度上规范了程序的评测过程。简单来说所有对当前进程施加的影响都可以封装为一个模块来调用。而把模块挂载（register）到它该执行的阶段就是 hook。

一个典型的 judger 应该具有以下过程：

graph TD
init(initialize) --> bfh(hook:before_fork)
bfh --> fork{fork} -->|parent| afh(hook:after_fork)
rd -.-> afh --> c_t{{child_terminate}} --> awh(hook:after_wait)
fork -->|child| cafh(child_prework) --> rd{{ready}} --> child_run -.-> c_t

除了 child 的部分需要一些逻辑实现，其他部分都可以用 hook 的形式挂载到框架上。原理在于，父进程不会与子进程有通讯，它只需要明确：

子进程的较为准确的开始（方便计时）
子进程的结束以及其状态码（结果判定）

对于第一个任务，我们采用无名管道的方式，在 child_prework 完成前一直阻塞父进程，直到 ready 信号发出。这样 after_fork 的第一个 hook 执行的时间就非常接近于 child_run 的执行时间。

对于第二个任务，使用 wait 和 getrusage 即可。前者需要逻辑方面的实现，而后者也可以封装为 hook。

Policy

对于系统安全方面的限制，可以想到 seccomp，以及其 bpf 等。本项目采用更加人性化的方式：kafel 规则。它可读且易于扩展。

当然，有一些单纯的 kafel 做不了的需求，例如动态的规则内容。这时我们可以在 kafel 编译前外加一些字符串处理，把规则内容模板化。这样可以动态插入所需内容。

于是，我们将定制化的需求通过魔改 kafel-lang 实现。例如引入 %[0-9]s 占位符表示 runner 的参数列表字符串地址。这样可以更好地对 execve 等系统调用进行限制。

已知问题：只能在注册的入口 policy 中使用占位符（没有实现递归替换）。

Reference

Thanks to

QingdaoU/Judger: SATA LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 192 Commits
.github/workflows		.github/workflows
.vscode		.vscode
docs/doxygen-awesome-css		docs/doxygen-awesome-css
include		include
kafel		kafel
src		src
tests		tests
.gitignore		.gitignore
Doxyfile		Doxyfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
codecov.yml		codecov.yml
var.mk		var.mk

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Judger of Yao OJ

Features

Getting Start

Download

Build from source

Usage

Resource Limitation

Notes

Development

Design

Hooks

Policy

Reference

Thanks to

About

Releases 5

Packages

Languages

License

sshwy/yaoj-judger

Folders and files

Latest commit

History

Repository files navigation

Judger of Yao OJ

Features

Getting Start

Download

Build from source

Usage

Resource Limitation

Notes

Development

Design

Hooks

Policy

Reference

Thanks to

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 5

Packages 0

Languages

Packages