sbox

History

Name		Name	Last commit message	Last commit date
parent directory ..
Dockerfile.javascript		Dockerfile.javascript
Dockerfile.python		Dockerfile.python
README.md		README.md
eval.javascript.tpl		eval.javascript.tpl
eval.python.tpl		eval.python.tpl
sandbox.py		sandbox.py
timeout.sh		timeout.sh

README.md

Simple Sandbox

Problem

We can't reliably tell if code is working without executing it, but directly executing untrusted and quite-possibly-buggy code generated by a random LLM directly on our development machines is undesirable.

Solution

Network and Filesystem isolation provider by Docker
Timeout provider by a bash exec wrapper
Eval wrapper to capture errors, format outputs and always return a consitent result

Implementation

sandbox.py (intended to be used as a library)

extract_function_info(language, code) perform static analysis on potentially non-working code that implements a function. returns a { name, args[] } object containing the functions name and a list of its arguments.
FunctionSandbox(code, language) high-level Docker sandbox class. Use the .call(..) method to invoke the untrusted function.

timeout.sh

Bash implementation of the timeout layer

Dockerfile.javascript, Dockerfile.python

Docker implementations of the isolation layer

eval.javascript.tpl, eval.python.tpl

Eval wrappers.

Assumptions

code contains a single function in language
the language we are working with supports try/catch, lists and objects
the language we are working with can serialize JSON

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

sbox

sbox

README.md

Simple Sandbox

Problem

Solution

Implementation

Assumptions

Files

sbox

Directory actions

More options

Directory actions

More options

Latest commit

History

sbox

Folders and files

parent directory

README.md

Simple Sandbox

Problem

Solution

Implementation

Assumptions