Evaluate on OSWorld #642
Labels: enhancement, good first issue, help wanted
Feature request
We would like to test OpenAdapt's ability to perform the tasks in https://os-world.github.io/.
This may involve creating recordings of the tasks described in the benchmark, since (as per https://github.com/xlang-ai/OSWorld/tree/main/evaluation_examples) the data samples are formatted as:
Unfortunately this file does not appear to be included in the repo. Therefore completing this evaluation may involve manually re-creating the trajectories via openadapt.record.
Motivation
Evaluation
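The manual re-recording step described above could be prepared with a small helper along these lines. This is only a sketch under stated assumptions: it assumes each benchmark task is stored as a JSON object with a text field named "instruction", and that openadapt.record can be run as a module with the task description as its single positional argument. The function names load_instructions and build_record_command are hypothetical and not part of OpenAdapt or OSWorld. The script only prints the commands; it does not start any recording.

```python
import json
import sys
from pathlib import Path


def load_instructions(examples_dir):
    """Return (task_id, instruction) pairs for every *.json file under examples_dir.

    ASSUMPTION: each task file is a JSON object with a string field named
    "instruction". Files that do not match this shape are skipped.
    """
    tasks = []
    for path in sorted(Path(examples_dir).rglob("*.json")):
        data = json.loads(path.read_text(encoding="utf-8"))
        if isinstance(data, dict) and isinstance(data.get("instruction"), str):
            # Use the file name (without extension) as a task identifier.
            tasks.append((path.stem, data["instruction"]))
    return tasks


def build_record_command(instruction):
    """Build the argument list for recording one task.

    ASSUMPTION: openadapt.record is invoked as a module and takes the task
    description as one positional argument.
    """
    return [sys.executable, "-m", "openadapt.record", instruction]


if __name__ == "__main__":
    # Hypothetical default location of a local copy of the task files.
    examples_dir = sys.argv[1] if len(sys.argv) > 1 else "evaluation_examples"
    for task_id, instruction in load_instructions(examples_dir):
        # Print only; a person still performs each task while recording.
        print(task_id, build_record_command(instruction))
```

Printing the commands rather than executing them keeps a human in the loop, since each trajectory has to be performed by hand while the recorder runs.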