Bug Report: Some task can't be addressed when Headless parameter is enabled #4

3rdCore · 2023-06-06T17:31:20Z

Bug Description:
I encountered a strange bug while running two almost similar benchmarks for the "enter-time" task (but it might consider many other tasks). The only difference between the two runs is the value of the "headless" parameter. In the first case, I set it to False (headless = False), while in the second case, I left it as True, which was the default value.

Steps to Reproduce:

Git clone the SNow_benchmark branch from my fork and follow the installation in the README.md
.
Set the headless parameter to False and run the benchmark for the "enter-time" :
python main.py --env enter-time --llm chatgpt --num-episodes 1 --irci 1 --sgrounding
Set the headless parameter to True and run the benchmark for the "enter-time" :
python main.py --env enter-time --llm chatgpt --num-episodes 1 --irci 1 --sgrounding --headless
Expected Behavior:
The results should be identical, regardless of the value of the headless parameter.

Actual Behavior:
When the headless parameter is disabled (set to False), certain actions are not allowed or counted, resulting in a failed task. (I could benchmark the task several time, I will still get the same results)

(RCI-agent-WSL) thirdcore@DESKTOP-5I4C9HH:~/rci-agent$ python main.py --env enter-time --llm chatgpt --num-episodes 1 --irci 1 --sgrounding 
False
INFO:root:Starting WebDriver Instance 0
INFO:selenium.webdriver.common.selenium_manager:Applicable driver not found; attempting to install with Selenium Manager (Beta)
INFO:root:Send a request to the language model from initialize_plan
INFO:root:The number of generated action steps: 4
INFO:root:Send a request to the language model from generate_action
INFO:root:The executed instruction: clickxpath //*[@id="tt"]
INFO:root:Send a request to the language model from generate_action
INFO:root:The executed instruction: type 02:07PM
INFO:root:Send a request to the language model from generate_action
INFO:root:The executed instruction: clickxpath //*[@id="subbtn"]
success rate: 1.0
(RCI-agent-WSL) thirdcore@DESKTOP-5I4C9HH:~/rci-agent$ python main.py --env enter-time --llm chatgpt --num-episodes 1 --irci 1 --sgrounding --headless
True
INFO:root:Starting WebDriver Instance 0
INFO:selenium.webdriver.common.selenium_manager:Applicable driver not found; attempting to install with Selenium Manager (Beta)
INFO:root:Send a request to the language model from initialize_plan
INFO:root:The number of generated action steps: 4
INFO:root:Send a request to the language model from generate_action
INFO:root:The executed instruction: clickxpath //*[@id="tt"]
INFO:root:Send a request to the language model from generate_action
INFO:root:The executed instruction: type 1017AM
INFO:root:Send a request to the language model from generate_action
INFO:root:The executed instruction: clickxpath //*[@id="subbtn"]
success rate: 0.0

Additional Information:
I'm still investigating the root cause of this issue. It seems that when the browser is not displayed, some actions are restricted or not properly accounted for, leading to the task failure. Did you have the same behavior, is there something that I'm missing ?

The text was updated successfully, but these errors were encountered:

3rdCore · 2023-06-06T21:13:43Z

This issue was related to selenium.

In the miniwob++ package, I changed :

options.add_argument("headless")
to :
options.add_argument("--headless=new")

and it fixed the issue I had. (I now got the same performance whatever the value of the parameter headless)

This issue is related to this blogpost.

3rdCore closed this as completed Jun 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug Report: Some task can't be addressed when Headless parameter is enabled #4

Bug Report: Some task can't be addressed when Headless parameter is enabled #4

3rdCore commented Jun 6, 2023 •

edited

3rdCore commented Jun 6, 2023

Bug Report: Some task can't be addressed when Headless parameter is enabled #4

Bug Report: Some task can't be addressed when Headless parameter is enabled #4

Comments

3rdCore commented Jun 6, 2023 • edited

3rdCore commented Jun 6, 2023

3rdCore commented Jun 6, 2023 •

edited