adding README comment on -accuracy and beginning of the -accuracy grid rewrite, and delete Poetry artifacts from README #70

klxu03 · 2023-12-04T02:10:27Z

Closes #77

klxu03 · 2023-12-04T04:16:25Z

Some preliminary work on having the model pick which grid cell to choose in the revamped -accuracy mode. The grid coordinates now display properly.

I plan on modifying the idea: first cut out a 400px x 400px area around the originally guessed location, then have the model repeatedly pick which grid cell/quadrant to click on from there. Each selected cell gets cropped out and upsampled 2x before the image is passed to GPT again.
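A rough sketch of the first crop step, to make the geometry concrete. Everything here is hypothetical and not code from this PR: `crop_box` and `upsampled_size` are made-up names, the guess is assumed to be given as fractions of the screen (0.0 to 1.0), and clamping the box to the screen edges is my own assumption about how a guess near a border should be handled.

```python
def crop_box(x_pct, y_pct, screen_w, screen_h, size=400):
    """Return (left, top, right, bottom) of a size x size box around the guess.

    x_pct / y_pct are assumed to be fractions of the screen (0.0 to 1.0).
    """
    cx = round(x_pct * screen_w)
    cy = round(y_pct * screen_h)
    half = size // 2
    # Assumption: slide the box back on screen instead of shrinking it.
    # This only works if the screen is at least `size` pixels in each dimension.
    left = min(max(cx - half, 0), screen_w - size)
    top = min(max(cy - half, 0), screen_h - size)
    return left, top, left + size, top + size


def upsampled_size(box, factor=2):
    """Size of the image handed to the model after upsampling the crop."""
    left, top, right, bottom = box
    return (right - left) * factor, (bottom - top) * factor


box = crop_box(0.5, 0.5, 1920, 1080)
print(box, upsampled_size(box))
```

The actual cropping and resizing would be done by whatever image library the project uses; this only computes the box and the resulting size.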

klxu03 · 2023-12-04T06:32:08Z

Adding an implementation note for my future self:

A clean way to implement picking which grid cell to zoom in on, when deciding which pixel to click, is to have the loop store the top-left and bottom-right percentages at each iteration. That way, at the end, you can just average the two percentages and return that as the pixel clicked.
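A minimal sketch of that bookkeeping, assuming bounds are kept as fractions of the full screen and each round's choice arrives as a (column, row, grid size) triple. The names `narrow` and `click_point` are placeholders for illustration, not functions from this branch.

```python
def narrow(bounds, col, row, n):
    """Shrink (left, top, right, bottom) to cell (col, row) of an n x n grid."""
    left, top, right, bottom = bounds
    cell_w = (right - left) / n
    cell_h = (bottom - top) / n
    new_left = left + col * cell_w
    new_top = top + row * cell_h
    return (new_left, new_top, new_left + cell_w, new_top + cell_h)


def click_point(start_bounds, picks):
    """Apply each round's pick, then average the two corners.

    picks is a list of (col, row, n) tuples, one per zoom round.
    """
    bounds = start_bounds
    for col, row, n in picks:
        bounds = narrow(bounds, col, row, n)
    left, top, right, bottom = bounds
    return ((left + right) / 2, (top + bottom) / 2)


# Two rounds on a 2 x 2 grid, starting from the middle half of the screen.
print(click_point((0.25, 0.25, 0.75, 0.75), [(0, 1, 2), (1, 1, 2)]))
```

Because only the two corners are carried between iterations, the loop never needs to remember earlier crops; the final click is just the midpoint of the last bounds.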

Additionally, maybe start with 4 grid lines (dividing the area into 16 cells), but later, once more narrowed down, only use 2 grid lines (dividing the area into fourths). Something like two rounds of 4 grid lines and two rounds of 2 grid lines will yield a final cell width of 400/(4^2 * 2^2) = 6.25 px, i.e. an error of up to about 3 pixels in either dimension. That is pretty darn accurate, assuming the model picks the correct cell every time.
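The arithmetic above can be checked with a few lines. This reads "4 grid lines" as splitting each side into 4 and "2 grid lines" as splitting each side into 2, which is the reading that gives 16 cells and 4 cells respectively; `final_cell_size` is a made-up helper name.

```python
def final_cell_size(start_px, splits):
    """Side length of the last cell after dividing each side by every split in turn."""
    size = float(start_px)
    for n in splits:
        size /= n
    return size


size = final_cell_size(400, [4, 4, 2, 2])  # two 4-way rounds, then two 2-way rounds
max_error = size / 2  # clicking the cell center is off by at most half a cell
print(size, max_error)
```

So the worst-case miss is half of the final cell width, provided every pick along the way was correct.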

Additionally, look into polling the model: ask it to generate 9 responses and then choose the most popular grid selection. That fail-safes against a wrong grid choice being picked.
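A possible shape for the polling idea, as a sketch only. `ask_model` is a hypothetical zero-argument callable that returns one grid choice per call; how it talks to GPT is left out entirely, and tie handling is simply whatever `Counter.most_common` returns first.

```python
from collections import Counter


def poll_grid_choice(ask_model, n_samples=9):
    """Ask the model n_samples times and return (most common choice, its vote count)."""
    votes = [ask_model() for _ in range(n_samples)]
    winner, count = Counter(votes).most_common(1)[0]
    return winner, count


# Stand-in for real model calls: a fixed sequence of nine answers.
fake_answers = iter([3, 3, 5, 3, 7, 3, 5, 3, 3])
print(poll_grid_choice(lambda: next(fake_answers)))
```

Returning the vote count alongside the winner leaves room to retry a round when the majority is thin, if that turns out to be useful.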

joshbickett · 2023-12-05T03:39:08Z

Hmm, I'm curious for a bit more context on this commit. Hoping to keep most of the non -accurate code the same when making -accurate improvements.

klxu03 · 2023-12-07T02:39:23Z

@joshbickett hey, sorry, just seeing this now. What do you mean by keeping most of the non -accurate code the same? I'm planning on basically having two different draw_labels behaviors. For normal mouse clicking, it shows the percentages in black on a white background. But when choosing a grid cell, I forgo the white rectangle and just display the text in green (at some point the view gets zoomed in to something like a 6px x 6px range, so having a white rectangle take up pixels doesn't seem like the best idea), especially since the model should know that the top-left corner is grid 0 and the numbering then goes down, then right (column-major order).
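To pin down the numbering described above (grid 0 in the top-left corner, counting down each column first, then moving right), here is a small illustration. Both function names are hypothetical and the grid is assumed to be square.

```python
def index_to_col_row(index, n):
    """Map a column-major grid index to (col, row) on an n x n grid."""
    if not 0 <= index < n * n:
        raise ValueError(f"grid index {index} is outside a {n} x {n} grid")
    # index // n is the column, index % n is the row within that column.
    return divmod(index, n)


def grid_labels(n):
    """Labels laid out as they would appear on screen, one inner list per row."""
    return [[col * n + row for col in range(n)] for row in range(n)]


for row in grid_labels(4):
    print(row)
print(index_to_col_row(5, 4))
```

On a 4 x 4 grid this puts 0, 1, 2, 3 down the left-most column and 4 at the top of the second column.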

joshbickett · 2023-12-09T15:22:01Z

@klxu03 you can ignore my last comment. I thought draw_label_with_background changed significantly but now I just see you added a condition for your -accurate method. All good, no concerns.

I am taking a closer look now. Got an error I haven't seen running normal operate without -accurate. Maybe a fluke, I'll look closer

joshbickett · 2023-12-09T15:32:09Z

@klxu03 Tried -accurate mode on a task and got this error. I'm very interested to see where this PR goes. Let me know when you think it is ready for more testing!

slavakurilyak · 2023-12-16T00:09:25Z

+1 for accuracy mode

joshbickett · 2023-12-21T15:15:44Z

> I am taking a closer look now. Got an error I haven't seen running normal operate without -accurate. Maybe a fluke, I'll look closer

@klxu03 let me know if you have any updates or thoughts on this. Thanks

joshbickett · 2024-01-04T03:43:58Z

@klxu03 curious if you have any updates. Looks like -accurate may still have issues. May make sense to remove for now until there are updates

klxu03 · 2024-01-07T08:32:56Z

For sure remove, it's likely outdated. My bad, I've been offline for a while on vacation. Returning later.

joshbickett · 2024-01-16T18:04:21Z

@klxu03 I did a rewrite of the project without accuracy mode. I think multimodal models are going to solve this mouse click problem pretty soon. See CogAgent: https://arxiv.org/abs/2312.08914

I'll close this for now. If you have additional updates, let me know.

klxu03 added 4 commits December 3, 2023 21:08

adding README comment on -accuracy

95b421d

creating and labeling which grid to choose from as well as updating README to include example of -accurate run

51f16b0

adding way to sample save/run a screenshot with grids to choose from

3e4d1b7

conditional drawing label, if grid mode then dont draw white background and color in green, otherwise draw with white background and black color

7bdcec0

klxu03 changed the title ~~adding README comment on -accuracy~~ adding README comment on -accuracy and beginning of the -accuracy grid rewrite Dec 4, 2023

updated README to remove poetry

a816aa7

klxu03 mentioned this pull request Dec 4, 2023

Windows 11 installation issue: Poetry could not find a pyproject.toml file in *path*\self-operating-computer or its parents #77

Closed

klxu03 changed the title ~~adding README comment on -accuracy and beginning of the -accuracy grid rewrite~~ adding README comment on -accuracy and beginning of the -accuracy grid rewrite, and delete Poetry artifacts from README Dec 4, 2023

klxu03 added 4 commits December 8, 2023 20:53

fixed merge conflicts in readme

5ece32c

code refactoring on mini screenshot, and adding helper function to capture a mini screenshot based on top left and bottom right percentages

37e6935

infra set up for the for loop in accurate_mode_click picking

8822ffd

iterating through different accuracy grids

e62d45d

Merge branch 'main' into reflective-mouse-click

ec0d8da

joshbickett closed this Jan 16, 2024
