nanobot - Autonomous game playing and interaction from screenshots only

git clone https://github.com/diopthe20/nanobot
cd nanobot
git submodule update --init --recursive

TODO

Develop a deployment strategy

Vision model

We will use a vision model to extract information about what is visible on the screen.

name	status	description
llama-3-vision-alpha		projection module trained to add vision capabilities to Llama 3 using SigLIP. Built by @yeswondwerr and @qtnx_

OCR

I used EasyOCR to recognize text on the screen during gathering. You can try EasyOCR at https://huggingface.co/spaces/tomofi/EasyOCR
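As a rough sketch of how the OCR step could look, the snippet below reads text from a screenshot with EasyOCR and keeps only confident results. The function names, the 0.5 confidence threshold, and the sample data are assumptions for illustration and are not taken from this repository; `easyocr.Reader` and `readtext` are the library's documented entry points.

```python
def confident_text(results, min_conf=0.5):
    """Keep the text of OCR results at or above min_conf.

    `results` follows EasyOCR's readtext() output: (bbox, text, confidence).
    The 0.5 threshold is an assumption, not a value from this project.
    """
    return [text for _bbox, text, conf in results if conf >= min_conf]


def read_screen_text(image_path, min_conf=0.5):
    """Hypothetical helper: run EasyOCR on a screenshot file."""
    import easyocr  # third-party, imported lazily so the rest runs without it

    reader = easyocr.Reader(["en"])  # loads the English recognition model
    return confident_text(reader.readtext(image_path), min_conf)


# Made-up results in the same shape that readtext() returns.
sample = [
    ([[0, 0], [80, 0], [80, 20], [0, 20]], "Cotton", 0.93),
    ([[0, 30], [80, 30], [80, 50], [0, 50]], "C0tt0n", 0.21),
]
print(confident_text(sample))
```

Filtering by confidence is one simple way to avoid acting on misread text; the right threshold depends on the game's fonts and resolution.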

Event Handling

We treat the environment state as an event and send it to an event handler.
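The idea can be sketched as a small dispatcher. This is a minimal illustration, not the code in this repository: the `EnvState` fields, the `EventHandler` class, and the event name `resource_visible` are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class EnvState:
    """Hypothetical snapshot of what the bot currently sees."""
    kind: str                                  # e.g. "resource_visible"
    data: dict = field(default_factory=dict)   # details extracted from the screen


class EventHandler:
    """Routes each environment state to the callbacks registered for its kind."""

    def __init__(self) -> None:
        self._handlers: Dict[str, List[Callable[[EnvState], str]]] = {}

    def on(self, kind: str, callback: Callable[[EnvState], str]) -> None:
        self._handlers.setdefault(kind, []).append(callback)

    def handle(self, state: EnvState) -> List[str]:
        # States with no registered callback produce no actions.
        return [cb(state) for cb in self._handlers.get(state.kind, [])]


handler = EventHandler()
handler.on("resource_visible", lambda s: f"gather at {s.data['position']}")

actions = handler.handle(EnvState("resource_visible", {"position": (320, 240)}))
print(actions)  # ['gather at (320, 240)']
```

Keeping the state as plain data and the reactions as registered callbacks lets new behaviours be added without touching the capture loop.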

Object Detection

Label images with Label Studio, export them in YOLO format, then upload them to Roboflow to export the dataset in the right format for YOLO training.

Train with YOLO

=> Predict from the screen stream
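For reference, a YOLO label file holds one line per object: a class id followed by the box centre, width, and height, all normalized by the image size. Label Studio and Roboflow produce these files for you; the sketch below only illustrates the format, and the function name and sample numbers are made up.

```python
def to_yolo_line(class_id, box, image_size):
    """Convert a pixel box (x_min, y_min, x_max, y_max) to a YOLO label line."""
    x_min, y_min, x_max, y_max = box
    img_w, img_h = image_size
    cx = (x_min + x_max) / 2 / img_w   # normalized box centre, x
    cy = (y_min + y_max) / 2 / img_h   # normalized box centre, y
    w = (x_max - x_min) / img_w        # normalized box width
    h = (y_max - y_min) / img_h        # normalized box height
    return f"{class_id} {cx:.6f} {cy:.6f} {w:.6f} {h:.6f}"


# A 200x200 pixel box in a 1000x800 screenshot, class 0.
line = to_yolo_line(0, (100, 200, 300, 400), (1000, 800))
print(line)  # 0 0.200000 0.375000 0.200000 0.250000
```

Because the coordinates are normalized, the same labels stay valid if the screenshots are resized before training.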

This project is currently in development. The current phase is trying out new possibilities.
