Guibot

A tool for GUI automation using a variety of computer vision and display control backends.

Introduction and concepts

In order to do GUI automation you usually need to solve two problems: first, you need to have a way to control and interact with the interface and platform you are automating and second, you need to be able to locate the objects you are interested in on the screen. Guibot helps you do both.

To interact with GUIs, Guibot provides the controller module which contains a common interface for different display backends, with methods to move the mouse, take screenshots, type characters and so on. The backend to use will depend on how your platform is accessible, with some backends running directly as native binaries or python scripts on Windows, macOS, and Linux while others connecting through remote VNC displays.

To locate an element on the screen, you will need an image representing the screen, a target representing the element (an image or a text in the simplest cases) and a finder configured for the target. The finder looks for the target within the screenshot image and returns the coordinates to the region where that target appears. Finders, just like display controllers, are wrappers around different backends supported by Guibot that could vary from a simplest 1:1 pixel matching by controller backends, to template or feature matching mix by OpenCV, to OCR and ML solutions by Tesseract and AI frameworks.

Finally, to bridge the gap between controlling the GUI and finding target elements, the region module is provided. It represents a subregion of a screen and contains methods to locate targets in this region using a choice of finder and interact with the graphical interface using a choice of controller.

Supported backends

Supported Computer Vision (CV) backends are based on

OpenCV
- Template matching
- Contour matching
- Feature matching
- Haar cascade matching
- Template-feature and mixed matching
Tesseract OCR
- Text matching through pytesseract, tesserocr, or OpenCV's bindings
PyTorch
- R-CNN matching through Faster R-CNN or Mask R-CNN
autopy
- AutoPy matching

Supported Display Controller (DC) backends are based on

Further resources

Homepage: http://guibot.org

Documentation: http://guibot.readthedocs.io

Installation: https://github.com/intra2net/guibot/wiki/Packaging

Issue tracking: https://github.com/intra2net/guibot/issues

Project wiki: https://github.com/intra2net/guibot/wiki

Name		Name	Last commit message	Last commit date
Latest commit History 928 Commits
.github		.github
docs		docs
guibot		guibot
misc		misc
packaging		packaging
tests		tests
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
codecov.yml		codecov.yml
lgtm.yml		lgtm.yml
mypy.ini		mypy.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Guibot

Introduction and concepts

Supported backends

Further resources

About

Releases

Packages

Contributors 6

Languages

License

intra2net/guibot

Folders and files

Latest commit

History

Repository files navigation

Guibot

Introduction and concepts

Supported backends

Further resources

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 6

Languages

Packages