An OCR server for reading simple CAPTCHAs, powered by Tesseract 4.1.1.
POST /base64
Example request body:
{
"base64": "iVBORw0KGgoAAAANSUhEUgAAABYAAAAkCAMAAAC62DqvAAAAP1BMVEUAAAAkJSgjKCgoKCglJSgjJSckJSgjJSkkJCYnJycnJycqKiokJSgjJSckJSgkJCclJSglJSklJSwaGhokJSjbbGjNAAAAFHRSTlMA8joT0HDMil0hGgbVtaWVUkQpCmqOj4cAAABQSURBVCjP5cg3DoAwEATAdcSBjP//VtCVvkWixlMOPqopFL0utmayWt8ek15puibTtT/cGtki0UWQnl3Xxcj7/u2gf/EPoL+B/gHlXJcdb24oNg3pSN9UAQAAAABJRU5ErkJggg==",
"whitelist": "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
}
The base64 field may include the data URI prefix "data:image/png;base64,".
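The request above could be sent from Go roughly as follows. This is a minimal client sketch, not part of the project: the names ocrRequest, encodeRequest, and postBase64 are hypothetical, the server address assumes the container is running locally on port 35735, and the application/json content type is an assumption. The response format is not documented here, so the sketch prints the raw response instead of parsing it.

```go
package main

import (
	"bytes"
	"encoding/base64"
	"encoding/json"
	"fmt"
	"io"
	"net/http"
	"os"
	"time"
)

// ocrRequest mirrors the example body; the two field names come from the README.
type ocrRequest struct {
	Base64    string `json:"base64"`
	Whitelist string `json:"whitelist"`
}

// encodeRequest builds the JSON body for POST /base64.
func encodeRequest(imageBase64, whitelist string) ([]byte, error) {
	return json.Marshal(ocrRequest{Base64: imageBase64, Whitelist: whitelist})
}

// postBase64 sends the body to the server and returns the HTTP status
// followed by the raw, unparsed response body.
func postBase64(serverURL string, body []byte) (string, error) {
	client := &http.Client{Timeout: 10 * time.Second}
	// Assumption: the server accepts a JSON content type.
	resp, err := client.Post(serverURL+"/base64", "application/json", bytes.NewReader(body))
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	raw, err := io.ReadAll(resp.Body)
	if err != nil {
		return "", err
	}
	return fmt.Sprintf("%s: %s", resp.Status, raw), nil
}

func main() {
	if len(os.Args) < 2 {
		fmt.Fprintln(os.Stderr, "usage: client <image-file>")
		return
	}
	img, err := os.ReadFile(os.Args[1])
	if err != nil {
		fmt.Fprintln(os.Stderr, "read image:", err)
		return
	}
	body, err := encodeRequest(base64.StdEncoding.EncodeToString(img), "ABCDEFGHIJKLMNOPQRSTUVWXYZ")
	if err != nil {
		fmt.Fprintln(os.Stderr, "encode request:", err)
		return
	}
	// Assumption: the server is reachable on localhost at port 35735.
	out, err := postBase64("http://localhost:35735", body)
	if err != nil {
		fmt.Fprintln(os.Stderr, "request failed:", err)
		return
	}
	fmt.Println(out)
}
```

Run it with the path to a CAPTCHA image as the only argument. The whitelist restricts recognition to uppercase letters, as in the example body; adjust it to match the characters your CAPTCHA can contain.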
This project's Dockerfile builds Tesseract from source, so you can choose a suitable version.
Convenient commands for development:
docker build --tag=daominah/ocr_server .
docker rm -f ocr_server
docker run -dit --restart always --name=ocr_server -p=35735:35735 daominah/ocr_server
Tesseract's behaviour can be modified by changing its parameters in tesseract.cfg.
Doc: Tesseract, improving the quality of the output
- This project is forked from otiai10/ocrserver
- Go wrapper library: otiai10/gosseract
- Original Tesseract project (C++): tesseract-ocr/tesseract