Tabulo is an open-source toolkit for computer vision. Currently, it supports table detection, but we are aiming for much more. It is built in Python, using Luminoth, TensorFlow and Sonnet.
- Installation Instructions
- Available APIs
- Working with pretrained Models
- Running Tabulo
- Running Tabulo As Service
- Supported models
- Usage
- Working with datasets
- Training
- LICENSE
Tabulo currently supports Python 2.7 and 3.4–3.6.
To use Tabulo, TensorFlow must be installed beforehand. For GPU support, install the GPU version of TensorFlow with pip install tensorflow-gpu; otherwise, install the CPU version with pip install tensorflow.
We use Tesseract to extract data from tables, so you must install Tesseract as well. Follow this link to install Tesseract.
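Before installing Tabulo, it can help to confirm that both prerequisites are visible from your environment. The following is a minimal sketch, not part of Tabulo: the function name check_prerequisites is hypothetical, and it assumes Tesseract is installed as an executable named tesseract on your PATH. It needs Python 3.4 or later.

```python
import importlib.util
import shutil


def check_prerequisites():
    """Return a dict mapping each prerequisite name to True if it was found."""
    return {
        # Looks up the package without importing it (which would be slow).
        "tensorflow": importlib.util.find_spec("tensorflow") is not None,
        # Assumption: Tesseract is available as a `tesseract` executable on PATH.
        "tesseract": shutil.which("tesseract") is not None,
    }


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print("{}: {}".format(name, "found" if found else "MISSING"))
```

If either entry is reported as missing, install it as described above before continuing.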
First, clone the repo on your machine and then install with pip:
git clone https://github.com/interviewBubble/Tabulo.git
cd Tabulo
pip install -e .
Simply run tabulo --help.
- localhost:5000/api/fasterrcnn/predict/ : Detect tables in an image
- localhost:5000/api/fasterrcnn/extract/ : Extract table content from detected tables
- Download the pretrained model from Google Drive.
- Unzip it and copy the downloaded luminoth folder into the luminoth/utils/pretrained_models folder.
- Run this command to list all checkpoints:
tabulo checkpoint list
- You will get output like this:
- Now run the server using this command:
tabulo server web --checkpoint 6aac7a1e8a8e
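If you prefer to drive these steps from a script, the two commands above can be wrapped with the standard subprocess module. This is only a sketch: the helper names server_command, list_checkpoints and start_server are hypothetical, and it assumes the tabulo executable is on your PATH. The checkpoint output is returned as raw text, without parsing.

```python
import subprocess


def server_command(checkpoint_id):
    """Build the argument list for `tabulo server web --checkpoint <id>`."""
    return ["tabulo", "server", "web", "--checkpoint", checkpoint_id]


def list_checkpoints():
    """Run `tabulo checkpoint list` and return its raw text output."""
    output = subprocess.check_output(["tabulo", "checkpoint", "list"])
    return output.decode("utf-8", "replace")


def start_server(checkpoint_id):
    """Start the web server as a child process.

    The caller is responsible for stopping it, e.g. with .terminate().
    """
    return subprocess.Popen(server_command(checkpoint_id))
```

For example, start_server("6aac7a1e8a8e") launches the same server as the command above, using the checkpoint ID shown there; substitute the ID reported on your machine.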
curl -X POST \
http://localhost:5000/api/fasterrcnn/predict/ \
-F image=@/path/to/image/page_8-min.jpg
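The same request can be sent from Python. The sketch below uses only the standard library and assumes the server is running locally on port 5000, as above. The form field name image is taken from the curl command. The helper names build_multipart and predict are hypothetical, and the image/jpeg content type is an assumption based on the .jpg example. The response format is not described here, so the function returns the raw response bytes.

```python
import os
import urllib.request
import uuid


def build_multipart(field, filename, data, content_type="application/octet-stream"):
    """Encode a single file as a multipart/form-data body.

    Returns a (body_bytes, content_type_header_value) tuple.
    """
    boundary = uuid.uuid4().hex
    head = (
        "--{b}\r\n"
        'Content-Disposition: form-data; name="{f}"; filename="{n}"\r\n'
        "Content-Type: {t}\r\n\r\n"
    ).format(b=boundary, f=field, n=filename, t=content_type)
    tail = "\r\n--{b}--\r\n".format(b=boundary)
    body = head.encode("utf-8") + data + tail.encode("utf-8")
    return body, "multipart/form-data; boundary={}".format(boundary)


def predict(image_path, url="http://localhost:5000/api/fasterrcnn/predict/"):
    """POST an image to the predict endpoint and return the raw response bytes."""
    with open(image_path, "rb") as fh:
        data = fh.read()
    # Assumption: the image is a JPEG, as in the curl example.
    body, ctype = build_multipart("image", os.path.basename(image_path), data, "image/jpeg")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": ctype}, method="POST"
    )
    with urllib.request.urlopen(request) as response:
        return response.read()
```

Calling predict("/path/to/image/page_8-min.jpg") sends the image used in the curl example. The boundary is generated per request and placed in the Content-Type header, which is why it should not be hard-coded.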
Currently, we support the following models:
- Object Detection
We also provide pre-trained checkpoints for the above models trained on popular datasets such as COCO and Pascal.
There is one main command-line interface, which you can use with the tabulo command. Whenever you are unsure how to do something, just type:
tabulo --help
or tabulo <subcommand> --help
and a list of available options with descriptions will show up.
DataSet to train your custom model.
See Training your own model to learn how to train locally or in Google Cloud.
Released under the BSD 3-Clause License.