Skip to content

Disordering of Extracted Lines from ML kit Text Extraction API(OnDeviceTextRecogniser) #58

@phani-artiovatic

Description

@phani-artiovatic

Step 1: Describe your environment

  • Android device: Nokia 2.2___
  • Android OS version: Android 9.0
  • Google Play Services version: _____
  • Firebase/Play Services SDK version: _____

Step 2: Describe the problem:

On providing images with texts in same line having large spaces in between them are separated into different blocks. This inreturn messes up order of lines in text extraction. But order of lines is very important in our case.

Steps to reproduce:

1.Provide an image with words in same line with large spaces
2.And check output Text , you can see order of lines disordered in large degree

Observed Results: This is Raw text from TextExtraction API

Sample Input
sample

Output

ORIGINAL
L For Recioient
BILL OF SUPPLY
ABD CO
20/0/208
BOS0057
M.G.RAOD, Delhi, Delhi 110099
GSTIN 0TA
State
PAN
434
B Date
07-Delhi
AAECC8220
No
Reference Nio PO.78708
Customer Name
ACC &CO
Customar GSTIN
27A
Place of Supsly
Billing Addres
ACC &CO
Maharashtr
Shipping Addres
ACC&CO
Maharashtra
120
27-Maharashtra
Due Date 24/01/2038
Discount (
Rate / tem (
Total ()
tem
HSN/SAC Quantity
1,809.00
KGS
1.Slag for manufacturing iron
8,678.67
171,30,30,762.64
Total ()
1,30,30,762.64
1,30,30,763.00
0.36
One Crore Thirty Lakh Thirty Thousand Seven Hundred Sity Three Rupees Only
Total Value
Rounding off
Total amount in words)
For ABD Cco
Authorised Signatory

Expected Results:

Expected result is line by line extraction.Currently its block by block extraction or is there way to acheive it?

Relevant Code:

depnedecny used com.google.firebase:firebase-ml-vision:20.0.0'(OnDeviceTextRecogniser)

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions