How to get key value data from OCR

regex
extract-data
zonal-ocr

#1

I am trying to use OCR API for some document with Table Structure… its working fine i am also getting output of the OCR in Json… with TOP X , Left X etc now i want to store same data in database with Field Name / Value example for Item Name / Price from database how can we identify Header / Value etc…


#2

So you want to extract certain infos from the OCR’ed text? The standard way to do is to use regular expressions.

Examples:

The other option is to use the coordinates of the word bounding boxes. This works if you know that certain data is always at a certain position, e. g. when scanning always the same type of invoices.


#3

Thanx @ulrich
I am working on same direction… hope it should work