cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Read Tabular Data Row wise from PDF

himanshu
Deputy Chef I
Deputy Chef I

Hello All,


I am trying to build a solution that can read values from a PDF & post it to a database engine.


DocParser seemed to be promising but was not able to read the row wise data from it.


Also DocParser is a paid service so I tried JavaScript/Python block based solutions but was not able to import library files .


Attached is the screenshot of sample PDF.


Any solutions/suggestions please?


Thank you.



5 REPLIES 5

rayudu_addagarl
Deputy Chef I
Deputy Chef I
There are limitations of Python libraries that can be added 
Might have to use Amazon text tact or adobe cloud services for this use case or hyperscienece etc , all those have cost tonier 

Rayudu Addagarla 
Sr Manager ,Business Consulting
Toronto, Canada

patrick-steil
Deputy Chef III
Deputy Chef III

Himanshu, are you able to copy / paste text from the tabular data section? If you cannot, the text might not actually be text that can be exported directly and may need some type of OCR solution.


Hello Patrick,


Good suggestion.

Yes this is a text which is selectable.



james-ng
Deputy Chef I
Deputy Chef I

for OCR solution, you could try the app "Google Lens"