OCR To Text File in C#.Net

Posted by Naraayanan under C# on 6/12/2014 | Points: 10 | Views : 546 | Status : [Member] | Replies : 2
How to extract PDF File data into text File in C#.net ? The file contents are Optical Character Recognition format.

Regards,
Lakshmi Naraayanan.S
http://dotnettechrocks.blogspot.in/
http://abaprocker.blogspot.com/



Responses

Posted by: Goud.Kv on: 6/12/2014 [Member] [MVP] Gold | Points: 25

Up
0
Down
Hi,

Refer the below resolved post.
http://stackoverflow.com/questions/158479/programatically-recognize-text-from-scans-in-a-pdf-file
Hope it helps you..

Thanks & Regards,
Krishna

Naraayanan, if this helps please login to Mark As Answer. | Alert Moderator

Posted by: Bartosz.Zygadlo on: 12/11/2014 [Member] Starter | Points: 25

Up
0
Down
Naraayanan,
Your question implies you already performed OCR and you now have the recognition result in some intermediate format that needs conversion to text. If this is the case, you will need whatever library you already used to perform OCR so that the library would do the conversion from OCR result to text.
I based these remarks on the library I use which is LeadTools. In this library, the OCR result can be saved as intermediate "LTD" files which can then be converted to text, DOC, PDF, etc.


Naraayanan, if this helps please login to Mark As Answer. | Alert Moderator

Login to post response