PDF Text Extraction in .Net [Resolved]

Posted by Nashfana under ASP.NET on 6/6/2012 | Points: 10 | Views : 1538 | Status : [Member] | Replies : 5
How can I extract text from a PDF file using PDFBox in .NET?




Responses

Posted by: Bronte on: 5/7/2013 [Member] Starter | Points: 50

Resolved
Extracting text from a PDF is a little complex. Here is some sample code in .NET; a more detailed guide is available here: http://www.rasteredge.com/how-to/csharp-imaging/pdf-text-extract/
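public static void LoadPDFFromFile()
{
    // FolderName is assumed to be a string defined elsewhere that ends with a path separator.
    string fileName = FolderName + "Sample.PDF";

    // Open the PDF file using PDFDecoder.
    REDocument doc = REFile.OpenDocumentFile(fileName, new PDFDecoder());

    // Get the first page (index 0) from the REDocument.
    BasePage aPage = doc.GetPage(0);

    // Render the page to an image. This step rasterizes the page; it does not return text by itself.
    REImage img = (REImage)aPage.ToImage();
}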



Posted by: Vuyiswamb on: 6/6/2012 [Member] [MVP] [Administrator] NotApplicable | Points: 25

Look at IKVM, which can run Java libraries such as PDFBox on .NET:

http://www.ikvm.net/

Thank you for posting at Dotnetfunda
[Administrator]
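
The IKVM suggestion above can be sketched roughly as follows. This is a minimal example, not a definitive implementation. It assumes a PDFBox 1.x jar has been compiled to a .NET assembly with ikvmc and is referenced together with the IKVM runtime assemblies. The class name PdfTextExample and the method ExtractText are hypothetical names chosen for illustration. In PDFBox 2.x the PDFTextStripper class moved to org.apache.pdfbox.text and PDDocument.load takes a File rather than a string, so the code would need adjusting there.

```csharp
using System;
using org.apache.pdfbox.pdmodel;   // PDDocument (Java package exposed as a .NET namespace by IKVM)
using org.apache.pdfbox.util;      // PDFTextStripper in PDFBox 1.x (assumption: a 1.x build is referenced)

// Hypothetical helper class for illustration only.
public static class PdfTextExample
{
    // Returns the text of every page in the PDF at the given path.
    public static string ExtractText(string path)
    {
        PDDocument doc = null;
        try
        {
            // Load the document from disk (PDFBox 1.x overload taking a file name).
            doc = PDDocument.load(path);

            // PDFTextStripper walks the pages and collects their text content.
            PDFTextStripper stripper = new PDFTextStripper();
            return stripper.getText(doc);
        }
        finally
        {
            // PDDocument holds file resources, so close it even if extraction fails.
            if (doc != null)
            {
                doc.close();
            }
        }
    }

    public static void Main(string[] args)
    {
        // Usage: pass the path to a PDF file as the first argument.
        Console.WriteLine(ExtractText(args[0]));
    }
}
```

Note that this only returns text that is actually stored in the PDF; scanned pages that contain only images would need OCR instead.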


Posted by: Nashfana on: 6/7/2012 [Member] Starter | Points: 25

Thank You


Posted by: Evanpan on: 1/20/2016 [Member] Starter | Points: 25

How about extracting text from PDF files with the help of a third-party toolkit?
http://www.pqscan.com/extract-text/from-pdf-csharp.html


Posted by: Richard789 on: 1/25/2016 [Member] Starter | Points: 25

Thank you

