试图提取PDF文件的文本内容与下面的code:
Trying to extract the textual content of a pdf with the following code:
PdfReader reader = new PdfReader(path);
string strText = string.Empty;
for (int page = 1; page <= reader.NumberOfPages; page++)
{
string s = PdfTextExtractor.GetTextFromPage(reader, page);
strText += " " + s;
}
reader.Close();
NumberOfPages返回257,但227页,GetTextFromPage()抛出一个IndexOutOfRangeException。
NumberOfPages returns 257, but at page 227, GetTextFromPage() throws a IndexOutOfRangeException.
任何帮助是AP preciated。
Any help is appreciated.
hofnarwillie
hofnarwillie
我通过更新我从5.1版本iTextSharp的,以5.2解决了这个问题。
I resolved this issue by updating my version of iTextSharp from 5.1 to 5.2.
上一篇:如何产生用于HiddenField的每个值的表?HiddenField
下一篇:检索COM类工厂具有CLSID组件{C1F400A0-3F08-11D3-9F0B-006008039E37}失败,原因是以下错误:80040154组件、工厂、错误、原因