Hello group,
I just stubled over iText as I'm looking for way to extract the text
elements of a PDF for storage in a text index (Apache Lucene).
But the documentation and examples I found was not too exciting:
http://www.lowagie.com/iText/tutorial/ch13.html
http://www.lowagie.com/iText/examples/Chap13_pdfreader.java
Nor did the API-docs help me figure out how to extract the text.
My goal is a subclassed PdfReader with a convenience method called
"enumerateTextElements", "enumerateElements" or so.
Could someone please give me a hint or two what I need to do?
--
kalle
-------------------------------------------------------
SF.Net is sponsored by: Speed Start Your Linux Apps Now.
Build and deploy apps & Web services for Linux with
a free DVD software kit from IBM. Click Now!
http://ads.osdn.com/?ad_id=1356&alloc_id=3438&op=click
_______________________________________________
iText-questions mailing list
iText-questions@(protected)
https://lists.sourceforge.net/lists/listinfo/itext-questions