Subject: Re: u guys konw how to read the data from pdfusing java itext ? 2007-10-08 - By wasegraves@(protected)
Back Please see my most recent reply to Leonard's comments. This should help you understand the issues that are involved. In the text-extraction procedure I outlined earlier, I used Reader to produce the PS file from the PDF, thus skirting around the issues you have.
You should also be aware that use of the text-extraction procedure I outlined using the procedure would not be permissible if you contemplate using it for a protected PDF, i.e., one that has protections enabled that you would be circumventing by using the procedure.
Now, to your problem. You are trying to extend the text-extraction procedure outlined earlier in this thread to text editing. Please heed Leonard's advice that editing the text of a PDF is a very bad idea, together with my advice that if you insist on editing a PDF, you should be prepared to repair the damage that you do to it.
In another thread, I outlined a procedure for editing a PDF which is known to work. If you wish to develop your own procedure, please feel free to do so. The path you are on right now, however, is not likely to be successful.
Best regards, Bill Segraves
-- ---- ------ Original message from "Sarath Dorbala" <itext.newbie@(protected)>: -- ---- ------
Hello, I am sorry if I am digressing. I have a bunch of PDF files in which I just want to blank out the text. As I read the threads of this particular post, I got an idea of replacing any textby doing this
(SOME TEXT) Tj to ( ) Tj
But when I converted my pdf file to ps using pdf2ps, I did not see any of such structures. Still the ps file could be viewed using ghostscript viewer. I dont know if my PDF is encoded in some sense. Is there any other way text is rendered in ps?
I have attached my original PDF and converted PS file for your reference.
Sorry if it sounds very naive. I am pretty new to this whole thing.
Thank you, Sarath <snip> <html> <!-- BEGIN WEBMAIL STATIONERY --> <head></head> <body> <!-- WEBMAIL STATIONERY noneset --> <DIV></DIV> <P>Please see my most recent reply to Leonard's comments. This should help you understand the issues that are involved. In the text-extraction procedure I outlined earlier, I used Reader to produce the PS file from the PDF, thus skirting around the issues you have.</P> <P> </P> <P>You should also be aware that use of the text-extraction procedure I outlined using the procedure would not be permissible if you contemplate using it for a protected PDF, i.e., one that has protections enabled that you would be circumventing by using the procedure.</P> <P> </P> <P>Now, to your problem. You are trying to extend the text-extraction procedure outlined earlier in this thread to text editing. Please heed Leonard's advice that editing the text of a PDF is a very bad idea, together with my advice that if you insist on editing a PDF, you should be prepared to repair the damage that you do to it.</P> <P> </P> <P>In another thread, I outlined a procedure for editing a PDF which is known to work. If you wish to develop your own procedure, please feel free to do so. The path you are on right now, however, is not likely to be successful.</P> <P> </P> <P>Best regards,</P> <P>Bill Segraves<BR></P> <BLOCKQUOTE style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid">-- ---- ------ Original message from "Sarath Dorbala" <itext .newbie@(protected)>: -- ---- ------ <BR><BR> <DIV>Hello,</DIV> <DIV>I am sorry if I am digressing. I have a bunch of PDF files in which I just want to blank out the text. As I read the threads of this particular post, I got an idea of replacing any textby doing this</DIV> <DIV> </DIV> <DIV>(SOME TEXT) Tj to ( ) Tj< /DIV> <DIV> </DIV> <DIV>But when I converted my pdf file to ps using pdf2ps, I did not see any of such structures. Still the ps file could be viewed using  ;ghostscript viewer. I dont know if my PDF is encoded in some sense. Is there any other way text is rendered in ps? </DIV> <DIV> </DIV> <DIV>I have attached my original PDF and converted PS file for your reference. </DIV> <DIV> </DIV> <DIV>Sorry if it sounds very naive. I am pretty new to this whole thing.</DIV> <DIV> </DIV> <DIV>Thank you,</DIV> <DIV>Sarath<BR><snip></DIV></BLOCKQUOTE> <!-- END WEBMAIL STATIONERY -->
</body> </html>
-- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ----- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ __ ____ ____ ____ ____ ____ ____ ____ ____ ____ iText-questions mailing list iText-questions@(protected) https://lists.sourceforge.net/lists/listinfo/itext-questions Buy the iText book: http://itext.ugent.be/itext-in-action/
|
|