Java Mailing List Archive

http://www.junlu.com/

Google
Google
Mailing List
Home
Forum Home
JBoss - Java Application Server
Tomcat - JSP/Servlet container
Struts - A MVC web framework
iText - An open source PDF Java Library
JDOM - JDOM XML Parser
JSP - A mailing list about Java Server Pages specification and reference
J2EE - A mailing list for Java(tm) 2 Platform, Enterprise Edition
J2EE Pattern - An interest list for Sun Java Center J2EE Pattern Catalog
Servlet - A mailing list for discussion about Sun Microsystem's Java Servlet API Technology
Struts & Hibernate
Subjects
JSP editor plugin for eclipse ?
org apache jasper JasperException: Unable to compile class for JSP
Tomcat: Connection reset by peer: socket write error
Cannot retrieve definition for form bean null
Struts Tiles Tutorial (free Struts training)
Where do I download Tomcat 4 0 6?
Data Access Object (DAO) pattern, example DAO 's
Where to download Tomcat v 4 1 24 from?
Tomcat 5 0 16 Requested resource not available
Subject: Servlet : Session invalidate
Oracle Connection Pooling in 3 2 2
Servlet action is currently unavailable
Tomcat/Struts Unicode Encoding/Decoding problems
Subject: Running a Simple JMS Example
Tomcat and webapplication specific java library path
Mapping in workers2 properties
org apache jasper JasperException
problem with html:text bean throwing exception
Cannot find message resources under key org apache struts action
   MESSAGE
Cannot find message resources under key org apache struts action MESSAGE
invalid direct reference problem with solution
Tool for jsp debug Try Sysdeo Eclipse Plugin
Tomcat 5 Cannot load JDBC driver class 'null ' SQL state: null
weblogic ejbc
java properties file
Jboss 3 2 3 Coyote Can 't re
Tomcat 5, Apache2 and mod jk2 integration problem
JBoss example problem new to J2EE
Value attribute of <html:checkbox
url string for connecting jboss to oracle
javax servlet ServletException: BeanUtils populate
5 0 18: Windows XP Pro vs Windows 2000
HTTP Status 404 The requested resource is not available
 
Subject: Re: u guys konw how to read the data from pdfusing java itext ?

Subject: Re: u guys konw how to read the data from pdfusing java itext ?

2007-10-08       - By wasegraves@(protected)

 Back
Please see my most recent reply to Leonard's comments. This should help you
understand the issues that are involved. In the text-extraction procedure I
outlined earlier, I used Reader to produce the PS file from the PDF, thus
skirting around the issues you have.

You should also be aware that use of the text-extraction procedure I outlined
using the procedure would not be permissible if you contemplate using it for a
protected PDF, i.e., one that has protections enabled that you would be
circumventing by using the procedure.

Now, to your problem. You are trying to extend the text-extraction procedure
outlined earlier in this thread to text editing. Please heed Leonard's advice
that editing the text of a PDF is a very bad idea, together with my advice that
if you insist on editing a PDF, you should be prepared to repair the damage
that you do to it.

In another thread, I outlined a procedure for editing a PDF which is known to
work. If you wish to develop your own procedure, please feel free to do so. The
path you are on right now, however, is not likely to be successful.

Best regards,
Bill Segraves

-- ---- ------ Original message from "Sarath Dorbala" <itext.newbie@(protected)>:
-- ---- ------


Hello,
I am sorry if I am digressing. I have a bunch of PDF files in which I just want
to blank out the text. As I read the threads of this particular post, I got an
idea of replacing any textby doing this

(SOME TEXT) Tj to (         ) Tj

But when I converted my pdf file to ps using pdf2ps, I did not see any of such
structures. Still the ps file could be viewed using ghostscript viewer. I dont
know if my PDF is encoded in some sense. Is there any other way text is
rendered in ps?

I have attached my original PDF and converted PS file for your reference.

Sorry if it sounds very naive. I am pretty new to this whole thing.

Thank you,
Sarath
<snip>
<html>
<!-- BEGIN WEBMAIL STATIONERY -->
<head></head>
<body>
<!-- WEBMAIL STATIONERY noneset -->
<DIV></DIV>
<P>Please see my most recent reply to Leonard's comments. This should help you
understand the issues that are involved. In the text-extraction procedure I
outlined earlier, I used Reader to produce the PS file from the PDF, thus
skirting around the issues you have.</P>
<P>&nbsp;</P>
<P>You should also be aware that use of the text-extraction procedure I
outlined using the procedure would not be permissible if you contemplate using
it for a protected PDF, i.e., one&nbsp;that has protections enabled that you
would be circumventing by using the procedure.</P>
<P>&nbsp;</P>
<P>Now, to your problem. You are trying to extend the text-extraction procedure
outlined earlier in this thread to text editing. Please heed Leonard's advice
that editing the text of a PDF is a very bad idea, together with my advice that
if you insist on editing a PDF, you should be prepared to repair the damage
that you do to it.</P>
<P>&nbsp;</P>
<P>In another thread, I outlined a procedure for editing a PDF which is known
to work. If you wish to develop your own procedure, please feel free to do so.
The path you are on right now, however, is not likely to be successful.</P>
<P>&nbsp;</P>
<P>Best regards,</P>
<P>Bill Segraves<BR></P>
<BLOCKQUOTE style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff
2px solid">-- ---- ------ Original message from "Sarath Dorbala" &lt;itext
.newbie@(protected)&gt;: -- ---- ------ <BR><BR>
<DIV>Hello,</DIV>
<DIV>I am sorry if I am digressing. I have a bunch of PDF files in which I just
want to blank out the text. As I read the threads of this particular post, I
got an idea of replacing any textby doing this</DIV>
<DIV>&nbsp;</DIV>
<DIV>(SOME TEXT) Tj to (&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; ) Tj<
/DIV>
<DIV>&nbsp;</DIV>
<DIV>But when I converted my pdf file to ps using pdf2ps, I did not see any of
such structures. Still the&nbsp;ps file could be viewed&nbsp;using&nbsp
;ghostscript viewer. I dont know if&nbsp;my PDF&nbsp;is encoded in some sense.
&nbsp;Is there any other way text is rendered in ps? </DIV>
<DIV>&nbsp;</DIV>
<DIV>I have attached my original PDF and converted PS file for your reference.
</DIV>
<DIV>&nbsp;</DIV>
<DIV>Sorry if it sounds very naive. I am pretty new to this whole thing.</DIV>
<DIV>&nbsp;</DIV>
<DIV>Thank you,</DIV>
<DIV>Sarath<BR>&lt;snip&gt;</DIV></BLOCKQUOTE>
<!-- END WEBMAIL STATIONERY -->

</body>
</html>

-- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- -----
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
__ ____ ____ ____ ____ ____ ____ ____ ____ ____
iText-questions mailing list
iText-questions@(protected)
https://lists.sourceforge.net/lists/listinfo/itext-questions
Buy the iText book: http://itext.ugent.be/itext-in-action/

©2008 junlu.com - Jax Systems, LLC, U.S.A.