Java Mailing List Archive

http://www.junlu.com/

Google
Google
Mailing List
Home
Forum Home
JBoss - Java Application Server
Tomcat - JSP/Servlet container
Struts - A MVC web framework
iText - An open source PDF Java Library
JDOM - JDOM XML Parser
JSP - A mailing list about Java Server Pages specification and reference
J2EE - A mailing list for Java(tm) 2 Platform, Enterprise Edition
J2EE Pattern - An interest list for Sun Java Center J2EE Pattern Catalog
Servlet - A mailing list for discussion about Sun Microsystem's Java Servlet API Technology
Struts & Hibernate
Subjects
JSP editor plugin for eclipse ?
org apache jasper JasperException: Unable to compile class for JSP
Tomcat: Connection reset by peer: socket write error
Cannot retrieve definition for form bean null
Struts Tiles Tutorial (free Struts training)
Where do I download Tomcat 4 0 6?
Data Access Object (DAO) pattern, example DAO 's
Where to download Tomcat v 4 1 24 from?
Tomcat 5 0 16 Requested resource not available
Servlet : Session invalidate
Oracle Connection Pooling in 3 2 2
Servlet action is currently unavailable
Tomcat/Struts Unicode Encoding/Decoding problems
Running a Simple JMS Example
Tomcat and webapplication specific java library path
Mapping in workers2 properties
org apache jasper JasperException
problem with html:text bean throwing exception
Cannot find message resources under key org apache struts action
   MESSAGE
Cannot find message resources under key org apache struts action MESSAGE
invalid direct reference problem with solution
Tool for jsp debug Try Sysdeo Eclipse Plugin
Tomcat 5 Cannot load JDBC driver class 'null ' SQL state: null
weblogic ejbc
java properties file
Jboss 3 2 3 Coyote Can 't re
Tomcat 5, Apache2 and mod jk2 integration problem
JBoss example problem new to J2EE
Value attribute of <html:checkbox
url string for connecting jboss to oracle
javax servlet ServletException: BeanUtils populate
5 0 18: Windows XP Pro vs Windows 2000
HTTP Status 404 The requested resource is not available
 
-none-

-none-

2007-10-07       - By wasegraves@(protected)

 Back
If the PDF is locked with a password, but still printable, the approach offered
by this author is one that would work, while attempting to use this approach on
the original PDF would fail. This author was simply trying to help the poster
with an approach that would avoid the frustration that would ensue if he tried
to work with an original locked PDF.

Of course, the approach espoused by the esteemed sage would be easier, for both
unlocked and unlocked PDFs. OTOH, this author doesn't count easier to fail as
an acceptable approach.

Cheers,
Bill Segraves

-- ---- ------ Original message from Leonard Rosenthol <leonardr@(protected)>:
-- ---- ------


> Why would working through the PostScript be easier than doing this on
> the original PDF?
>
> You can get to all the PDF operators just fine.
> Font & text information is more easily referenceable from the PDF
> PostScript also has "XObjects", Patterns, etc. that may contain text.
> etc.
>
> Not understanding the logic :(.
>
> Leonard
>
>
> On Oct 6, 2007, at 4:53 PM, wasegraves@(protected) wrote:
>
> > Yes; but it is not practicable with iText. You could, however, as
> > long as the PDF is printable, use the following procedure:
> >
> > 1. Print to a PS file.
> >
> > 2. Scan the PS file from step1 above, dropping all lines that
> > do not end with Tj or TJ.
> >
> > 3. Use a regular expression (together with Substitution or
> > Match) to extract the instances of "text fragment" from within
> > multiple instances of "(text fragment)Tj", printing the resulting
> > text fragments to STDOUT.
> >
> > Bruno has given an excellent example of why you should not expect
> > the resulting output to make sense, i.e., the text fragments may
> > not appear in the order in which you'd like for them to appear.
> >
> > Cheers,
> >
> > Bill Segraves
> >
> > -- ---- ------ Original message from krammark
> > : -- ---- ------
> >
> >
> > >
> > > so , how we read the data from pdf ?
> > > i mean , can we read them line by line from the specific pages ?
> > >
> > > thanks buddy.
> > >
> > >
> > > Bruno Lowagie (iText) wrote:
> > > >
> > > > krammark wrote:
> > > >> hey gusy,
> > > >> do u guys have a idea how to read the data from pdf pages
> > using itext ?
> > > >> basically, i want to read the data from table and write them
> > into excel
> > > >> files.
> > > >> is that possible ?
> > > >
> > > > There is no such thing as 'a table' in plain PDF.
> > > > It's just lines and words painted on a canvas,
> > > > possible in an arbitrary order.
> > > >
> > > > Unless your tables cells are form fields, or your
> > > ; > PDF contains specific table structures (Tagged PDF),
> > > > iText probably won't help you.
> > > >
> > > > br,
> > > > Bruno
> > > >
> > > >
> > -- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- --
> > ---
> > > > This SF.net email is sponsored by: Splunk Inc.
> > > > Still grepping through log files to find problems? Stop.
> > > > Now Search log events and configuration files using AJAX and a
> > browser.
> > > > Download your FREE copy of Splunk now >> http://get.splunk.com/
> > > > __ ____ ____ ____ ____ ____ ____ ____ ____ ____
> > > > iText-questions mailing list
> > > > iText-questions@(protected)
> > > > https://lists.sourceforge.net/lists/listinfo/itext-questions
> > > > Buy the iText book: http://itext.ugent.be/itext-in-action/
> > > >
> > > >
> > >
> > > --
> > > View this message in context:
> > > http://www.nabble.com/u-guys-konw -how-t o-read-the-data-from-pdf-
> > using-java-itext
> > > ---tf4572506.html#a13067937
> > > Sent from the iText - General mailing list archive at Nabble.com.
> > >
> > >
> > >
> > -- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- --
> > ---
> > > This SF.net email is sponsored by: Splunk Inc.
> > > Still grepping through log files to find problems? Stop.
> > > Now Search log events and configuration files using AJAX and a
> > browser.
> > > Download your FREE copy of Splunk now >> http://get.splunk.com/
> > > __ ____ ____ ____ ____ ____ ____ ____ ____ ____
> > > iText-questions mailing list
> > > iText-questions@(protected)
> > > https://lists.sourceforge.net/lists/listinfo/itext-questions
> > > Buy the iText book: http://itext.ugent.be/itext-in-action/
> > -- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- --
> > ---
> > This SF.net email is sponsored by: Splunk Inc.
> > Still grepping through log files to find problems? Stop.
> > Now Search log events and configuration files using AJAX and a
> > browser.
> > Download your FREE copy of Splunk now >> http://get.splunk.com/
> > __ ____ ____ ____ ____ ____ ____ ____ ____ ____
> > iText-questions mailing list
> > iText-questions@(protected)
> > https://lists.sourceforge.net/lists/listinfo/itext-questions
> > Buy the iText book: http://itext.ugent.be/itext-in-action/
>
>
> -- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- -----
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems? Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> __ ____ ____ ____ ____ ____ ____ ____ ____ ____
> iText-questions mailing list
> iText-questions@(protected)
> https://lists.sourceforge.net/lists/listinfo/itext-questions
> Buy the iText book: http://itext.ugent.be/itext-in-action/
<html>
<!-- BEGIN WEBMAIL STATIONERY -->
<head></head>
<body>
<!-- WEBMAIL STATIONERY noneset -->
<DIV></DIV>
<P>If the PDF is locked with a password, but still printable, the approach
offered by this author is one that would work, while attempting to use this
approach on the original PDF would fail. This author was simply trying to help
the poster with an approach that would avoid the frustration that would ensue
if he tried to work with an original locked PDF.</P>
<P>&nbsp;</P>
<P>Of course, the approach espoused by the esteemed sage would be easier, for
both unlocked and unlocked PDFs. OTOH, this author doesn't count easier to fail
as an acceptable approach.</P>
<P>&nbsp;</P>
<P>Cheers,</P>
<P>Bill Segraves<BR></P>
<BLOCKQUOTE style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff
2px solid">-- ---- ------ Original message from Leonard Rosenthol &lt;leonardr
@(protected)&gt;: -- ---- ------ <BR><BR><BR>&gt; Why would working through the
PostScript be easier than doing this on <BR>&gt; the original PDF? <BR>&gt; <BR
>&gt; You can get to all the PDF operators just fine. <BR>&gt; Font &amp; text
information is more easily referenceable from the PDF <BR>&gt; PostScript also
has "XObjects", Patterns, etc. that may contain text. <BR>&gt; etc. <BR>&gt;
<BR>&gt; Not understanding the logic :(. <BR>&gt; <BR>&gt; Leonard <BR>&gt; <BR>
&gt; <BR>&gt; On Oct 6, 2007, at 4:53 PM, wasegraves@(protected) wrote: <BR>
&gt; <BR>&gt; &gt; Yes; but it is not practicable with iText. You could, however
, as <BR>&gt; &gt; long as the PDF is printable, use the following procedure:
<BR>&gt; &gt; <BR>&gt; &gt; 1. Print to a PS file. <BR>&gt; &gt; <BR>&gt; &gt; 2
. Scan the PS file from step1 above, droppin
g all
lines that <BR>&gt; &gt; do not end with Tj or TJ. <BR>&gt; &gt; <BR>&gt; &gt;
3. Use a regular expression (together with Substitution or <BR>&gt; &gt; Match)
to extract the instances of "text fragment" from within <BR>&gt; &gt; multiple
instances of "(text fragment)Tj", printing the resulting <BR>&gt; &gt; text
fragments to STDOUT. <BR>&gt; &gt; <BR>&gt; &gt; Bruno has given an excellent
example of why you should not expect <BR>&gt; &gt; the resulting output to make
sense, i.e., the text fragments may <BR>&gt; &gt; not appear in the order in
which you'd like for them to appear. <BR>&gt; &gt; <BR>&gt; &gt; Cheers, <BR>
&gt; &gt; <BR>&gt; &gt; Bill Segraves <BR>&gt; &gt; <BR>&gt; &gt; -- ---- ------
Original message from krammark <BR>&gt; &gt; <WENWEN_829@(protected)>: -- -----
-- --- <BR>&gt; &gt; <BR>&gt; &gt; <BR>&gt; &gt; &gt; <BR>&gt; &gt; &gt; so ,
how we read the data from pdf ? <BR>&gt; &gt; &gt; i mean , can we read them
line by line from the specific pages ? <BR>&gt; &
gt; &g
t; <BR>&gt; &gt; &gt; thanks buddy. <BR>&gt; &gt; &gt; <BR>&gt; &gt; &gt; <BR>
&gt; &gt; &gt; Bruno Lowagie (iText) wrote: <BR>&gt; &gt; &gt; &gt; <BR>&gt; &gt
; &gt; &gt; krammark wrote: <BR>&gt; &gt; &gt; &gt;&gt; hey gusy, <BR>&gt; &gt;
&gt; &gt;&gt; do u guys have a idea how to read the data from pdf pages <BR>&gt;
&gt; using itext ? <BR>&gt; &gt; &gt; &gt;&gt; basically, i want to read the
data from table and write them <BR>&gt; &gt; into excel <BR>&gt; &gt; &gt; &gt;
&gt; files. <BR>&gt; &gt; &gt; &gt;&gt; is that possible ? <BR>&gt; &gt; &gt;
&gt; <BR>&gt; &gt; &gt; &gt; There is no such thing as 'a table' in plain PDF.
<BR>&gt; &gt; &gt; &gt; It's just lines and words painted on a canvas, <BR>&gt;
&gt; &gt; &gt; possible in an arbitrary order. <BR>&gt; &gt; &gt; &gt; <BR>&gt;
&gt; &gt; &gt; Unless your tables cells are form fields, or your <BR>&gt; &gt;
&gt; ; &gt; PDF contains specific table structures (Tagged PDF), <BR>&gt; &gt;
&gt; &gt; iText probably won't help you.
 <BR>&
gt; &gt; &gt; &gt; <BR>&gt; &gt; &gt; &gt; br, <BR>&gt; &gt; &gt; &gt; Bruno
<BR>&gt; &gt; &gt; &gt; <BR>&gt; &gt; &gt; &gt; <BR>&gt; &gt; -- ---- ---- -----
-- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- <BR>&gt; &gt; --- <BR>&gt;
&gt; &gt; &gt; This SF.net email is sponsored by: Splunk Inc. <BR>&gt; &gt; &gt;
&gt; Still grepping through log files to find problems? Stop. <BR>&gt; &gt; &gt
; &gt; Now Search log events and configuration files using AJAX and a <BR>&gt;
&gt; browser. <BR>&gt; &gt; &gt; &gt; Download your FREE copy of Splunk now &gt;
&gt; http://get.splunk.com/ <BR>&gt; &gt; &gt; &gt; __ ____ ____ ____ ____ _____
__ ____ ____ ______ <BR>&gt; &gt; &gt; &gt; iText-questions mailing list <BR>&gt
; &gt; &gt; &gt; iText-questions@(protected) <BR>&gt; &gt; &gt; &gt;
https://lists.sourceforge.net/lists/listinfo/itext-questions <BR>&gt; &gt; &gt;
&gt; Buy the iText book: http://itext.ugent.be/itext-in-action/ <BR>&gt; &gt;
&gt; &gt; <BR>&gt; &gt; &gt; &gt; <BR
>&gt;
&gt; &gt; <BR>&gt; &gt; &gt; -- <BR>&gt; &gt; &gt; View this message in context
: <BR>&gt; &gt; &gt; http://www.nabble.com/u-guys-konw -how-t o-read-the-data
-from-pdf- <BR>&gt; &gt; using-java-itext <BR>&gt; &gt; &gt; ---tf4572506.html
#a13067937 <BR>&gt; &gt; &gt; Sent from the iText - General mailing list archive
at Nabble.com. <BR>&gt; &gt; &gt; <BR>&gt; &gt; &gt; <BR>&gt; &gt; &gt; <BR>&gt
; &gt; -- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- --
<BR>&gt; &gt; --- <BR>&gt; &gt; &gt; This SF.net email is sponsored by: Splunk
Inc. <BR>&gt; &gt; &gt; Still grepping through log files to find problems? Stop
. <BR>&gt; &gt; &gt; Now Search log events and configuration files using AJAX
and a <BR>&gt; &gt; browser. <BR>&gt; &gt; &gt; Download your FREE copy of
Splunk now &gt;&gt; http://get.splunk.com/ <BR>&gt; &gt; &gt; __ ____ ____ ____
__ ____ ____ ____ ____ ____ __ <BR>&gt; &gt; &gt; iText-questions mailing list
<BR>&gt; &gt; &gt; iText-questions@(protected)
rge.ne
t <BR>&gt; &gt; &gt; https://lists.sourceforge.net/lists/listinfo/itext
-questions <BR>&gt; &gt; &gt; Buy the iText book: http://itext.ugent.be/itext-in
-action/ <BR>&gt; &gt; -- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ----
-- ---- ----- <BR>&gt; &gt; --- <BR>&gt; &gt; This SF.net email is sponsored by:
Splunk Inc. <BR>&gt; &gt; Still grepping through log files to find problems?
Stop. <BR>&gt; &gt; Now Search log events and configuration files using AJAX
and a <BR>&gt; &gt; browser. <BR>&gt; &gt; Download your FREE copy of Splunk
now &gt;&gt; http://get.splunk.com/ <BR>&gt; &gt; __ ____ ____ ____ ____ ______
__ ____ ____ _____ <BR>&gt; &gt; iText-questions mailing list <BR>&gt; &gt;
iText-questions@(protected) <BR>&gt; &gt; https://lists.sourceforge
.net/lists/listinfo/itext-questions <BR>&gt; &gt; Buy the iText book: http:/
/itext.ugent.be/itext-in-action/ <BR>&gt; <BR>&gt; <BR>&gt; -- ---- ---- ---- --
-- ---- ---- ---- ---- ---- ---- ---- ---- ---- ----- <BR>&
gt; Th
is SF.net email is sponsored by: Splunk Inc. <BR>&gt; Still grepping through
log files to find problems? Stop. <BR>&gt; Now Search log events and
configuration files using AJAX and a browser. <BR>&gt; Download your FREE copy
of Splunk now &gt;&gt; http://get.splunk.com/ <BR>&gt; __ ____ ____ ____ ______
__ ____ ____ ____ _____ <BR>&gt; iText-questions mailing list <BR>&gt; iText
-questions@(protected) <BR>&gt; https://lists.sourceforge.net/lists
/listinfo/itext-questions <BR>&gt; Buy the iText book: http://itext.ugent.be
/itext-in-action/ </BLOCKQUOTE>
<!-- END WEBMAIL STATIONERY -->

</body>
</html>

-- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- -----
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
__ ____ ____ ____ ____ ____ ____ ____ ____ ____
iText-questions mailing list
iText-questions@(protected)
https://lists.sourceforge.net/lists/listinfo/itext-questions
Buy the iText book: http://itext.ugent.be/itext-in-action/

©2008 junlu.com - Jax Systems, LLC, U.S.A.