Java Mailing List Archive

http://www.junlu.com/

Google
Google
Mailing List
Home
Forum Home
JBoss - Java Application Server
Tomcat - JSP/Servlet container
Struts - A MVC web framework
iText - An open source PDF Java Library
JDOM - JDOM XML Parser
JSP - A mailing list about Java Server Pages specification and reference
J2EE - A mailing list for Java(tm) 2 Platform, Enterprise Edition
J2EE Pattern - An interest list for Sun Java Center J2EE Pattern Catalog
Servlet - A mailing list for discussion about Sun Microsystem's Java Servlet API Technology
Struts & Hibernate
Subjects
JSP editor plugin for eclipse ?
org apache jasper JasperException: Unable to compile class for JSP
Tomcat: Connection reset by peer: socket write error
Cannot retrieve definition for form bean null
Struts Tiles Tutorial (free Struts training)
Where do I download Tomcat 4 0 6?
Data Access Object (DAO) pattern, example DAO 's
Where to download Tomcat v 4 1 24 from?
Tomcat 5 0 16 Requested resource not available
Servlet : Session invalidate
Oracle Connection Pooling in 3 2 2
Servlet action is currently unavailable
Tomcat/Struts Unicode Encoding/Decoding problems
Running a Simple JMS Example
Tomcat and webapplication specific java library path
Mapping in workers2 properties
org apache jasper JasperException
problem with html:text bean throwing exception
Cannot find message resources under key org apache struts action
   MESSAGE
Cannot find message resources under key org apache struts action MESSAGE
invalid direct reference problem with solution
Tool for jsp debug Try Sysdeo Eclipse Plugin
Tomcat 5 Cannot load JDBC driver class 'null ' SQL state: null
weblogic ejbc
java properties file
Jboss 3 2 3 Coyote Can 't re
Tomcat 5, Apache2 and mod jk2 integration problem
JBoss example problem new to J2EE
Value attribute of <html:checkbox
url string for connecting jboss to oracle
javax servlet ServletException: BeanUtils populate
5 0 18: Windows XP Pro vs Windows 2000
HTTP Status 404 The requested resource is not available
 
skipping a huge text node

skipping a huge text node

2006-06-20       - By Tobias Thierer

 Back
Reply:     1     2     3  

Hi,

I am trying to parse a very large XML document, 99% of which consists of one
huge text node:

 <sequence>ACGGAAAT[...]</sequence>

which is too large to fit into memory. So instead of getting the whole
String returned by the parser (which won't work because it doesn't fit into
memory), I'd like to get just the length of the string and its offset in the
XML file, so that whenever I want to access parts of the sequence, I can
seek to the correct position and read just the substring that I am
interested in.

Is it somehow possible to tell jdom to consume the text node and reporting
its offset in the file and its length, rather than storing it in memory?

I've looked at jdom-contrib which provides an ElementListener interface, but
that one's elementMatched() method is only called *after* the element
(including the close tag) has been fully read. All the classes like
SAXBuilder etc. only seem to handle events that come from the parser, but
what I want to do is change the events that the parser reports.

Is there any chance to do this with jdom(-contrib)? If not, do you know of
any other XML parser with which I could do that?

Cheers,

 Tobias

__ ____ ____ ____ ____ ____ ____ ____ ____ ____
To control your jdom-interest membership:
http://www.jdom.org/mailman/options/jdom-interest/youraddr@(protected)

©2008 junlu.com - Jax Systems, LLC, U.S.A.