process large file

Jimmy Zhang · Apr 5, 2004

I am having some trouble processing a large file (40 MB) in Java. For the
problem I have, I tried SAX but didn't
find it suitable (well, the coding just becomes a little complicated). DOM is
better but carries some memory overhead. Can someone share their
experience in dealing with this kind of situation?

Cheers,
JZ

Toivo Lainevool · Apr 7, 2004

Jimmy Zhang wrote:

JZ> I am having some trouble processing a large file (40 MB) in Java. For the
JZ> problem I have, I tried SAX but didn't
JZ> find it suitable (well, the coding just becomes a little complicated). DOM is
JZ> better but carries some memory overhead. Can someone share their
JZ> experience in dealing with this kind of situation?

One good option is to use a pull parser. It has a simpler interface
than SAX, but doesn't have the memory overhead of DOM.

See http://www.extreme.indiana.edu/xgws/xsoap/xpp/ or
http://www.xmlpull.org/
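
To illustrate the pull style, here is a minimal sketch. Note that it uses StAX (javax.xml.stream), the pull API bundled with current JDKs, not the XPP/XmlPull libraries linked above, so treat it as an analogous example rather than their exact API. The class name PullParseSketch, the method countElements and the sample element names are made up for illustration.

```java
import java.io.Reader;
import java.io.StringReader;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical example class; not part of any library.
class PullParseSketch {

    // Counts elements with the given local name. The caller's loop asks the
    // parser for the next event, so no tree is built and no callbacks are needed.
    static int countElements(Reader input, String localName) throws XMLStreamException {
        XMLInputFactory factory = XMLInputFactory.newInstance();
        XMLStreamReader reader = factory.createXMLStreamReader(input);
        int count = 0;
        try {
            while (reader.hasNext()) {
                int event = reader.next();
                if (event == XMLStreamConstants.START_ELEMENT
                        && localName.equals(reader.getLocalName())) {
                    count++;
                }
            }
        } finally {
            reader.close();
        }
        return count;
    }

    public static void main(String[] args) throws Exception {
        // Small in-memory sample; the element names are invented.
        String xml = "<orders><order id=\"1\"/><order id=\"2\"/><note/></orders>";
        System.out.println(countElements(new StringReader(xml), "order")); // prints 2
    }
}
```

For a real 40 MB file you would pass a FileInputStream to createXMLStreamReader instead of a StringReader. Memory use then depends on what you choose to keep while looping, not on the size of the document.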

Toivo Lainevool
http://www.XMLPatterns.com - Develop effective DTDs and XML Schema
documents for your XML using structural design patterns.

Alexey Shirshov · Apr 8, 2004

Hello, Toivo!
You wrote on 7 Apr 2004 12:09:05 -0700:

[Sorry, skipped]

TL> One good option is to use a pull parser. It has a simpler interface
TL> than SAX, but doesn't have the memory overhead of DOM.

TL> See http://www.extreme.indiana.edu/xgws/xsoap/xpp/ or
TL> http://www.xmlpull.org/

Interesting that MS calls this cursor-model processing (XPathNavigator), and
calls the SAX-like approach the push/pull model (XmlWriter/XmlReader).

With best regards, Alexey Shirshov.

Jimmy Zhang · Apr 9, 2004

How much memory does XPathNavigator consume? I assume it loads everything into
memory like DOM.

Alexey Shirshov · Apr 9, 2004

Hello, Jimmy!
You wrote on Thu, 08 Apr 2004 23:40:00 GMT:

JZ> How much memory does XPathNavigator consume? I assume it loads
JZ> everything in memory like DOM.

Well, XPathNavigator is just an interface (actually an abstract class), so we
can't talk about its performance as such. What matters is the implementation
of this class. XmlDocument, which represents the DOM, implements it in the
DocumentXPathNavigator class.
You can create an instance of this class via the CreateNavigator method.
Another implementation is available via CreateNavigator on the XPathDocument
class.
The first implementation uses the DOM as its underlying data model, while the
second uses the XPath data model.
I think that for very large documents XPathDocument will be much faster.

[Sorry, skipped]

With best regards, Alexey Shirshov.
