new office formats, REs and Python

tubby

How are Python users dealing with some of the new OASIS Open Document
formats (Open Office) or MS Open XML formats? These formats store data
in a file which is actually a zip archive that contains numerous files
and folders. For example, a file saved from Open Office 2.0 named
'test.odt' can be unzipped into many parts: content.xml, meta.xml,
settings.xml, etc.

How can I read these files and do re searching and matching on them?
content.xml may contain a string I'm searching for, but how would I tie
that match back to the original 'test.odt' file?

Perl seems to have a module to handle this, does Python?

http://search.cpan.org/dist/OpenOffice-OODoc/
 
Fredrik Lundh

tubby said:
How are Python users dealing with some of the new OASIS Open Document
formats (Open Office) or MS Open XML formats? These formats store data
in a file which is actually a zip archive that contains numerous files
and folders. For example, a file saved from Open Office 2.0 named
'test.odt' can be unzipped into many parts: content.xml, meta.xml,
settings.xml, etc.

How can I read these files and do re searching and matching on them?
content.xml may contain a string I'm searching for, but how would I tie
that match back to the original 'test.odt' file?

http://ooopy.sourceforge.net/
http://www.google.com/search?q=python+open+office
etc

</F>
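
If a dedicated library is more than you need, the standard-library zipfile and re modules may be enough for simple searching. The following is only a rough sketch, not how ooopy does it: the helper names search_odt and rewrite_odt are made up for illustration, and it assumes content.xml is UTF-8 encoded. Because the match comes from reading a member of 'test.odt' directly, the path you passed in is the link back to the original file.

```python
import re
import zipfile


def search_odt(path, pattern, member="content.xml"):
    """Return all regex matches found in one XML member of the archive.

    Hypothetical helper; assumes the member is UTF-8 encoded XML.
    """
    with zipfile.ZipFile(path) as zf:
        xml = zf.read(member).decode("utf-8")
    return re.findall(pattern, xml)


def rewrite_odt(src, dst, pattern, repl, member="content.xml"):
    """Copy src to dst, applying re.sub to one member only.

    Hypothetical helper. Every other member is copied unchanged, in the
    original order, and each entry keeps the compression method recorded
    in its ZipInfo.
    """
    with zipfile.ZipFile(src) as zin, zipfile.ZipFile(dst, "w") as zout:
        for info in zin.infolist():
            data = zin.read(info.filename)
            if info.filename == member:
                text = data.decode("utf-8")
                data = re.sub(pattern, repl, text).encode("utf-8")
            zout.writestr(info, data)
```

Something like search_odt('test.odt', r'some string') would then return the matches, and rewrite_odt('test.odt', 'out.odt', r'old', 'new') would write a modified copy. Note that the regex runs over raw XML, so a phrase that the editor has split across several tags will not match; for anything beyond simple cases an XML parser or a library such as ooopy is probably the safer route.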
 
