xml.etree - why no HTMLTreeBuilder included?

Thread starter Jon P.
Start date Sep 26, 2010

Jon P.

Sep 26, 2010

It is great that Fredrik Lundh's ElementTree is now a part of the
Python Standard Library.

However, is it correct that to parse an HTML document with
xml.etree.ElementTree you have to install a separate HTMLTreeBuilder
(e.g. TidyHTMLTreeBuilder), and that the only TreeBuilder that comes
with the Standard Library is the one for XML source?

Seems like some kind of HTMLTreeBuilder ought to be included by
default.

For a script I'm writing that deals with HTML, I thought I could
jettison lxml and use xml.etree instead. But since I would have to
ask the end user to install an external library anyway, even with
xml.etree, I switched back to lxml.
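
To make the gap concrete, here is a rough sketch of what a stdlib-only HTML tree builder might look like. It assumes Python 3 and pairs html.parser.HTMLParser with xml.etree.ElementTree.TreeBuilder. The names SimpleHTMLTreeBuilder, parse_html and VOID are made up for this illustration and are not part of any library. It handles only void elements and missing end tags, not the full HTML parsing rules, so it is no substitute for lxml or a tidy-based builder.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# HTML void elements: they never take a closing tag.
VOID = {"area", "base", "br", "col", "embed", "hr", "img", "input",
        "link", "meta", "source", "track", "wbr"}


class SimpleHTMLTreeBuilder(HTMLParser):
    """Hypothetical helper: forwards HTMLParser events to ET.TreeBuilder."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._builder = ET.TreeBuilder()
        self._open = []  # names of the elements that are currently open

    def handle_starttag(self, tag, attrs):
        # Attributes without a value arrive as None; store them as "".
        self._builder.start(tag, {k: (v or "") for k, v in attrs})
        if tag in VOID:
            self._builder.end(tag)  # close void elements immediately
        else:
            self._open.append(tag)

    def handle_endtag(self, tag):
        if tag not in self._open:
            return  # stray end tag: ignore it
        # Close any elements left open inside this one, then the tag itself.
        while self._open:
            name = self._open.pop()
            self._builder.end(name)
            if name == tag:
                break

    def handle_data(self, data):
        if self._open:  # drop text that falls outside the root element
            self._builder.data(data)

    def close(self):
        super().close()
        while self._open:  # close whatever the document left open
            self._builder.end(self._open.pop())
        return self._builder.close()


def parse_html(text):
    parser = SimpleHTMLTreeBuilder()
    parser.feed(text)
    return parser.close()


doc = "<html><body><h1>Title</h1><p>Hello<br>world<img src=a.png></body></html>"

try:
    ET.fromstring(doc)  # the stock XML parser wants well-formed XML
except ET.ParseError as exc:
    print("XML parser rejects it:", exc)

root = parse_html(doc)
print(root.find("body/h1").text)
print(root.find(".//img").get("src"))
```

The result is an ordinary Element, so find(), iter() and the rest of the ElementTree API work on it as usual. Anything beyond this kind of light tag soup (implicit paragraph closing, misnested tags, encoding sniffing) would still need an external library.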
