Parsing HTML document, how?

Thread starter George K
Start date Sep 24, 2004

George K

Sep 24, 2004

This what my program should do, you give it the URL to a page and a
template file, it downloads that page and then using the template file it
returns some information.

The way I thought of doing it was that the template file uses regex and
then in my program I just do re.search(template, htmlpage) and this would
work but the HTML document has characters like ? and * that I need to
escape in the template, so this solution doesn't work. What is a better
way to accomplish what I want? does Python have any standard library for
this?

The parsing has to be dynamic, from the template file, the URLs are not
fixed.

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Similar Threads

Digital Signature field form in PDF generated document from HTML	5	Nov 16, 2022
External html	2	May 13, 2020
How to push data from one HTML page to another	4	Jan 3, 2024
Changing .html in URL	3	Jul 11, 2022
Parsing files in python	0	Dec 23, 2012
I'm about to get in trouble with the HTML <body></body> tags	10	Aug 12, 2023
Uniquely identifying each & every html template	58	Jan 18, 2013
HTML Parsing	5	Feb 10, 2007

Facebook Twitter Reddit Pinterest Tumblr WhatsApp Email Link

Members online

AustinFairchild

Total: 770 (members: 1, guests: 769)
Robots: 70

Forum statistics

Threads: 474,208

Messages: 2,571,082

Members: 47,683

Latest member: AustinFairchild

Latest Threads

How to Transfer Your Apple Mail Account to MS Outlook?
- Started by Regain@123
- 59 minutes ago
Member posted off topic diatribe about his family. His son just killed his brother
- Started by Justforamoment
- Today at 6:04 AM
Whats the key advantage of an OST to PST converter?
- Started by tarunsaini53
- Yesterday at 2:28 PM
Python output?
- Started by jakey
- Yesterday at 12:54 PM
How to transform Zimbra TGZ emails to PST?
- Started by Regain@123
- Yesterday at 6:04 AM
How to back up Microsoft 365 mailbox to PST?
- Started by pradeepkatiyar
- Yesterday at 5:29 AM
How to Convert Excel Contacts to VCF File Format for iCloud?
- Started by nafaxay326
- Saturday at 9:05 AM
How to convert PST files to MBOX with attachments intact?
- Started by Regain@123
- Friday at 12:25 PM
How to Convert Excel to VCF Format Quickly and Easily?
- Started by xayaci5906
- Friday at 10:29 AM
How to migrate emails from Apple Mail to Outlook?
- Started by Regain@123
- Friday at 7:28 AM

Top