Htmlcxx & HTML Parsing

Choi

Hi!

I searched for ways to parse HTML in C++ and found one:

http://htmlcxx.sf.net

However, I can't get the provided example to compile. Maybe a
compilation option is missing.

Has anyone already used htmlcxx? Or do you know how I can extract
information (such as URLs) from HTML web pages using C++?

Thanks guys...
 
building htmlcxx

I have built (and rebuilt) htmlcxx under Cygwin on Windows XP. On this platform, there were errors in the "libtool" script, as well as in the argument types of a couple of functions in the html subdirectory.

For "libtool", I had to add the following line:
echo="$ECHO"

The function argument changes are pretty simple.

The build did not work with the older Cygwin components I had on my system. You have to update the Cygwin libraries and the gcc executable before building.

But once you do this, the program and libraries build fine. They do an excellent job of parsing HTML tags, content, and, I believe, attribute content.
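
As for the original question about pulling URLs out of a page: if getting htmlcxx to build is the blocker, a rough fallback is to scan for href attributes with the standard library alone. The sketch below is not htmlcxx and not a real HTML parser. It is a plain std::regex scan, and the function name extract_hrefs is made up for illustration. It only handles quoted attribute values and will also pick up hrefs that sit inside comments or scripts.

```cpp
#include <regex>
#include <string>
#include <vector>

// Hypothetical helper: collect the values of href="..." or href='...'
// attributes found anywhere in `html`.
// Assumption: a regex scan is good enough for the pages at hand; unquoted
// attribute values are ignored, and matches inside comments or scripts
// are not filtered out.
std::vector<std::string> extract_hrefs(const std::string& html) {
    // Group 1 captures the opening quote, group 2 the URL up to the
    // matching closing quote. Matching is case-insensitive (HREF, href).
    static const std::regex href_re(R"(href\s*=\s*(["'])(.*?)\1)",
                                    std::regex::icase);
    std::vector<std::string> urls;
    for (std::sregex_iterator it(html.begin(), html.end(), href_re), end;
         it != end; ++it) {
        urls.push_back((*it)[2].str());
    }
    return urls;
}
```

Calling extract_hrefs on the contents of a downloaded page returns the link targets in document order. For anything beyond simple pages, a proper parser such as htmlcxx is the more robust choice.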
 
