![]() |
|
![]() |
|
|
Thread Tools | Display Modes |
|
|
#1 |
|
Newbie
Join Date: Oct 2004
Posts: 15
Rep Power: 0
![]() |
What's the best way to go about checking a HTML website and gather data from it ina C program? I am having a bit of trouble with Google do to the common string "HTML"
Thanks alot! |
|
|
|
|
|
#2 |
|
Programmer
Join Date: Oct 2004
Location: Canada
Posts: 82
Rep Power: 5
![]() |
You can search a text file and find all of the URL's and stuff like that, but there is no library in Standard C that involves web programming facilities. Maybe try C in conjunction with Python or Perl. I think that Perl is probably the best language to use for this type of program.
|
|
|
|
|
|
#3 |
|
Expert Programmer
|
You can get CURL for C, which will connect to a server for you... intiate a request and retreive data from a variety of resource types, including HTTP and HTTPS.
From that point you would need to find another library to process the HTML, you can likely find a few DOM (Document Object Model) HTML parsers out there. The other object is that you can embed an MSIE AtciveX control into your application (Has to be window gui based though) and set the control to hidden if you do not want it to be seen. Send the request and once the request is complete use one of its many COM interfaces to extract the code. I have done this before myself (only likely not for the same purpose, and my controls were not hidden) so I know for fact that it is possible.
__________________
Clifford Matthew Roche <geek@cliffordroche.com> Web Hosting: http://www.crd-hosting.com Consulting: http://www.crdev-consulting.com |
|
|
|
|
|
#4 |
|
Newbie
Join Date: Oct 2004
Posts: 15
Rep Power: 0
![]() |
|
|
|
|
![]() |
| Bookmarks |
| Currently Active Users Viewing This Thread: 1 (0 members and 1 guests) | |
| Thread Tools | |
| Display Modes | |
|
|