Screen scraping and transforming
This message was discovered on ASPFriends.com 'aspngfreeforall' list.

Michael A. Michalski (VIP)
I have used the WebClient object to obtain the text of a web page. What I
would like to do is to take the page and obtain only certain link objects
for display.

I am not certain how to do this other than doing basic string manipulation,
which I would like to avoid.

How would I take the results of DownloadData and walk a DOM tree to obtain
the links? Or is there a better method of doing this? I have found a
couple of links that address the use of WebClient but do not go into parsing
the results to obtain a list of only certain objects within the page.

Thanks!

Michael
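
A minimal sketch of the "walk the parsed page instead of doing string manipulation" idea asked about above. It is written in Python with only the standard library, not in .NET, so it illustrates the approach rather than any WebClient API. The names `LinkCollector`, `extract_links` and `keep`, and the example.com URLs, are made up for this example. The HTML is assumed to be already downloaded and decoded to a string.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects (absolute_url, link_text) for every <a href=...> element."""

    def __init__(self, base_url=""):
        super().__init__()
        self.base_url = base_url
        self.links = []      # finished (url, text) pairs
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page address.
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html, base_url="", keep=lambda url, text: True):
    """Parse html and return only the links for which keep(url, text) is true."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    parser.close()
    return [(url, text) for url, text in parser.links if keep(url, text)]


if __name__ == "__main__":
    # Hypothetical page fragment standing in for the downloaded data.
    sample = ('<p><a href="/news/1.html">First</a> '
              '<a href="http://other.example/">Other</a></p>')
    wanted = extract_links(sample, "http://example.com/",
                           keep=lambda url, text: "/news/" in url)
    print(wanted)
```

The `keep` callback is where "only certain link objects" gets decided. Here it filters on the URL path, but it could just as well test the link text.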

dave wanta (VIP)
Check out the Web services book by Bill Evjen:
http://www.amazon.com/exec/obidos/tg/detail/-/0764548298/qid=1029621605/sr=8-2/ref=sr_8_2/104-0312338-7849535?s=books&n=507846

Ch 23 shows you how to turn a web page into a WSDL document, which allows
you to access an HTML page as Object.Name.

Example:
page.MsnQuote will find the stock quote on one of the msn.com pages.

I think Amazon.com may actually have this posted as a sample chapter from
his book, and Bill may be on the webservices list.

Cheers!
Dave
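
A rough sketch of the Object.Name idea described above, in Python rather than as a WSDL document. It assumes the technique boils down to declaring a named pattern per value and reading each value as a property. That is an assumption, not something taken from the book chapter. `PageFields`, the pattern, and the sample markup are all hypothetical; only the name `MsnQuote` comes from the message.

```python
import re


class PageFields:
    """Exposes named regex captures from a page as attributes (page.Name)."""

    def __init__(self, html, patterns):
        self._html = html
        # One compiled pattern per field; group 1 is the value to return.
        self._patterns = {name: re.compile(pattern, re.IGNORECASE | re.DOTALL)
                          for name, pattern in patterns.items()}

    def __getattr__(self, name):
        # Only reached when normal attribute lookup fails.
        patterns = self.__dict__.get("_patterns", {})
        if name not in patterns:
            raise AttributeError(name)
        match = patterns[name].search(self.__dict__["_html"])
        return match.group(1) if match else None


if __name__ == "__main__":
    # Hypothetical markup; a real quote page will look different.
    html = '<tr><td>Last</td><td class="last"> 52.17 </td></tr>'
    page = PageFields(html, {
        "MsnQuote": r'<td class="last">\s*([\d.]+)\s*</td>',
    })
    print(page.MsnQuote)
```

The patterns are tied to the page's markup, so a layout change on the remote site breaks the field until the pattern is updated.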
----- Original Message -----
From: "Michael A. Michalski"
To: "aspngfreeforall"
Sent: Saturday, August 17, 2002 9:13 AM
Subject: [aspngfreeforall] Screen scraping and transforming

[Original message clipped]
