Advertisement

05.10.2008 at 07:10AM PDT, ID: 23391641
[x]
Attachment Details

VB.net MultiThreaded Web Page Parsing application - how to parse multiple url's simultaneously

Asked by marclindsay1 in Microsoft Visual Basic.Net, Visual Studio .NET 2005, .NET Framework 2.0

Tags: vb.net multithreading web requests, Use multithreading application to request multiple url's simultaneously and parse data

Hi Guys,

I am a VB.net newbie and am learning through creating applications I can actually use.  Now my application is designed to crawl certain url's and extract needed information from them.

It is working fine in one threaded mode, but when you have to do 17k url's and extract information from 3 key seed url's (yahoo and google) then it needs multithreading.

For the parsing we are currently using webbrowser control but I realize now this is definately not going to work for multithreading.

We are currently using Document.GetAttribute type calls to exctract the information we need (href tags and innerhtml etc from links within the document).

What I need is to be able to say start 10 threads (or enter number of threads to use in main box in gui though I can add that later if we work out the thread part first).

That will read from the dataset containing the url's sequentially, so first thread grabs first url from first row, second will know that the first url has been taken so grab the next one in line, and so on.

Then each thread should be able to get the url and parse the contents of it so we can then extrac html links etc etc.  Then write the extracted data back to the table.

So who can help me with setting up a multi url parsing multi threaded code to get moving?

I would be greatly appreciative.

ThanksStart Free Trial
 
Loading Advertisement...
 
[+][-]05.10.2008 at 11:42AM PDT, ID: 21540117

At Experts Exchange, members can ask their questions to thousands of technology professionals, also known as Experts. Experts compete and collaborate to answer those questions by leaving comments like this one.

Start your 7-day free trial to view this Expert Comment or ask the Experts your question.

 
[+][-]05.10.2008 at 11:53AM PDT, ID: 21540144

At Experts Exchange, members can ask their questions to thousands of technology professionals, also known as Experts. Experts compete and collaborate to answer those questions by leaving comments like this one.

Start your 7-day free trial to view this Expert Comment or ask the Experts your question.

 
[+][-]05.10.2008 at 11:56AM PDT, ID: 21540152

At Experts Exchange, members can ask their questions to thousands of technology professionals, also known as Experts. Experts compete and collaborate to answer those questions by leaving comments like this one.

Start your 7-day free trial to view this Expert Comment or ask the Experts your question.

 
[+][-]05.10.2008 at 04:10PM PDT, ID: 21540888

Often, when Experts are collaborating with members who have asked questions, they will request additional information about the problem. Askers respond with an author comment like this one.

Start your 7-day free trial to view this Author Comment or ask the Experts your question.

 
[+][-]05.10.2008 at 04:30PM PDT, ID: 21540933

At Experts Exchange, members can ask their questions to thousands of technology professionals, also known as Experts. Experts compete and collaborate to answer those questions by leaving comments like this one.

Start your 7-day free trial to view this Expert Comment or ask the Experts your question.

 
[+][-]05.14.2008 at 02:46PM PDT, ID: 21569024

Often, when Experts are collaborating with members who have asked questions, they will request additional information about the problem. Askers respond with an author comment like this one.

Start your 7-day free trial to view this Author Comment or ask the Experts your question.

 
[+][-]05.14.2008 at 02:48PM PDT, ID: 21569031

Often, when Experts are collaborating with members who have asked questions, they will request additional information about the problem. Askers respond with an author comment like this one.

Start your 7-day free trial to view this Author Comment or ask the Experts your question.

 
[+][-]05.14.2008 at 03:17PM PDT, ID: 21569193

At Experts Exchange, members can ask their questions to thousands of technology professionals, also known as Experts. Experts compete and collaborate to answer those questions by leaving comments like this one.

Start your 7-day free trial to view this Expert Comment or ask the Experts your question.

 
[+][-]05.14.2008 at 03:33PM PDT, ID: 21569277

Often, when Experts are collaborating with members who have asked questions, they will request additional information about the problem. Askers respond with an author comment like this one.

Start your 7-day free trial to view this Author Comment or ask the Experts your question.

 
[+][-]05.14.2008 at 04:34PM PDT, ID: 21569565

View this solution now by starting your 7-day free trial. Setting up your free trial is quick, easy, and secure. We will return you to this solution, unlocked, when you're done.

 

About this solution

Zones: Microsoft Visual Basic.Net, Visual Studio .NET 2005, .NET Framework 2.0
Tags: vb.net multithreading web requests, Use multithreading application to request multiple url's simultaneously and parse data
Sign Up Now!
Solution Provided By: DarkoLord
Participating Experts: 2
Solution Grade: A
 
 
[+][-]05.15.2008 at 02:33AM PDT, ID: 21571795

Often, when Experts are collaborating with members who have asked questions, they will request additional information about the problem. Askers respond with an author comment like this one.

Start your 7-day free trial to view this Author Comment or ask the Experts your question.

 
[+][-]05.15.2008 at 04:13AM PDT, ID: 21572298

At Experts Exchange, members can ask their questions to thousands of technology professionals, also known as Experts. Experts compete and collaborate to answer those questions by leaving comments like this one.

Start your 7-day free trial to view this Expert Comment or ask the Experts your question.

 
 
Loading Advertisement...
20080716-EE-VQP-32 / EE_QW_2_20070628