• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 368
  • Last Modified:

Extract and Save the content loaded in webbrowser control

Hi all ,

           In my project , the HTML and PDF pages are loaded in my Webbrowser control . I extract the inner text of that  HTML page and write it in to a file. But I dont know the way to extract the PDF content From webbrowser control. Im doing my project in VB.NET.If any body knows the solution means tell me.
0
lword
Asked:
lword
2 Solutions
 
r_a_j_e_s_hCommented:
u have to download the pdf file and store it .... u can't extract the content...
0
 
gecko_au2003Commented:
You could take a screen shot of the pdf file and use OCR 3rd party components to convert the image into text ? Not done any of that via programming but have come across things like that. Maybe have a look around on www.planet-source-code.com or www.pscode.com which is the same site I think. www.codeguru.com , www.allapi.net or even vbnet.mvps.org
0

Featured Post

Free Tool: Port Scanner

Check which ports are open to the outside world. Helps make sure that your firewall rules are working as intended.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now