Solved

Open URL from batch file and copy the text to a file

Posted on 2014-03-07
9
4,001 Views
Last Modified: 2014-03-11
Hello Experts,

I want to make a batch file which should something like ....

Once run the batch file that should perform the below activities:

Open a URL

copy all the text from the URL
paste it on one text file.


Now when every I run the batch that should do the same operation but if I run 2 times then the text copied will be twice on the same file.

For example i open www.####.com

that URL will have the only say :

ABCEDED

Now when I run it , all the text will be copied to my text file named as test.txt and then when run again this should also do the same thing but if opened the text it should be like below :

output of test.text

time taken <<time when the batch was executed >>:
ABCEDED

time taken <<time when the batch was executed >>:
ABCEDED
0
Comment
Question by:Swadhin Ray
9 Comments
 
LVL 35

Expert Comment

by:Kimputer
ID: 39912204
Get wget from sourceforge > http://sourceforge.net/projects/gnuwin32/files/wget/
Get it working on system wide your PC (put it somewhere with the dependency dll's, preferably in system32)
batch file will have the lines:

time < null >> logfile
wget "url"

The trick is to know which file is created (probably index.html but could be slightly different, so test it)

Next is to have the batch file do:

type index.html >> logfile
del index.html

because otherwise wget will add the file again next time with a different filename (adding ".1" or ".2" etc)
0
 
LVL 83

Expert Comment

by:oBdA
ID: 39912312
"Open" and "paste" is not a big deal. What may be problematic is "copy all the text", as it takes megabytes of browser suites to just "display text" coming from a website. If you don't want the html source code, the plain text interesting you might be extractable, but that requires knowledge of the exact source you want.
The script below uses curl.exe (project page http://curl.haxx.se/download.html, direct Windows download http://www.paehl.com/open_source/?download=curl_735_0_ssl.zip. Unzip curl.exe (no dependencies) from the download and put it either into the script's folder or in any folder that's in the path.
@echo off
setlocal enabledelayedexpansion
set OutFile=C:\Temp\test.txt
set URL=http://checkip.dyndns.org/
>>"%OutFile%" echo --------------------------------------------------------------------------------
>>"%OutFile%" echo Source of 'URL', %Date% %Time%
>>"%OutFile%" curl.exe -s -S %URL%
if errorlevel 1 (
	echo "Download of '%URL%' failed!"
) else (
	echo "Download of %URL% successful."
)
>>"%OutFile%" echo --------------------------------------------------------------------------------

Open in new window

0
 
LVL 68

Expert Comment

by:Qlemo
ID: 39912384
VBS and PowerShell have powerful means to parse web page content, so that would be another option.
0
 
LVL 16

Author Comment

by:Swadhin Ray
ID: 39912428
@oBda: I tried your solution but when I do it is not selecting all the text that are there when we open in browser :

my output file gets :


--------------------------------------------------------------------------------
Source of 'URL', Fri 03/07/2014 19:23:25.04
--------------------------------------------------------------------------------
--------------------------------------------------------------------------------
Source of 'URL', Fri 03/07/2014 19:23:51.21
--------------------------------------------------------------------------------

Open in new window

0
3 Use Cases for Connected Systems

Our Dev teams are like yours. They’re continually cranking out code for new features/bugs fixes, testing, deploying, testing some more, responding to production monitoring events and more. It’s complex. So, we thought you’d like to see what’s working for us.

 
LVL 83

Expert Comment

by:oBdA
ID: 39912564
That's somewhat not enough.
Is there any output on the screen when you run this? If you started the script using a double-click in Explorer, please open a command prompt, enter "cd /d Path to the script", and then run the script again from there.
What's the output when you enter "curl.exe http://checkip.dyndns.org" in the command prompt?
Are you using a proxy to connect to the internet?
In line 6, there are percent signs missing around "URL", but that's just cosmetic.
0
 
LVL 16

Author Comment

by:Swadhin Ray
ID: 39912643
When I run your script I get the below:

--------------------------------------------------------------------------------
Source of 'URL', Fri 03/07/2014 20:27:37.82
--------------------------------------------------------------------------------
--------------------------------------------------------------------------------
Source of 'URL', Fri 03/07/2014 20:27:46.64
--------------------------------------------------------------------------------

Open in new window

0
 
LVL 83

Accepted Solution

by:
oBdA earned 500 total points
ID: 39912686
That's the content of the log file; but what's the output in the command prompt window?
Please open a command prompt by entering "cmd.exe" in the run box, then enter "cd /d Path to the script", and then run the script again from there.
What's the output when you enter curl.exe http://checkip.dyndns.org in the command prompt?
Are you using a proxy to connect to the internet?
0
 
LVL 16

Author Comment

by:Swadhin Ray
ID: 39912779
Ok go it ..

But I have another problem ..
My URL is having username and password .
Once login to the URL I need to open another tab and then open another link for which I need to get the data what the provided script is doing
0
 
LVL 16

Author Closing Comment

by:Swadhin Ray
ID: 39920039
thanks this helps a lot.
0

Featured Post

3 Use Cases for Connected Systems

Our Dev teams are like yours. They’re continually cranking out code for new features/bugs fixes, testing, deploying, testing some more, responding to production monitoring events and more. It’s complex. So, we thought you’d like to see what’s working for us.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

If like me you are one who spends a lot of time working and scripting with cmd.exe, sometimes it is handy to be able to quickly view a calendar for a given month and year. This script will quickly do just that!  Save the code posted below to a .bat …
I have published numerous articles here at Experts Exchange that present programs/scripts written in a language called AutoHotkey. Each of those articles has a brief paragraph describing where to download the product and how to install it. I have al…
Internet Business Fax to Email Made Easy - With  eFax Corporate (http://www.enterprise.efax.com), you'll receive a dedicated online fax number, which is used the same way as a typical analog fax number. You'll receive secure faxes in your email, f…
Learn how to create flexible layouts using relative units in CSS.  New relative units added in CSS3 include vw(viewports width), vh(viewports height), vmin(minimum of viewports height and width), and vmax (maximum of viewports height and width).

864 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

23 Experts available now in Live!

Get 1:1 Help Now