Solved

Open URL from batch file and copy the text to a file

Posted on 2014-03-07
9
4,115 Views
Last Modified: 2014-03-11
Hello Experts,

I want to make a batch file which should something like ....

Once run the batch file that should perform the below activities:

Open a URL

copy all the text from the URL
paste it on one text file.


Now when every I run the batch that should do the same operation but if I run 2 times then the text copied will be twice on the same file.

For example i open www.####.com

that URL will have the only say :

ABCEDED

Now when I run it , all the text will be copied to my text file named as test.txt and then when run again this should also do the same thing but if opened the text it should be like below :

output of test.text

time taken <<time when the batch was executed >>:
ABCEDED

time taken <<time when the batch was executed >>:
ABCEDED
0
Comment
Question by:Swadhin Ray
9 Comments
 
LVL 35

Expert Comment

by:Kimputer
ID: 39912204
Get wget from sourceforge > http://sourceforge.net/projects/gnuwin32/files/wget/
Get it working on system wide your PC (put it somewhere with the dependency dll's, preferably in system32)
batch file will have the lines:

time < null >> logfile
wget "url"

The trick is to know which file is created (probably index.html but could be slightly different, so test it)

Next is to have the batch file do:

type index.html >> logfile
del index.html

because otherwise wget will add the file again next time with a different filename (adding ".1" or ".2" etc)
0
 
LVL 83

Expert Comment

by:oBdA
ID: 39912312
"Open" and "paste" is not a big deal. What may be problematic is "copy all the text", as it takes megabytes of browser suites to just "display text" coming from a website. If you don't want the html source code, the plain text interesting you might be extractable, but that requires knowledge of the exact source you want.
The script below uses curl.exe (project page http://curl.haxx.se/download.html, direct Windows download http://www.paehl.com/open_source/?download=curl_735_0_ssl.zip. Unzip curl.exe (no dependencies) from the download and put it either into the script's folder or in any folder that's in the path.
@echo off
setlocal enabledelayedexpansion
set OutFile=C:\Temp\test.txt
set URL=http://checkip.dyndns.org/
>>"%OutFile%" echo --------------------------------------------------------------------------------
>>"%OutFile%" echo Source of 'URL', %Date% %Time%
>>"%OutFile%" curl.exe -s -S %URL%
if errorlevel 1 (
	echo "Download of '%URL%' failed!"
) else (
	echo "Download of %URL% successful."
)
>>"%OutFile%" echo --------------------------------------------------------------------------------

Open in new window

0
 
LVL 69

Expert Comment

by:Qlemo
ID: 39912384
VBS and PowerShell have powerful means to parse web page content, so that would be another option.
0
3 Use Cases for Connected Systems

Our Dev teams are like yours. They’re continually cranking out code for new features/bugs fixes, testing, deploying, testing some more, responding to production monitoring events and more. It’s complex. So, we thought you’d like to see what’s working for us.

 
LVL 16

Author Comment

by:Swadhin Ray
ID: 39912428
@oBda: I tried your solution but when I do it is not selecting all the text that are there when we open in browser :

my output file gets :


--------------------------------------------------------------------------------
Source of 'URL', Fri 03/07/2014 19:23:25.04
--------------------------------------------------------------------------------
--------------------------------------------------------------------------------
Source of 'URL', Fri 03/07/2014 19:23:51.21
--------------------------------------------------------------------------------

Open in new window

0
 
LVL 83

Expert Comment

by:oBdA
ID: 39912564
That's somewhat not enough.
Is there any output on the screen when you run this? If you started the script using a double-click in Explorer, please open a command prompt, enter "cd /d Path to the script", and then run the script again from there.
What's the output when you enter "curl.exe http://checkip.dyndns.org" in the command prompt?
Are you using a proxy to connect to the internet?
In line 6, there are percent signs missing around "URL", but that's just cosmetic.
0
 
LVL 16

Author Comment

by:Swadhin Ray
ID: 39912643
When I run your script I get the below:

--------------------------------------------------------------------------------
Source of 'URL', Fri 03/07/2014 20:27:37.82
--------------------------------------------------------------------------------
--------------------------------------------------------------------------------
Source of 'URL', Fri 03/07/2014 20:27:46.64
--------------------------------------------------------------------------------

Open in new window

0
 
LVL 83

Accepted Solution

by:
oBdA earned 500 total points
ID: 39912686
That's the content of the log file; but what's the output in the command prompt window?
Please open a command prompt by entering "cmd.exe" in the run box, then enter "cd /d Path to the script", and then run the script again from there.
What's the output when you enter curl.exe http://checkip.dyndns.org in the command prompt?
Are you using a proxy to connect to the internet?
0
 
LVL 16

Author Comment

by:Swadhin Ray
ID: 39912779
Ok go it ..

But I have another problem ..
My URL is having username and password .
Once login to the URL I need to open another tab and then open another link for which I need to get the data what the provided script is doing
0
 
LVL 16

Author Closing Comment

by:Swadhin Ray
ID: 39920039
thanks this helps a lot.
0

Featured Post

DevOps Toolchain Recommendations

Read this Gartner Research Note and discover how your IT organization can automate and optimize DevOps processes using a toolchain architecture.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

When you receive another warning that your shared drive is almost full and you have asked your users to clean out old files again and again, here is a single command that may help. This command will place all the files that have not been used rec…
How to remove superseded packages in windows w60 or w61 installation media (.wim) or online system to prevent unnecessary space. w60 means Windows Vista or Windows Server 2008. w61 means Windows 7 or Windows Server 2008 R2. There are various …
In a recent question (https://www.experts-exchange.com/questions/28997919/Pagination-in-Adobe-Acrobat.html) here at Experts Exchange, a member asked how to add page numbers to a PDF file using Adobe Acrobat XI Pro. This short video Micro Tutorial sh…
This video shows how to quickly and easily add an email signature for all users on Exchange 2016. The resulting signature is applied on a server level by Exchange Online. The email signature template has been downloaded from: www.mail-signatures…

770 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question