Solved

How do I get curl to pull back a page that has options on it after a redirect?

Posted on 2011-09-27
6
437 Views
Last Modified: 2013-11-13
I am using bash script with curl in Ubuntu 11.04 to capture a web page. I am new to this and this seems to be a complicated case in that it has a redirect and then options on the page for the data. I can't seem to get the options to change. I have tried both -b to send the cookie and -d to send a setting. Perhaps its the order I am doing this as I also have a -L for the redirect and -A to send as Chrome.

So the page has a drop down for the language and it defaults to a language I can't use. I want to select a different language in that drop down. When I do this on the web page, the URL does not change at all.
0
Comment
Question by:ccbe
  • 3
  • 2
6 Comments
 
LVL 40

Expert Comment

by:noci
ID: 36716116
Please do a view page as source in a browser.
First check if the page is using java script or not.

If not then curl can be used straight away, but you may need several steps (request/responses) to continue to use it..

If javascript is used you need to work out what it does and emulate it using bash (or perl etc).
That might complicate things. Afaict there is no js interpreter that is usable for such things.

0
 

Author Comment

by:ccbe
ID: 36717492
Thanks for the help.

It is using javascript.

Clearly I'm a newbie, but isn't there a simple way to avoid this complication by using the cookie? The site has my settings when I return in Chrome or FF. I was thinking that because of the redirect I am sending the cookie at the wrong time?

Would it be:

curl -L -b cookie.txt "http:\\www.site.com"

If not, what happens to the sites request for the cookie after the redirect?

Here is an example  of the setting I want to change:

<span id="UPB"><div class='prefsbox'>
<select name="ctl00$LDDL" onchange="javascript:setTimeout(&#39;__doPostBack(\&#39;ctl00$LDDL\&#39;,\&#39;\&#39;)&#39;, 0)" id="LDDL">
                  <option value="1">English (N. American)</option>
                  <option selected="selected" value="2">Italiano (Italian)</option>
                  <option value="3">Suomi (Finnish)</option>
                  <option value="4">Deutsch (German)</option>
                  <option value="5">Norsk (Norwegian)</option>
                  <option value="6">Portugu&#234;s (Portuguese)</option>
                  <option value="7">Espa&#241;ol (Spanish)</option>
                  <option value="8">Svenska (Swedish)</option>
</select></div>
      </span>


0
 
LVL 40

Expert Comment

by:noci
ID: 36718187
Well there clearly is form around this, that provides the url,
what does settimeout do (most porbably, start a timer and tun the doPostBack).

Then what does doPostBack do?  You realy need to get into the small items... if there is a form post, THAT what's needed to set a variable/cookie.
0
Space-Age Communications Transitions to DevOps

ViaSat, a global provider of satellite and wireless communications, securely connects businesses, governments, and organizations to the Internet. Learn how ViaSat’s Network Solutions Engineer, drove the transition from a traditional network support to a DevOps-centric model.

 
LVL 9

Assisted Solution

by:parparov
parparov earned 100 total points
ID: 36718325
You may want to use Rhino Shell:
https://developer.mozilla.org/en/Rhino_Shell

for scripting the Javascript commands
0
 

Author Comment

by:ccbe
ID: 36719187
nocl: I don't know what any of the pieces do, I didn't put the site together. I am just trying to get some info off of it automatically so I can create a report over time.

parpov: Does this allow me to do what I said above? Again, I am a newbie at this, I looked at the page you gave me, I just don't want to go down another road filled with roadblocks.

Thanks to you both for the help.
0
 
LVL 40

Accepted Solution

by:
noci earned 400 total points
ID: 36719752
When the selection of the language causes the onSelect to be executed.
what is done is described in __doPostBack()
The code for it is in one of the downloaded css sheets, the html file or just referenced before.

@parpov, interesting tool i have to look into that...

checking out.../compiling ... it seems to work.

It might be usable if it can connect to the remote site, at least the actions of the onSelect can be executed.
But the steps still need to be setup in some order like you would click & select through a page when done.
Curl may be used for cookie management & http queries .., but it needs quite some work...
either way.
There is insufficient info to provide more detailed steps.
0

Featured Post

Free Tool: Port Scanner

Check which ports are open to the outside world. Helps make sure that your firewall rules are working as intended.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Computer science students often experience many of the same frustrations when going through their engineering courses. This article presents seven tips I found useful when completing a bachelors and masters degree in computing which I believe may he…
Today, the web development industry is booming, and many people consider it to be their vocation. The question you may be asking yourself is – how do I become a web developer?
In this fifth video of the Xpdf series, we discuss and demonstrate the PDFdetach utility, which is able to list and, more importantly, extract attachments that are embedded in PDF files. It does this via a command line interface, making it suitable …
With the power of JIRA, there's an unlimited number of ways you can customize it, use it and benefit from it. With that in mind, there's bound to be things that I wasn't able to cover in this course. With this summary we'll look at some places to go…

828 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question