sgaggerj
asked on
converting html to a string / page scraping
Hi all,
I'm new to PHP programming so please bear with me.
I'm developing a java app for mobile phones that will take user input, send it to the web and get the response.
the response is formatted html that unfortunately has more page elements than i need and does not display well on the phone.
the page i send data to / from i have no control over.
what i was thinking is that if i wrote my own php script that recieved the request from the user, sent it on to the correct page
recieved the result from that page, stripped all the useless info and returned a simple string so that when the phones app
recieved the string it would be ready to display w/ no processing required.
something similar to page scraping i guess.
my reasons for trying it this way are
1) minimize the overhead on the phone, leaving the processing to the server which i think should be faster
2) minimize the data sent/recieved fom the phone to a minimum
3) to see the difference in the lag time and data transmission size between the current implementation (which recieves the whole html and scrapes it on the phone) and this implementation.
4) minimize the size of the app on the phone.
how do i go about doing this?
the user enters a string and that is transmitted to the site
the response that i need is always after a </form> tag
two elements later will always be either
a) "<p align=\"center\">"
(indicating nothing found)
or
b) "<b>"
(indicating something found)
and the data continues until a </div> is encountered.
the rest is junk
i know how to get the argument passed to the page from the app, but from there i'm kind of lost.
thoughts, suggestions?
any help is greatly appreciated!
TIA!
J
I'm new to PHP programming so please bear with me.
I'm developing a java app for mobile phones that will take user input, send it to the web and get the response.
the response is formatted html that unfortunately has more page elements than i need and does not display well on the phone.
the page i send data to / from i have no control over.
what i was thinking is that if i wrote my own php script that recieved the request from the user, sent it on to the correct page
recieved the result from that page, stripped all the useless info and returned a simple string so that when the phones app
recieved the string it would be ready to display w/ no processing required.
something similar to page scraping i guess.
my reasons for trying it this way are
1) minimize the overhead on the phone, leaving the processing to the server which i think should be faster
2) minimize the data sent/recieved fom the phone to a minimum
3) to see the difference in the lag time and data transmission size between the current implementation (which recieves the whole html and scrapes it on the phone) and this implementation.
4) minimize the size of the app on the phone.
how do i go about doing this?
the user enters a string and that is transmitted to the site
the response that i need is always after a </form> tag
two elements later will always be either
a) "<p align=\"center\">"
(indicating nothing found)
or
b) "<b>"
(indicating something found)
and the data continues until a </div> is encountered.
the rest is junk
i know how to get the argument passed to the page from the app, but from there i'm kind of lost.
thoughts, suggestions?
any help is greatly appreciated!
TIA!
J
ASKER CERTIFIED SOLUTION
membership
Create a free account to see this answer
Signing up is free and takes 30 seconds. No credit card required.
ASKER
thanks Brian - sorry it took me so long to get back to this q.
J