phone numebrs and emails

Hello experts,

If I ahve a way to create a web crwaler in java to coillect and browse thru urls beginning at a particular page,, is there a way using regualr expression to find out contact information such as emails phones zip codes address from contacus pages of the websites that we are crawling?

Any lnks or tutorials would be much appreciated.

Thanks
A001
anup001Asked:
Who is Participating?

[Product update] Infrastructure Analysis Tool is now available with Business Accounts.Learn More

x
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

radarshCommented:
Try http://www.regular-expressions.info

There is an example on an Email Validation RegEx. It might be helpful. Now, phone numbers,
it depends on the format you are using... If you post the format, I can help you with the
RegEx.

________
radarsh

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
anup001Author Commented:
i think we will use the regular format XXX-XXX-XXXX or (xxx)-XXX-xxxx

I know the use of regualr expressions but i meant to ask what would be the exact way of extracting such information from a crawler like thing.

Thanks
A001
ksivananthCommented:
for the ph. no. the pattern will be as follow as,

"(*[0-9][0-9][0-9])*-[0-9][0-9][0-9]-[0-9][0-9][0-9][0-9]". May be sombody write a simpler one :)
WebstormCommented:
phone numbers
   "([0-9]{3}|\\([0-9]{3}\\))-[0-9]{3}-[0-9]{4}"
if the number of digits can vary :
   "([0-9]+|\\([0-9]+\\))-[0-9]+-[0-9]+"

for email addresses :
   "[A-Za-z_.0-9]+@[A-Za-z_.0-9]+(\\.[A-Za-z_.0-9]+)+"

It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Java

From novice to tech pro — start learning today.