We help IT Professionals succeed at work.

Getting links from a page

xgomes
xgomes asked
on
183 Views
Last Modified: 2008-02-20


I am looking for a regular expression which will collect all the links(url) and anchor text in a page or URL. The data will then be stored in variable.

So for an url like http://www.yahoo.com/
It will collect all the links in the page along with the respective anchor text.

Comment
Watch Question

Commented:
The following should work nicely:

/\<a [^>]*href=\"([^\"]*)[^>]*\>(.*)\<\/a\>/Ui

The first match is the url and the second the anchor text.

Author

Commented:
This might be dumb question, but how do I get them in an array.

The array must have all the links in the page.

Thanks
Commented:
This one is on us!
(Get your first solution completely free - no credit card required)
UNLOCK SOLUTION

Gain unlimited access to on-demand training courses with an Experts Exchange subscription.

Get Access
Why Experts Exchange?

Experts Exchange always has the answer, or at the least points me in the correct direction! It is like having another employee that is extremely experienced.

Jim Murphy
Programmer at Smart IT Solutions

When asked, what has been your best career decision?

Deciding to stick with EE.

Mohamed Asif
Technical Department Head

Being involved with EE helped me to grow personally and professionally.

Carl Webster
CTP, Sr Infrastructure Consultant
Empower Your Career
Did You Know?

We've partnered with two important charities to provide clean water and computer science education to those who need it most. READ MORE

Ask ANY Question

Connect with Certified Experts to gain insight and support on specific technology challenges including:

  • Troubleshooting
  • Research
  • Professional Opinions
Unlock the solution to this question.
Join our community and discover your potential

Experts Exchange is the only place where you can interact directly with leading experts in the technology field. Become a member today and access the collective knowledge of thousands of technology experts.

*This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

OR

Please enter a first name

Please enter a last name

8+ characters (letters, numbers, and a symbol)

By clicking, you agree to the Terms of Use and Privacy Policy.