Solved

PHP Preg Match All

Posted on 2009-07-05
5
524 Views
Last Modified: 2012-05-07
I am trying to match UL's in an HTML file via PHP. I want to match #firstUL and get all of the LI's out of only that ID. Same thing for #secondLI.

preg_match_all("/id=\"firstUL(.+)\<\/div>/", $file, $match1);
preg_match_all("/<li><a href=\"(.*)\">(.*)<\/a><\/li>/", $match1[0][0], $match2);

The outcome I am looking for is:

[0] -> [0] = '/index.php', [1] = 'Example'
[1] -> [0] = '/index.php', [1] = 'Example'
[2] -> [0] = '/index.php', [1] = 'Example'
<div id="firstUL">

	<div class="someclass">

		<ul>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

		</ul>

		<ul>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

		</ul>

	</div>

</div>

<div id="secondUL">

	<div class="someclass">

		<ul>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

		</ul>

		<ul>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

			<li><a href="/index.php">Example</a></li>

		</ul>

	</div>

</div>

Open in new window

0
Comment
Question by:jwzk
  • 3
  • 2
5 Comments
 
LVL 39

Expert Comment

by:Roger Baklund
Comment Utility
You need the 's' pattern modifer to match newlines:

preg_match_all("/id=\"firstUL(.+)\<\/div>/s", $file, $match1);
0
 
LVL 39

Accepted Solution

by:
Roger Baklund earned 500 total points
Comment Utility
For the result array you wanted, try this:
preg_match_all("/id=\"firstUL(.+)\<\/div>/s", $file, $match1);

preg_match_all("/<li><a href=\"(.*)\">(.*)<\/a><\/li>/", $match1[0][0], $match2);

$result = array();

for($i=0;$i<count($match2[1]);$i++)

  $result[]=array($match2[1][$i],$match2[2][$i]);

echo '<pre>';

var_dump($result);

echo '</pre>';

Open in new window

0
 

Author Comment

by:jwzk
Comment Utility
Perfect, the only problem is the first preg_match_all is still getting both UL's.

This works, but is it the best method? I would prefer to not have to set the second one incase it changes.

preg_match_all("/id=\"firstUL(.+)\id=\"secondUL/s", $fileData, $match1);
0
 
LVL 39

Expert Comment

by:Roger Baklund
Comment Utility
Use the 'U' (ungreedy) pattern modifier:

preg_match_all("/id=\"firstUL(.+)\<\/div>/sU", $file, $match1);
0
 

Author Closing Comment

by:jwzk
Comment Utility
Perfect + the U modifier, thanks!
0

Featured Post

How to run any project with ease

Manage projects of all sizes how you want. Great for personal to-do lists, project milestones, team priorities and launch plans.
- Combine task lists, docs, spreadsheets, and chat in one
- View and edit from mobile/offline
- Cut down on emails

Join & Write a Comment

Entering a date in Microsoft Access can be tricky. A typo can cause month and day to be shuffled, entering the day only causes an error, as does entering, say, day 31 in June. This article shows how an inputmask supported by code can help the user a…
This article discusses how to create an extensible mechanism for linked drop downs.
Learn how to match and substitute tagged data using PHP regular expressions. Demonstrated on Windows 7, but also applies to other operating systems. Demonstrated technique applies to PHP (all versions) and Firefox, but very similar techniques will w…
In this fourth video of the Xpdf series, we discuss and demonstrate the PDFinfo utility, which retrieves the contents of a PDF's Info Dictionary, as well as some other information, including the page count. We show how to isolate the page count in a…

744 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

9 Experts available now in Live!

Get 1:1 Help Now