Question

PHP DOM getElementById

Asked by: walker6o9

I'm trying to use PHP DOM's getElementById, and I'm just following the basic tutorial, but it doesn't work.  I just get:
The element whose id is books is:
as my output.

My book.xml code:

 <books>
  TEST BOOK
  </books>


and my php code:

<?php
 
$doc = new DomDocument;
 
// We need to validate our document before referring to the id
$doc->validateOnParse = true;
$doc->Load('book.xml');
 
echo "The element whose id is books is: " . $doc->getElementById('books')->tagName . "\n";
 
?>

                                  

Asked on: 2009-05-26 at 10:41:03 (ID 24438933)
Topic: PHP Scripting Language
Participating experts: 2
Points: 500
Comments: 30


Answers

 

by: Ray_Paseur, posted on 2009-05-26 at 10:55:28, ID: 24475702

Just curious - have you tried using SimpleXML instead?

 

by: walker6o9, posted on 2009-05-26 at 10:57:58, ID: 24475729

Yes, I can do this with SimpleXML, but for what I'm ultimately trying to do, getElementById with PHP DOM will work a lot better.

 

by: walker6o9, posted on 2009-05-26 at 10:59:03, ID: 24475739

I'm trying to do the tutorial here:

http://theserverpages.com/php/manual/en/function.dom-domdocument-getelementbyid.php

but it doesn't say what the code for book.xml should be, and it seems to me like I'm writing it wrong.

 

by: Ray_Paseur, posted on 2009-05-26 at 11:04:14, ID: 24475789

Here is how I would go about this...

<?php // RAY_simplexml_9.php
error_reporting(E_ALL);
echo "<pre>";
 
// TEST DATA FROM THE OP
$xml = '
  <books>
  TEST BOOK
  </books>';
 
// MAKE AN OBJECT
$obj = SimpleXML_Load_String($xml);
 
// VISUALIZE THE DATA IN OBJECT FORM
var_dump($obj);
 
// PROVIDE A WRAPPER SO WE CAN DO THIS THE RIGHT WAY
$valid_xml = '<rezults>' . $xml . '</rezults>';
$valid_obj = SimpleXML_Load_String($valid_xml);
 
// SHOW THE DATA VALUE
echo (string)$valid_obj->books;
                                              

 

by: walker6o9, posted on 2009-05-26 at 11:07:36, ID: 24475830

I appreciate your response, but I already know how to use SimpleXML for that particular purpose.  What I'm trying to do, though, is grab the information in both HTML and XML files with PHP based on the elements' id attributes.

 

by: cxr, posted on 2009-05-26 at 11:10:01, ID: 24475853

First, you must have id attributes in the xml:

<?xml version="1.0" encoding="iso-8859-1"?>
<books>
  <book id="books">TEST BOOK</book>
  <book id="books2">TEST BOOK2</book>
</books>

Second, you need to identify the id attributes. It can be done like this:

$doc = new DomDocument("1.0");
 
// We need to validate our document before referring to the id
$doc->validateOnParse = true;
$doc->Load('book.xml');
 
// define id attributes
foreach($doc->getElementsByTagName('book') as $book)
  $book->setIdAttribute('id',true);
 
echo "The element whose id is books is: " . $doc->getElementById('books')->tagName . "\n";

                                              

 

by: walker6o9, posted on 2009-05-26 at 11:15:27, ID: 24475904

If I wanted to change book.xml to book.html, what would the html file look like?  Or would I need to change the php code?

 

by: walker6o9, posted on 2009-05-26 at 11:17:35, ID: 24475932

CXR - Also, the above example returns
The element whose id is books is: book.

Is there a way to have it return TEST BOOK or TEST BOOK2?

P.S. Thank you.

 

by: Ray_Paseur, posted on 2009-05-26 at 11:30:14, ID: 24476055

>> what the code for book.xml should be

Yeah, I've been wondering about that myself!

 

by: Ray_Paseur, posted on 2009-05-26 at 11:33:01, ID: 24476082

Maybe this will be helpful.

http://us3.php.net/manual/en/class.domdocument.php#91072

Also, you might want getElementsByTagName instead of looking for the ID.

 

by: cxr, posted on 2009-05-26 at 12:57:03, ID: 24476959

>> Is there a way to have it return TEST BOOK or TEST BOOK2

Use the "nodeValue" property:

http://php.net/manual/en/class.domnode.php#domnode.props.nodevalue

echo $doc->getElementById('books')->nodeValue;

                                              

 

by: walker6o9, posted on 2009-05-26 at 14:30:48, ID: 24477915

That's working really well.  I tried using an .html file instead of an .xml file, and it works fine (I'm swapping XML text in for HTML text).

The issue I'm having, though, is that it doesn't seem to work if the element contains links or inner divs, i.e. any interior <> tags.  Is there a way to make it work with <>?

 

by: walker6o9, posted on 2009-05-26 at 15:16:46, ID: 24478242

CXR -

This line appears to be causing some problems:
foreach($doc->getElementsByTagName('book') as $book)
  $book->setIdAttribute('id',true);


But I can't seem to get it to work without that line.  However, the example provided by the php manual does not have it?

<?php
 
$doc = new DomDocument;
 
// We need to validate our document before referring to the id
$doc->validateOnParse = true;
$doc->Load('book.xml');
 
echo "The element whose id is books is: " . $doc->getElementById('books')->tagName . "\n";
 
?> 

                                              

 

by: cxr, posted on 2009-05-26 at 15:32:57, ID: 24478379

The example in the manual does not work for me either. This excerpt from the manual gives a hint, though:

" For this function to work, you will need either to set some ID attributes with DOMElement::setIdAttribute  or a DTD which
  defines an attribute to be of type ID. In the later case, you will need to validate your document with
  DOMDocument::validate or DOMDocument->validateOnParse before using this function. "

http://php.net/manual/en/domdocument.getelementbyid.php

The book.xml used in the manual probably has a DTD, which is why validateOnParse is set to true in the example. Without a DTD, you need to use setIdAttribute() if you are going to use getElementById().
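
For readers following along, here is a minimal sketch of the DTD route described in the manual excerpt. The manual's actual book.xml is not shown in this thread, so the inline document and its DTD below are assumptions made up for illustration; the only point is that an attribute declared as type ID becomes visible to getElementById() without any call to setIdAttribute().

```php
<?php
// Hypothetical document: the internal DTD declares "id" on <book> as type ID.
$xml = <<<'XML'
<?xml version="1.0"?>
<!DOCTYPE books [
  <!ELEMENT books (book+)>
  <!ELEMENT book (#PCDATA)>
  <!ATTLIST book id ID #REQUIRED>
]>
<books>
  <book id="books">TEST BOOK</book>
  <book id="books2">TEST BOOK2</book>
</books>
XML;

$doc = new DOMDocument;
// Validate against the DTD while parsing, as the manual example does.
$doc->validateOnParse = true;
$doc->loadXML($xml);

// No setIdAttribute() call is needed here because the DTD defines the ID.
$book = $doc->getElementById('books');
echo $book->tagName . ': ' . $book->nodeValue . "\n";
```

With the document above this should print "book: TEST BOOK". If the DTD lives in a separate file, the document would reference it from its DOCTYPE instead of declaring it inline.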

 

by: walker6o9, posted on 2009-05-26 at 15:34:37, ID: 24478394

So, basically, this won't work for parsing html then?

 

by: cxr, posted on 2009-05-26 at 15:50:13, ID: 24478495

Depends on the html... it should work for valid xhtml (which also is xml), but will probably not work well with malformed "quirks mode" html, which is what you most often will find on the net.

 

by: walker6o9, posted on 2009-05-26 at 16:06:36, ID: 24478609

Can you show me an example of this grabbing an element by id from an XHTML file?

 

by: cxr, posted on 2009-05-26 at 18:53:14, ID: 24479406

The challenge is to find a valid xhtml page... ;)

Try this:

$doc = new DomDocument;
$doc->resolveExternals = true;
$doc->Load('http://www.w3.org/');
echo "The element whose id is slogan is: " . $doc->getElementById('slogan')->tagName . "<br />\n";
var_dump($doc->getElementById('slogan')->nodeValue);
                                              

 

by: walker6o9, posted on 2009-05-27 at 11:11:33, ID: 24485896

That worked fine.  Then I tried to copy and paste the source code from www.w3.org into a file called "test.html", and put it in the same folder as my php code, and I got:
The element whose id is slogan is:
NULL


<?php
 
$doc = new DomDocument;
$doc->resolveExternals = true;
$doc->Load('test.html');
echo "The element whose id is slogan is: " . $doc->getElementById('slogan')->tagName . "<br />\n";
var_dump($doc->getElementById('slogan')->nodeValue);
 
?>

                                              

 

by: walker6o9, posted on 2009-05-27 at 11:50:03, ID: 24486196

Ignore my last comment, I made a mistake.

This seems to work really well, but it only returns text.  For example, the id logo returns blank.  Is there a way to get it to return

<img alt="The World Wide Web Consortium (W3C)" height="48" width="315" src="/Icons/w3c_main" />

 

by: walker6o9, posted on 2009-05-27 at 11:52:50, ID: 24486236

Or include links, for example, rather than just straight text.

 

by: cxr, posted on 2009-05-27 at 12:03:05, ID: 24486338

The element with id=logo is an h1 element, and it contains an img element.

To get the child element of the h1 element, you can use the firstChild property. To extract an element as a string, you can use the saveXML() method of the DOMDocument object. To see the string in an HTML context, you must use htmlentities().

Try this:

$doc = new DomDocument;
$doc->resolveExternals = true;
$doc->Load('http://www.w3.org/');
echo "The element whose id is logo is: " . $doc->getElementById('logo')->tagName . "<br />\n";
$logo = $doc->getElementById('logo');
$img = $doc->saveXML($logo->firstChild);
echo htmlentities($img);
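
The snippet above depends on the live w3.org page, whose markup may differ from what is described here. As a self-contained sketch of the same firstChild and saveXML() steps, the version below uses a hypothetical inline fragment standing in for that page; since the fragment has no DTD, the id attribute is marked by hand with setIdAttribute().

```php
<?php
// Hypothetical stand-in for the w3.org markup discussed above.
$xml = '<div><h1 id="logo"><img alt="The World Wide Web Consortium (W3C)" '
     . 'height="48" width="315" src="/Icons/w3c_main" /></h1></div>';

$doc = new DOMDocument;
$doc->loadXML($xml);

// No DTD in this fragment, so register the id attribute manually.
foreach ($doc->getElementsByTagName('h1') as $h1) {
    $h1->setIdAttribute('id', true);
}

$logo = $doc->getElementById('logo');
// Serialize only the first child node of the h1 (the img element).
$img = $doc->saveXML($logo->firstChild);
// Escape the markup so it shows as text when viewed in a browser.
echo htmlentities($img), "\n";
```

Note that firstChild is the img element only because there is no whitespace between the h1 tag and the img tag; with indentation in between, the first child would be a text node.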
                                              

 

by: cxr, posted on 2009-05-27 at 12:05:57, ID: 24486364

>> Or include links, for example, rather than just straight text.

What do you mean?

 

by: walker6o9, posted on 2009-05-27 at 12:13:17, ID: 24486423

If you had an h1 tag that looked like this:

<h1 id="logo">This is a <a href="link.html">link</a> test</h1>

And you wanted to grab

This is a <a href="link.html">link</a> test

 

by: cxr, posted on 2009-05-27 at 13:12:08, ID: 24486999

That h1 element contains multiple child nodes: a text node with value "This is a ", an anchor element containing another text node with value "link", and finally a text node with value " test".

If you wanted to include the h1 tags along with the content in your output, it would be easier; you could just do:

$s = $doc->saveXML($logo);

To get just the content of the h1 element, you must loop and output each of the child nodes:

$str = '<h1 id="logo">This is a <a href="link.html">link</a> test</h1>';
$doc = new DomDocument;
$doc->LoadXML($str);
foreach($doc->getElementsByTagName('h1') as $h1)
  $h1->setIdAttribute('id',true);
echo "The element whose id is logo is: " . $doc->getElementById('logo')->tagName . "<br />\n";
$logo = $doc->getElementById('logo');
foreach($logo->childNodes as $o) {
  $s = $doc->saveXML($o);
  echo htmlentities($s);
}

                                              

 

by: walker6o9, posted on 2009-05-27 at 14:57:23, ID: 24487975

That worked well.  I tried to incorporate it into a search and replace, where the header is replaced by text from an .xml file.  This works really well, EXCEPT if there is a link.

So this

<h1 id="logo">This is a link test</h1>

Will get changed, but this won't

<h1 id="logo">This is a <a href="link.html">link</a> test</h1>

Even though they both output with the echo statement fine.

<?php
 
$docx = new DomDocument("1.0");
 
// We need to validate our document before referring to the id
$docx->validateOnParse = true;
$docx->Load('book.xml');
 
// define id attributes
foreach($docx->getElementsByTagName('div') as $bookx)
  $bookx->setIdAttribute('id',true);
 
$desired_content = $docx->getElementById('books')->nodeValue;
 
$outputfile = 'output.html';
$url = 'test.html';
$content = file_get_contents($url);
$between = '';
 
$doc = new DomDocument;
$doc->resolveExternals = true;
$doc->Load('test.html');
$logo = $doc->getElementById('logo');
foreach($logo->childNodes as $o) {
  $s = $doc->saveXML($o);
  $between = $between.$s; 
}
$output = str_replace($between, $desired_content, $content);
echo $between;
$fh = fopen($outputfile, 'w') or die("Can't open the output file");
fwrite($fh, $output);
fclose($fh);
?>
                                              

 

by: walker6o9, posted on 2009-05-27 at 15:01:27, ID: 24488003

Typo on line 28 (the str_replace call).  The code should have read like this:

<?php
$docx = new DomDocument("1.0");
 
// We need to validate our document before referring to the id
$docx->validateOnParse = true;
$docx->Load('book.xml');
 
// define id attributes
foreach($docx->getElementsByTagName('div') as $bookx)
  $bookx->setIdAttribute('id',true);
 
 $desired_content = $docx->getElementById('books')->nodeValue;
 
$outputfile = 'output.html';
$url = 'test.html';
$content = file_get_contents($url);
$between = '';
 
$doc = new DomDocument;
$doc->resolveExternals = true;
$doc->Load('test.html');
$logo = $doc->getElementById('logo');
foreach($logo->childNodes as $o) {
  $s = $doc->saveXML($o);
  $between = $between.$s;
}
$output = str_replace($desired_content, $between, $content);
echo $between;
$fh = fopen($outputfile, 'w') or die("Can't open the output file");
fwrite($fh, $output);
fclose($fh);
?>
                                              

 

by: cxr, posted on 2009-05-27 at 16:10:46, ID: 24488500

It seems to me the first version was correct... the syntax for str_replace is:

str_replace  ( mixed $search  , mixed $replace  , mixed $subject  [, int &$count  ] )

The string to search for comes first, and the string to insert comes second.

If you can't figure it out, please open a new question. This question was about using getElementById(). Thank you! :)
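
The argument order described above can be checked with a small sketch. The page fragment and replacement text are hypothetical values chosen for illustration; the variable names follow the ones used earlier in the thread. The optional fourth argument reports how many replacements were made.

```php
<?php
// Hypothetical page fragment and replacement text.
$content = '<h1 id="logo">This is a link test</h1>';
$between = 'This is a link test';   // text currently in the page (search)
$desired_content = 'TEST BOOK';     // text to put in its place (replace)

// str_replace(search, replace, subject, count)
$output = str_replace($between, $desired_content, $content, $count);

echo $output, "\n";
echo "Replacements made: $count\n";
```

If $count comes back as 0, the search string does not occur verbatim in the subject, which is worth checking whenever the search string was produced by serializing DOM nodes rather than copied from the file.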

 

by: cxr, posted on 2009-05-27 at 16:13:02, ID: 24488515

Btw, I said earlier that malformed html would not work well, but I forgot about this method:

http://php.net/manual/en/domdocument.loadhtmlfile.php
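
The method linked above reads from a file; loadHTML() is its string counterpart. Below is a rough sketch of how it copes with sloppy markup, using a hypothetical fragment with unclosed tags. As far as I know the HTML parser treats id attributes as IDs on its own, so setIdAttribute() should not be needed, but the sketch falls back to an XPath lookup in case getElementById() finds nothing.

```php
<?php
// Hypothetical HTML with unclosed tags, which the XML loader would reject.
$html = '<html><body>'
      . '<h1 id="logo">This is a <a href="link.html">link</a> test</h1>'
      . '<p>An unclosed paragraph<br>'
      . '</body></html>';

// Collect parser warnings internally instead of printing them.
libxml_use_internal_errors(true);

$doc = new DOMDocument;
$doc->loadHTML($html);
libxml_clear_errors();

$logo = $doc->getElementById('logo');
if ($logo === null) {
    // Fallback (an assumption, not part of the thread): find it by attribute.
    $xpath = new DOMXPath($doc);
    $logo = $xpath->query('//*[@id="logo"]')->item(0);
}

echo $logo->tagName . ': ' . $logo->nodeValue . "\n";
```

For a file on disk, loadHTMLFile('test.html') would take the place of the loadHTML() call.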

 

by: walker6o9, posted on 2009-05-27 at 16:30:44, ID: 31585412

Sorry, didn't realize we'd drifted into a new topic.  Thanks for your help, I appreciate it.
