Solved

Bug in processing XPath with this simple regular expression?

Posted on 2013-06-16
Last Modified: 2013-06-24
Take this input:
<div id="ps-content">foobar</div>

and apply this XPath 2.0+ expression:
//div[matches(@id,"^s-content$")]

I tried this with BaseX and it matched, despite the fact that "^" in regex syntax means "the start of the string".

Why does it match?
Question by:lucavilla
 
LVL 60

Expert Comment

by:Geert Bormans
ID: 39251883
I can't answer why it matches

I can confirm it should not match
and using Saxon, it rightfully does not match

I suggest you report it to BaseX as a bug to look into, because a bug it is
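
For reference, the expected behaviour can be reproduced outside any XPath engine. The sketch below uses Python's `re` module rather than the XPath `matches()` function, on the assumption that both treat `^` and `$` the same way for this single-line pattern. It shows that the anchored pattern rejects "ps-content", while the same pattern without the leading "^" accepts it.

```python
import re

value = "ps-content"  # the @id value from the question

# Anchored at both ends: the whole string must be exactly "s-content".
anchored = re.search(r"^s-content$", value)

# Without the leading "^", the pattern only has to match at the end.
unanchored = re.search(r"s-content$", value)

print(anchored is not None)    # False
print(unanchored is not None)  # True
```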
 

Author Comment

by:lucavilla
ID: 39252297
Thanks for testing it with Saxon!

I have now also tested it with Xidel, and it correctly doesn't match, while it does match if I add the "p".

Do you know where the right place to report the bug is?
 
LVL 60

Expert Comment

by:Geert Bormans
ID: 39252303
I did not find a bug tracker on the site.
(I was at an XML conference in London this weekend and was actually surprised not to meet any of the BaseX guys there; I could have asked.)
Anyway, you could try the BaseX mailing list, basex-talk:
basex-talk(at)mailman.uni-konstanz(dot)de

 

Author Comment

by:lucavilla
ID: 39254450
Done, I reported the bug to that email (mailing list): https://mailman.uni-konstanz.de/pipermail/basex-talk/2013-June/005241.html

Let's see what they say...

PS: wow, an XML conference, interesting!
 
LVL 60

Expert Comment

by:Geert Bormans
ID: 39254526
yeah, well actually it was pretty interesting

http://xmllondon.com/
a whole weekend, me and the other nerds :-)
 

Author Comment

by:lucavilla
ID: 39254574
I read the program at http://xmllondon.com/programme.jsp
Really interesting!
Too bad that I'm so far away, being in Italy...
 

Accepted Solution

by:
lucavilla earned 0 total points
ID: 39268339
Solved.
The cause turned out to be related to the Windows 7 command line.
If I put the XPath in a file, it works correctly!
 

Author Comment

by:lucavilla
ID: 39269026
I've requested that this question be closed as follows:

Accepted answer: 0 points for lucavilla's comment #a39268339

for the following reason:

Solution found by experimenting
 
LVL 60

Expert Comment

by:Geert Bormans
ID: 39269027
If you want to accept your own answer (a bit inappropriate here, actually), make sure it contains a solution. 'Win 7 command-line related' is too vague.

Note that I confirmed it is a bug and that it should not return values; that on its own is an acceptable reassurance.

What did BaseX reply? What exactly is the command-line-related issue? How can your accepted answer help others who run into the same issue?
If you put the XPath in a file and it works, it could still be a BaseX issue, with them doing the wrong thing with Windows command lines. They should still look into it. I can imagine that they parse the command line the wrong way.
 

Assisted Solution

by:lucavilla
lucavilla earned 0 total points
ID: 39269439
According to Microsoft's documentation on the Windows NT command shell:
" all reserved shell characters not in double quotes must be escaped. These characters have special meaning to the Windows NT command shell. The reserved shell characters are:
& | ( ) < > ^
To pass reserved shell characters as part of an argument for a command, either the entire argument must be enclosed in double quotes, or the reserved character must be escaped. Prefix a reserved character with a carat (^) character to escape it."

On the command line I tried, the shell most likely consumed the "^" in the regular expression, treating it as an escape for the following character, so the regex engine never saw it. This is why it always matched, no matter where I put that "^".
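
The effect can be illustrated with a deliberately simplified model of the shell's caret handling. The function `strip_cmd_carets` and the example command string are hypothetical (the exact command line used is not shown in this thread), and real cmd.exe parsing has further phases that are not modeled here. The sketch only assumes the rule quoted above: outside double quotes a "^" escapes the next character and is dropped, inside double quotes it is kept.

```python
def strip_cmd_carets(command_line: str) -> str:
    """Simplified model of caret handling in the Windows command shell.

    Outside double quotes, "^" escapes the next character and is itself
    dropped; inside double quotes it is kept literally. Variable expansion,
    redirection and other parsing phases are not modeled.
    """
    out = []
    in_quotes = False
    i = 0
    while i < len(command_line):
        ch = command_line[i]
        if ch == '"':
            # Each unescaped double quote toggles the quoted state.
            in_quotes = not in_quotes
            out.append(ch)
        elif ch == "^" and not in_quotes:
            # Drop the caret and keep the escaped character, if any.
            i += 1
            if i < len(command_line):
                out.append(command_line[i])
        else:
            out.append(ch)
        i += 1
    return "".join(out)


# Hypothetical invocation: the inner quote before "^" closes the outer
# quoted region, so the caret ends up outside double quotes.
print(strip_cmd_carets('basex "//div[matches(@id,"^s-content$")]"'))
# basex "//div[matches(@id,"s-content$")]"
```

Under this model, doubling the caret ("^^s-content$") leaves a single "^" in the argument, since the first caret escapes the second.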
 
LVL 60

Expert Comment

by:Geert Bormans
ID: 39269667
Did you try using '^^' instead of '^'? That should work, if I read this correctly.
 

Author Comment

by:lucavilla
ID: 39269717
Yes, and it worked, but it was becoming too difficult to account for all the special characters in an XPath expression and escape them automatically.
Given that I could also hit the command-line length limit, I preferred to avoid the problem altogether :)
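
A minimal sketch of the file-based workaround, assuming the query is generated by a script: the XPath is written to a file unchanged, so no shell ever parses its special characters and only the file path has to appear on the command line. The helper name `write_query_file` and the file name `query.xq` are illustrative choices, not something taken from this thread or from BaseX.

```python
import tempfile
from pathlib import Path


def write_query_file(xpath: str, directory: str) -> Path:
    """Write the XPath expression to a file without any escaping."""
    path = Path(directory) / "query.xq"  # illustrative file name
    path.write_text(xpath, encoding="utf-8")
    return path


xpath = '//div[matches(@id,"^s-content$")]'
with tempfile.TemporaryDirectory() as tmp:
    query_file = write_query_file(xpath, tmp)
    # The caret survives because the shell never sees the expression itself.
    print(query_file.read_text(encoding="utf-8") == xpath)  # True
```

This also sidesteps the command-line length limit, since the length of the expression no longer affects the length of the command.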
