creating preg_match using xpath in php

試著忘記壹切 submitted on 2019-12-23 23:56:10


I am trying to get the contents of the div below using XPath in PHP.

<div class='post-body entry-content' id='post-body-37'>
<div style="text-align: left;">
<div style="text-align: center;">

I am using the PHP code below to get the output.

 $dom = new DOMDocument;
 $xpath = new DOMXPath($dom);
 $regex = 'post-(content|[a-z]+)';
 $items = $xpath->query("div[ php:functionString('preg_match', '$regex', @class) > 0]");

It returns the output below:

DOMNodeList {#580 
+length: 0 


For a simple task like this (getting the div nodes whose class attribute starts with post- and contains content), you should use a plain XPath query:

$xp->query('//div[starts-with(@class,"post-") and contains(@class, "content")]');

Here,
- //div - get all divs that...
- starts-with(@class,"post-") - have a "class" attribute starting with "post-"
- and - and...
- contains(@class, "content") - contain the "content" substring in the class attribute value.
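
The query above can be tried end to end with a small sketch like the following. It assumes markup modelled on the question, with closing tags, text, and a second div added for illustration; none of those additions come from the original post.

```php
<?php
// Sample markup modelled on the question; the closing tags, the text
// and the 'sidebar' div are assumptions added for illustration.
$html = <<<HTML
<div class='post-body entry-content' id='post-body-37'>
  <div style="text-align: left;">Hello</div>
</div>
<div class='sidebar'>Other</div>
HTML;

$dom = new DOMDocument;
$dom->loadHTML($html);

$xp = new DOMXPath($dom);
// Plain XPath 1.0 string functions: no namespace or PHP function registration needed.
$divs = $xp->query('//div[starts-with(@class,"post-") and contains(@class, "content")]');

foreach ($divs as $div) {
    echo $div->getAttribute('id'), PHP_EOL;
}
```

Only the div whose class value begins with "post-" should be selected; the nested div has no class attribute, so both string tests fail for it.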

To use php:functionString, you need to register the php namespace (with $xp->registerNamespace("php", "http://php.net/xpath");) and the PHP functions (to register them all, use $xp->registerPHPFunctions();).

For complex scenarios, when you need to analyze the values more deeply, you may want to create and register your own functions:

function example($attr) {
    return preg_match('/post-(content|[a-z]+)/i', $attr) > 0;
}

and then inside XPath:

$divs = $xp->query("//div[php:functionString('example', @class)]");

Here, functionString passes the string value of the @class attribute to the example function, not the node objects (as would be the case with php:function).
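
To make the difference concrete, here is a hedged sketch that runs the same check through both entry points. The helper names class_from_string and class_from_nodes and the sample markup are hypothetical, chosen only for illustration; the array-of-nodes behaviour of php:function is as described in the PHP manual.

```php
<?php
// Hypothetical helpers for illustration; neither name comes from the original post.
function class_from_string($attr) {
    // php:functionString hands over the attribute's string value.
    return preg_match('/post-(content|[a-z]+)/i', $attr) > 0;
}

function class_from_nodes($nodes) {
    // php:function hands over an array of DOM nodes (DOMAttr here),
    // which is empty when the element has no class attribute.
    foreach ($nodes as $node) {
        if (preg_match('/post-(content|[a-z]+)/i', $node->nodeValue) > 0) {
            return true;
        }
    }
    return false;
}

$dom = new DOMDocument;
// Sample markup (an assumption): one matching div, one div without a class.
$dom->loadHTML("<div class='post-body entry-content' id='post-body-37'>x</div><div id='plain'>y</div>");

$xp = new DOMXPath($dom);
$xp->registerNamespace("php", "http://php.net/xpath");
// Restrict the callable functions to the two helpers.
$xp->registerPHPFunctions(['class_from_string', 'class_from_nodes']);

$byString = $xp->query("//div[php:functionString('class_from_string', @class)]");
$byNodes  = $xp->query("//div[php:function('class_from_nodes', @class)]");

echo $byString->length, ' ', $byNodes->length, PHP_EOL;
```

Both queries should select the same single div; functionString is usually the simpler choice when only the text of the attribute matters.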

See IDEONE demo:

function example($attr) {
    return preg_match('/post-(content|[a-z]+)/i', $attr) > 0;
}

$html = <<<HTML
<div class='post-body entry-content' id='post-body-37'>
<div style="text-align: left;">
<div style="text-align: center;">
HTML;

$dom = new DOMDocument;
// parse the sample markup
$dom->loadHTML($html);

$xp = new DOMXPath($dom);
$xp->registerNamespace("php", "http://php.net/xpath");
// allow the custom function to be called from XPath
$xp->registerPHPFunctions('example');
$divs = $xp->query("//div[php:functionString('example', @class)]");
foreach ($divs as $div) {
    echo $div->nodeValue;
}

See also a nice article about using PHP functions inside XPath: Using PHP Functions in XPath Expressions.


Here is a working version that applies the advice you got in the comments:


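$dom = new DOMDocument;
// load the markup to search; $html is assumed to hold the HTML from the question
$dom->loadHTML($html);

$xpath = new DOMXPath($dom);
// you need to register the namespace "php" to make it available in the query
$xpath->registerNamespace("php", "http://php.net/xpath");
// and allow preg_match to be called from the query
$xpath->registerPHPFunctions('preg_match');

// add delimiters to your pattern
$regex = '~post-(content|[a-z]+)~';

// search your node anywhere in the DOM tree with "//"
$items = $xpath->query("//div[php:functionString('preg_match', '$regex', @class)>0]");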


Obviously, a regex is unnecessary for this kind of pattern, since you can get the same result with the built-in XPath string functions such as contains.

