正文

用PHP5轻松解析XML2007-09-20 13:58:00

【评论】 【打印】 【字体: 】 本文链接:http://blog.pfan.cn/tiay/29549.html

分享到:

用 sax 方式的时候,要自己构建3个函数,而且要直接用这三的函数来返回数据,要求较强的逻辑。在处理不同结构的 xml 的时候,还要重新进行构造这三个函数,麻烦! 用 dom 方式,倒是好些,但是他把每个节点都看作是一个 node,,操作起来要写好多的代码,麻烦! 网上有好多的开源的 xml 解析的类库,以前看过几个,但是心里总是觉得不踏实,感觉总是跟在别人的屁股后面。 这几天在搞 Java,挺累的,所以决定换换脑袋,写点 PHP 代码,为了防止以后 XML 解析过程再令我犯难,就花了一天的时间写了下面一个 XML 解析的类,于是就有了下面的东西。 实现方式是通过包装“sax方式的解析结果”来实现的。总的来说,对于我个人来说挺实用的,性能也还可以,基本上可以完成大多数的处理要求。 功能: 1\ 对基本的 XML 文件的节点进行 查询 / 添加 / 修改 / 删除 工作。 2\ 导出 XML 文件的所有数据到一个数组里面。 3\ 整个设计采用了 OO 方式,在操作结果集的时候,使用方法类似于 dom 缺点: 1\ 每个节点最好都带有一个id(看后面的例子),每个“节点名字”=“节点的标签_节点的id”,如果这个 id 值没有设置,程序将自动给他产生一个 id,这个 id 就是这个节点在他的上级节点中的位置编号,从 0 开始。 2\ 查询某个节点的时候可以通过用“|”符号连接“节点名字”来进行。这些“节点名字”都是按顺序写好的上级节点的名字。 使用说明: 运行下面的例子,在执行结果页面上可以看到函数的使用说明 代码是通过 PHP5 来实现的,在 PHP4 中无法正常运行。 由于刚刚写完,所以没有整理文档,下面的例子演示的只是一部分的功能,代码不是很难,要是想知道更多的功能,可以研究研究源代码。 目录结构: test.php test.xml xml / SimpleDocumentBase.php xml / SimpleDocumentNode.php xml / SimpleDocumentRoot.php xml / SimpleDocumentParser.php 文件:test.xml <?xml version="1.0" encoding="GB2312"?> <shop> <name>华联</name> <address>北京长安街-9999号</address> <desc>连锁超市</desc> <cat id="food">   <goods id="food11">    <name>food11</name>    <price>12.90</price>   </goods>   <goods id="food12">    <name>food12</name>    <price>22.10</price>    <desc creator="hahawen">好东西推荐</desc>   </goods> </cat> <cat>   <goods id="tel21">    <name>tel21</name>    <price>1290</price>   </goods> </cat> <cat id="coat">   <goods id="coat31">    <name>coat31</name>    <price>112</price>   </goods>   <goods id="coat32">    <name>coat32</name>    <price>45</price>   </goods> </cat> <special id="hot">   <goods>    <name>hot41</name>    <price>99</price>   </goods> </special> </shop> 文件:test.php <?php require_once "xml/SimpleDocumentParser.php"; require_once "xml/SimpleDocumentBase.php"; require_once "xml/SimpleDocumentRoot.php"; require_once "xml/SimpleDocumentNode.php";$test = new SimpleDocumentParser(); $test->parse("test.xml"); $dom = $test->getSimpleDocument();echo "<pre>";echo "<hr><font color=red>"; echo "下面是通过函数getSaveData()返回的整个xml数据的数组"; echo "</font><hr>"; print_r($dom->getSaveData());echo "<hr><font color=red>"; echo "下面是通过setValue()函数,给给根节点添加信息,添加后显示出结果xml文件的内容"; echo "</font><hr>"; $dom->setValue("telphone", "123456789"); echo htmlspecialchars($dom->getSaveXml());echo "<hr><font color=red>"; echo "下面是通过getNode()函数,返回某一个分类下的所有商品的信息"; echo "</font><hr>"; $obj = $dom->getNode("cat_food"); $nodeList = $obj->getNode(); foreach($nodeList as $node){     $data = $node->getValue();     echo "<font color=red>商品名:".$data[name]."</font><br>";     print_R($data);     print_R($node->getAttribute()); }echo "<hr><font color=red>"; echo "下面是通过findNodeByPath()函数,返回某一商品的信息"; echo "</font><hr>"; $obj = $dom->findNodeByPath("cat_food|goods_food11"); if(!is_object($obj)){     echo "该商品不存在"; }else{     $data = $obj->getValue();     echo "<font color=red>商品名:".$data[name]."</font><br>";     print_R($data);     print_R($obj->getAttribute()); }echo "<hr><font color=red>"; echo "下面是通过setValue()函数,给商品\"food11\"添加属性, 然后显示添加后的结果"; echo "</font><hr>"; $obj = $dom->findNodeByPath("cat_food|goods_food11"); $obj->setValue("leaveword", array("value"=>"这个商品不错", "attrs"=>array("author"=>"hahawen", "date"=>date('Y-m-d')))); echo htmlspecialchars($dom->getSaveXml());echo "<hr><font color=red>"; echo "下面是通过removeValue()/removeAttribute()函数,给商品\"food11\"改变和删除属性, 然后显示操作后的结果"; echo "</font><hr>"; $obj = $dom->findNodeByPath("cat_food|goods_food12"); $obj->setValue("name", "new food12"); $obj->removeValue("desc"); echo htmlspecialchars($dom->getSaveXml());echo "<hr><font color=red>"; echo "下面是通过createNode()函数,添加商品, 然后显示添加后的结果"; echo "</font><hr>"; $obj = $dom->findNodeByPath("cat_food"); $newObj = $obj->createNode("goods", array("id"=>"food13")); $newObj->setValue("name", "food13"); $newObj->setValue("price", 100); echo htmlspecialchars($dom->getSaveXml());echo "<hr><font color=red>"; echo "下面是通过removeNode()函数,删除商品, 然后显示删除后的结果"; echo "</font><hr>"; $obj = $dom->findNodeByPath("cat_food"); $obj->removeNode("goods_food12"); echo htmlspecialchars($dom->getSaveXml()); ?> 文件:SimpleDocumentParser.php <?php /** *================================================* * @author     hahawen(大龄青年)  * @since      2004-12-04 * @copyright  Copyright (c) 2004, NxCoder Group * *================================================*/ /** * class SimpleDocumentParser * use SAX parse xml file, and build SimpleDocumentObject * all this pachage's is work for xml file, and method is action as DOM. * * @package SmartWeb.common.xml * @version 1.0 */ class SimpleDocumentParser {     private $domRootObject = null;     private $currentNO = null;      private $currentName  = null;      private $currentValue = null;      private $currentAttribute = null;     public      function getSimpleDocument()      {          return $this->domRootObject;      }     public function parse($file)      {          $xmlParser = xml_parser_create();          xml_parser_set_option($xmlParser,XML_OPTION_CASE_FOLDING,          0);          xml_parser_set_option($xmlParser,XML_OPTION_SKIP_WHITE, 1);          xml_parser_set_option($xmlParser,          XML_OPTION_TARGET_ENCODING, 'UTF-8');          xml_set_object($xmlParser, $this);         xml_set_element_handler($xmlParser, "startElement", "endElement");          xml_set_character_data_handler($xmlParser,          "characterData");         if (!xml_parse($xmlParser, file_get_contents($file)))         die(sprintf("XML error: %s at line %d", xml_error_string(xml_get_error_code($xmlParser)),          xml_get_current_line_number($xmlParser)));         xml_parser_free($xmlParser);     }     private function startElement($parser, $name, $attrs)      {          $this->currentName = $name;          $this->currentAttribute = $attrs;          if($this->currentNO == null)          {              $this->domRootObject = new SimpleDocumentRoot($name);             $this->currentNO = $this->domRootObject;          }          else         {              $this->currentNO = $this->currentNO->createNode($name, $attrs);         }      }     private function endElement($parser, $name)      {          if($this->currentName==$name)         {              $tag = $this->currentNO->getSeq();              $this->currentNO  = $this->currentNO->getPNodeObject();              if($this->currentAttribute!=null && sizeof($this->currentAttribute)>0)              $this->currentNO->setValue($name, array('value'=>$this->currentValue, 'attrs'=>$this->currentAttribute));             else              $this->currentNO->setValue($name, $this->currentValue);             $this->currentNO->removeNode($tag);          }          else         {              $this->currentNO = (is_a($this->currentNO, 'SimpleDocumentRoot'))?   null:              $this->currentNO->getPNodeObject();          }      }     private function characterData($parser,  $data)      {          $this->currentValue = iconv('UTF-8', 'GB2312', $data);      }      function __destruct()      {          unset($this->domRootObject);      }} ?> 文件:SimpleDocumentBase.php <?php /** *=================================================* * @author     hahawen(大龄青年)  * @since      2004-12-04 * @copyright  Copyright (c) 2004, NxCoder Group * *=================================================*/ /** * abstract class SimpleDocumentBase * base class for xml file parse * all this pachage's is work for xml file, and method is action as DOM. * * 1\ add/update/remove data of xml file. * 2\ explode data to array. * 3\ rebuild xml file * * @package SmartWeb.common.xml * @abstract * @version 1.0 */ abstract class SimpleDocumentBase {     private $nodeTag = null;     private $attributes = array();      private $values =      array();     private $nodes = array();     function __construct($nodeTag)      {          $this->nodeTag = $nodeTag;      }     public function getNodeTag()      {          return $this->nodeTag;      }     public function setValues($values){          $this->values = $values;      }     public function setValue($name, $value)      {          $this->values[$name] = $value;      }     public function getValue($name=null)      {          return $name==null?          $this->values: $this->values[$name];      }      public function removeValue($name)      {          unset($this->values["$name"]);      }     public function setAttributes($attributes){          $this->attributes = $attributes;      }     public function setAttribute($name, $value)      {          $this->attributes[$name] = $value;      }     public function getAttribute($name=null)      {          return $name==null? $this->attributes:          $this->attributes[$name];      }     public function removeAttribute($name)      {          unset($this->attributes["$name"]);      }     public function getNodesSize()      {          return sizeof($this->nodes);      }     protected function setNode($name, $nodeId)      {          $this->nodes[$name]          = $nodeId;      }     public abstract function createNode($name, $attributes);     public abstract function removeNode($name);     public abstract function getNode($name=null);     protected function getNodeId($name=null)      {          return $name==null? $this->nodes: $this->nodes[$name];      }     protected function createNodeByName($rootNodeObj, $name, $attributes, $pId)      {          $tmpObject = $rootNodeObj->createNodeObject($pId, $name, $attributes);          $key = isset($attributes[id])?          $name.'_'.$attributes[id]: $name.'_'.$this->getNodesSize();          $this->setNode($key, $tmpObject->getSeq());          return $tmpObject;      }     protected function removeNodeByName($rootNodeObj, $name)      {          $rootNodeObj->removeNodeById($this->getNodeId($name));          if(sizeof($this->nodes)==1)          $this->nodes = array();          else          unset($this->nodes[$name]);      }     protected function getNodeByName($rootNodeObj, $name=null)      {          if($name==null)          {              $tmpList = array();              $tmpIds = $this->getNodeId();              foreach($tmpIds as $key=>$id)              $tmpList[$key] = $rootNodeObj->getNodeById($id);              return $tmpList;          }          else          {              $id = $this->getNodeId($name);              if($id===null)              {                  $tmpIds = $this->getNodeId();                 foreach($tmpIds as $tkey=>$tid)                  {                      if(strpos($key, $name)==0)                      {                          $id = $tid;                          break;                      }                  }              }              return $rootNodeObj->getNodeById($id);          }      }     public function findNodeByPath($path)      {          $pos = strpos($path, '|');          if($pos<=0)          {              return $this->getNode($path);          }          else          {             $tmpObj = $this->getNode(substr($path, 0,              $pos));             return is_object($tmpObj)?              $tmpObj->findNodeByPath(substr($path,              $pos+1)):              null;          }      }     public function getSaveData()      {          $data = $this->values;          if(sizeof($this->attributes)>0)         $data[attrs] = $this->attributes;          $nodeList = $this->getNode();          if($nodeList==null)         return $data;          foreach($nodeList as $key=>$node)          {              $data[$key] = $node->getSaveData();          }         return $data;      }      public function getSaveXml($level=0)      {         $prefixSpace          = str_pad("",          $level, "\t");          $str = "$prefixSpace<$this->nodeTag";          foreach($this->attributes as $key=>$value)          $str .= " $key=\"$value\"";         $str .= ">\r\n";          foreach($this->values as $key=>$value){             if(is_array($value))              {                  $str .= "$prefixSpace\t<$key";                 foreach($value[attrs] as $attkey=>$attvalue)                 $str .= " $attkey=\"$attvalue\"";                 $tmpStr = $value[value];              }              else             {                  $str .= "$prefixSpace\t<$key";                 $tmpStr = $value;              }              $tmpStr = trim(trim($tmpStr, "\r\n"));             $str .= ($tmpStr===null || $tmpStr==="")? " />\r\n": ">$tmpStr</$key>\r\n";         }         foreach($this->getNode() as $node)          $str .= $node->getSaveXml($level+1)."\r\n";          $str .= "$prefixSpace</$this->nodeTag>";         return $str;      }      function __destruct()      {          unset($this->nodes, $this->attributes, $this->values);     }} ?> 文件:SimpleDocumentRoot.php <?php /** *==============================================* * @author     hahawen(大龄青年)  * @since      2004-12-04 * @copyright  Copyright (c) 2004, NxCoder Group * *==============================================*/ /** * class SimpleDocumentRoot * xml root class, include values/attributes/subnodes. * all this pachage's is work for xml file, and method is action as DOM. * * @package SmartWeb.common.xml * @version 1.0 */class SimpleDocumentRoot extends SimpleDocumentBase {     private $prefixStr = '';     private $nodeLists = array();    function __construct($nodeTag)     {         parent::__construct($nodeTag);     }    public function createNodeObject($pNodeId, $name, $attributes)     {         $seq = sizeof($this->nodeLists);         $tmpObject = new SimpleDocumentNode($this,         $pNodeId, $name, $seq);         $tmpObject->setAttributes($attributes);        $this->nodeLists[$seq] = $tmpObject;         return $tmpObject;     }    public function removeNodeById($id)     {         if(sizeof($this->nodeLists)==1)         $this->nodeLists = array();         else         unset($this->nodeLists[$id]);     }    public function getNodeById($id)     {         return $this->nodeLists[$id];     }    public function createNode($name, $attributes)     {         return $this->createNodeByName($this, $name, $attributes, -1);     }    public function removeNode($name)     {         return $this->removeNodeByName($this, $name);     }    public function getNode($name=null)     {         return $this->getNodeByName($this, $name);     }    public function getSaveXml()     {         $prefixSpace = "";         $str = $this->prefixStr."\r\n";         return $str.parent::getSaveXml(0);     } } ?> 文件:SimpleDocumentNode.php <?php/***===============================================** @author     hahawen(大龄青年)  * @since      2004-12-04* @copyright  Copyright (c) 2004, NxCoder Group**===============================================*//*** class SimpleDocumentNode* xml Node class, include values/attributes/subnodes.* all this pachage's is work for xml file, and method is action as DOM.** @package SmartWeb.common.xml* @version 1.0*/class SimpleDocumentNode extends SimpleDocumentBase{     private $seq = null;     private $rootObject = null;     private $pNodeId = null;     function __construct($rootObject, $pNodeId, $nodeTag, $seq)     {         parent::__construct($nodeTag);         $this->rootObject = $rootObject;         $this->pNodeId = $pNodeId;         $this->seq = $seq;     }     public function getPNodeObject()     {         return ($this->pNodeId==-1)?         $this->rootObject:         $this->rootObject->getNodeById($this->pNodeId);     }     public function getSeq(){         return $this->seq;     }     public function createNode($name, $attributes)     {         return $this->createNodeByName($this->rootObject,         $name, $attributes,         $this->getSeq());     }     public function removeNode($name)     {         return $this->removeNodeByName($this->rootObject, $name);     }      public function getNode($name=null)     {         return $this->getNodeByName($this->rootObject,         $name);     }}?> 下面是例子运行对结果 --------------------------------------------------------------------------------下面是通过函数getSaveData()返回的整个xml数据的数组 --------------------------------------------------------------------------------Array(    [name] => 华联    [address] => 北京长安街-9999号    [desc] => 连锁超市    [cat_food] => Array        (            [attrs] => Array                (                    [id] => food                )             [goods_food11] => Array                (                    [name] => food11                    [price] => 12.90                    [attrs] => Array                        (                            [id] => food11                        )                 )             [goods_food12] => Array                (                    [name] => food12                    [price] => 22.10                    [desc] => Array                        (                            [value] => 好东西推荐                            [attrs] => Array                                (                                    [creator] => hahawen                                )                         )                     [attrs] => Array                        (                            [id] => food12                        )                 )         )     [cat_1] => Array        (            [goods_tel21] => Array                (                    [name] => tel21                    [price] => 1290                    [attrs] => Array                        (                            [id] => tel21                        )                 )         )     [cat_coat] => Array        (            [attrs] => Array                (                    [id] => coat                )             [goods_coat31] => Array                (                    [name] => coat31                    [price] => 112                    [attrs] => Array                        (                            [id] => coat31                        )                 )             [goods_coat32] => Array                (                    [name] => coat32                    [price] => 45                    [attrs] => Array                        (                            [id] => coat32                        )                 )         )     [special_hot] => Array        (            [attrs] => Array                (                    [id] => hot                )             [goods_0] => Array                (                    [name] => hot41                    [price] => 99                )         ) ) --------------------------------------------------------------------------------下面是通过setValue()函数,给给根节点添加信息,添加后显示出结果xml文件的内容 --------------------------------------------------------------------------------<?xml version="1.0" encoding="GB2312" ?><shop><name>华联</name><address>北京长安街-9999号</address><desc>连锁超市</desc><telphone>123456789</telphone><cat id="food">  <goods id="food11">   <name>food11</name>   <price>12.90</price>  </goods>  <goods id="food12">   <name>food12</name>   <price>22.10</price>   <desc creator="hahawen">好东西推荐</desc>  </goods></cat><cat>  <goods id="tel21">   <name>tel21</name>   <price>1290</price>  </goods></cat><cat id="coat">  <goods id="coat31">   <name>coat31</name>   <price>112</price>  </goods>  <goods id="coat32">   <name>coat32</name>   <price>45</price>  </goods></cat><special id="hot">  <goods>   <name>hot41</name>   <price>99</price>  </goods></special></shop> --------------------------------------------------------------------------------下面是通过getNode()函数,返回某一个分类下的所有商品的信息 --------------------------------------------------------------------------------商品名:food11Array(    [name] => food11    [price] => 12.90)Array(    [id] => food11)商品名:food12Array(    [name] => food12    [price] => 22.10    [desc] => Array        (            [value] => 好东西推荐            [attrs] => Array                (                    [creator] => hahawen                )         ) )Array(    [id] => food12) --------------------------------------------------------------------------------下面是通过findNodeByPath()函数,返回某一商品的信息 --------------------------------------------------------------------------------商品名:food11Array(    [name] => food11    [price] => 12.90)Array(    [id] => food11) --------------------------------------------------------------------------------下面是通过setValue()函数,给商品"food11"添加属性, 然后显示添加后的结果 --------------------------------------------------------------------------------<?xml version="1.0" encoding="GB2312" ?><shop><name>华联</name><address>北京长安街-9999号</address><desc>连锁超市</desc><telphone>123456789</telphone><cat id="food">  <goods id="food11">   <name>food11</name>   <price>12.90</price>   <leaveword author="hahawen" date="2004-12-05">这个商品不错</leaveword>  </goods>  <goods id="food12">   <name>food12</name>   <price>22.10</price>   <desc creator="hahawen">好东西推荐</desc>  </goods></cat><cat>  <goods id="tel21">   <name>tel21</name>   <price>1290</price>  </goods></cat><cat id="coat">  <goods id="coat31">   <name>coat31</name>   <price>112</price>  </goods>  <goods id="coat32">   <name>coat32</name>   <price>45</price>  </goods></cat><special id="hot">  <goods>   <name>hot41</name>   <price>99</price>  </goods></special></shop> --------------------------------------------------------------------------------下面是通过removeValue()/removeAttribute()函数,给商品"food11"改变和删除属性, 然后显示操作后的结果 --------------------------------------------------------------------------------<?xml version="1.0" encoding="GB2312" ?><shop><name>华联</name><address>北京长安街-9999号</address><desc>连锁超市</desc><telphone>123456789</telphone><cat id="food">  <goods id="food11">   <name>food11</name>   <price>12.90</price>   <leaveword author="hahawen" date="2004-12-05">这个商品不错</leaveword>  </goods>  <goods id="food12">   <name>new food12</name>   <price>22.10</price>  </goods></cat><cat>  <goods id="tel21">   <name>tel21</name>   <price>1290</price>  </goods></cat><cat id="coat">  <goods id="coat31">   <name>coat31</name>   <price>112</price>  </goods>  <goods id="coat32">   <name>coat32</name>   <price>45</price>  </goods></cat><special id="hot">  <goods>   <name>hot41</name>   <price>99</price>  </goods></special></shop> --------------------------------------------------------------------------------下面是通过createNode()函数,添加商品, 然后显示添加后的结果 --------------------------------------------------------------------------------<?xml version="1.0" encoding="GB2312" ?><shop><name>华联</name><address>北京长安街-9999号</address><desc>连锁超市</desc><telphone>123456789</telphone><cat id="food">  <goods id="food11">   <name>food11</name>   <price>12.90</price>   <leaveword author="hahawen" date="2004-12-05">这个商品不错</leaveword>  </goods>  <goods id="food12">   <name>new food12</name>   <price>22.10</price>  </goods>  <goods id="food13">   <name>food13</name>   <price>100</price>  </goods></cat><cat>  <goods id="tel21">   <name>tel21</name>   <price>1290</price>  </goods></cat><cat id="coat">  <goods id="coat31">   <name>coat31</name>   <price>112</price>  </goods>  <goods id="coat32">   <name>coat32</name>   <price>45</price>  </goods></cat><special id="hot">  <goods>   <name>hot41</name>   <price>99</price>  </goods></special></shop> --------------------------------------------------------------------------------下面是通过removeNode()函数,删除商品, 然后显示删除后的结果 --------------------------------------------------------------------------------<?xml version="1.0" encoding="GB2312" ?><shop><name>华联</name><address>北京长安街-9999号</address><desc>连锁超市</desc><telphone>123456789</telphone><cat id="food">  <goods id="food11">   <name>food11</name>   <price>12.90</price>   <leaveword author="hahawen" date="2004-12-05">这个商品不错</leaveword>  </goods>  <goods id="food13">   <name>food13</name>   <price>100</price>  </goods></cat><cat>  <goods id="tel21">   <name>tel21</name>   <price>1290</price>  </goods></cat><cat id="coat">  <goods id="coat31">   <name>coat31</name>   <price>112</price>  </goods>  <goods id="coat32">   <name>coat32</name>   <price>45</price>  </goods></cat><special id="hot">  <goods>   <name>hot41</name>   <price>99</price>  </goods></special></shop>

阅读(4353) | 评论(0)


版权声明:编程爱好者网站为此博客服务提供商,如本文牵涉到版权问题,编程爱好者网站不承担相关责任,如有版权问题请直接与本文作者联系解决。谢谢!

评论

暂无评论
您需要登录后才能评论,请 登录 或者 注册