How to get text without HTML tags

Here is the HTML:
<pre><code><div class="ajaxcourseindentfix">
 <h3>CPSC 353 - Introduction to Computer Security (3) </h3>
 <hr>Security goals, security systems, access controls, networks and security, integrity, cryptography fundamentals, authentication. Attacks: software, network, website; management considerations, security standards in government and industry; security issues in requirements, architecture, design, implementation, testing, operation, maintenance, acquisition, and services.
 
 Prerequisite: <a href="preview_course_nopop.php?catoid=16&coid=96570" onclick="acalogPopup()">CPSC 253U</a>
   or <a href="#" onclick="acalogPopup()" target="_blank">CPSC 254</a>
   and <a href="#" onclick="acalogPopup()" target="_blank">CPSC 351</a>
  
 , declared major/minor in CPSC, CPEN, or CPEI
 
</div>
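</code></pre>
I need to get the following text from this HTML: from line 6 of the snippet, "or"; from line 7, "and"; and finally ", declared major/minor in CPSC, CPEN, or CPEI". I can get the hrefs [course numbers: CPSC 254, etc…] with the following XPath: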
<pre><code> # This xpath gives me all the tags followed by h3 and then I iterate through them in my script. 
//div[@class='ajaxcourseindentfix']/h3/following-sibling::text()[2]/following-sibling::*
</code></pre>
Update: I then get the text with the following XPath:
<pre><code># This xpath gives me all the text after the h3 tag. 
//div[@class='ajaxcourseindentfix']/h3/following-sibling::text()[2]/following-sibling::text()
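</code></pre>
I need to get these course names/prerequisites in the same form in which they appear at URL 1. With this approach, I first get all the hrefs and then all the text. Is there a better way to achieve this? I don't want to iterate over two XPaths, getting the hrefs first and then the text, and then merge them to form the prerequisite string.

1 <a href="http://catalog.fullerton.edu/ajax/preview_course.php?catoid=16&coid=99648&show">http://catalog.fullerton.edu/ajax/preview_course.php?catoid=16&coid=99648&show</a>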
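For illustration, one possible single-pass alternative is to collect the text nodes and the link text together in document order, so that nothing has to be merged afterwards. The sketch below assumes Python and uses only the standard-library `html.parser` instead of XPath; the question does not say which scraping tool is in use, so that choice, the names `PrereqTextParser` and `prerequisite_text`, and the shortened `SAMPLE` markup are all hypothetical.

```python
from html.parser import HTMLParser


class PrereqTextParser(HTMLParser):
    """Collects every text chunk inside div.ajaxcourseindentfix, in document order."""

    def __init__(self):
        super().__init__()
        self.depth = 0    # 0 means we are outside the target div
        self.chunks = []  # text from plain nodes and from inside <a> tags alike

    def handle_starttag(self, tag, attrs):
        if tag == "div":
            # Enter the target div, or track a div nested inside it.
            if self.depth or dict(attrs).get("class") == "ajaxcourseindentfix":
                self.depth += 1

    def handle_endtag(self, tag):
        if tag == "div" and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.chunks.append(data)


def prerequisite_text(html):
    """Returns the tag-free text that follows the 'Prerequisite:' label."""
    parser = PrereqTextParser()
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace left over from the markup's indentation.
    text = " ".join("".join(parser.chunks).split())
    _, found, tail = text.partition("Prerequisite:")
    # Tidy the space that ends up before a comma once the tags are gone.
    return tail.strip().replace(" ,", ",") if found else ""


# Shortened, hypothetical copy of the markup from the question.
SAMPLE = """<div class="ajaxcourseindentfix">
 <h3>CPSC 353 - Introduction to Computer Security (3) </h3>
 <hr>Security goals.
 Prerequisite: <a href="#">CPSC 253U</a>
   or <a href="#">CPSC 254</a>
   and <a href="#">CPSC 351</a>

 , declared major/minor in CPSC, CPEN, or CPEI
</div>"""

print(prerequisite_text(SAMPLE))
# CPSC 253U or CPSC 254 and CPSC 351, declared major/minor in CPSC, CPEN, or CPEI
```

Because the parser visits text inside and outside the `<a>` tags in source order, the words "or" and "and" stay between the course numbers without a second query.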