This queue is for tickets about the HTML-Tree CPAN distribution.

Report information
The Basics
Id: 124734
Status: new
Priority: 0/
Queue: HTML-Tree

People
Owner: Nobody in particular
Requestors: tbruner [...] sandia.gov
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in: (no value)
Fixed in: (no value)



Subject: angle bracket enclosed text
Date: Thu, 8 Mar 2018 23:20:05 +0000
To: "bug-HTML-Tree [...] rt.cpan.org" <bug-HTML-Tree [...] rt.cpan.org>
From: "Bruner, Todd" <tbruner [...] sandia.gov>
Hi,

I’m trying to parse the following source HTML, which contains an SMTP message ID:

    # single-quoted heredoc so that @mail is not interpolated as an array
    $html = <<'EOF';
    <html>
    <div>
    <CAEr2S5-HuU1MjnUQtqS6Ri-i3ZaYcTm_+cjf6mkmOgwGJHjPJA@mail.gmail.com>
    </div>
    </html>
    EOF

    my $tree = HTML::TreeBuilder->new;
    $tree->implicit_tags(1);
    $tree->p_strict(1);
    $tree->no_space_compacting(1);
    $tree->parse_content($html);
    $tree->elementify;

Later, as I walk the tree and reach the <div> node, $node->is_empty evaluates to true.

Is there a way to get HTML::Tree to recognize the text in that node, other than pre-processing the div content to remove the angle brackets?

Thanks,
Todd
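For comparison, the pre-processing approach mentioned above could be sketched as follows. This is only a minimal illustration, not something HTML::Tree provides: `escape_message_ids` is a hypothetical helper, and it assumes that any angle-bracketed token containing an `@` and no whitespace is a message ID rather than markup. Instead of removing the brackets, it rewrites them as `&lt;` and `&gt;` entities, so a parser should then see the ID as ordinary text.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of HTML::Tree): escape the angle brackets
# around tokens of the form <local@domain> that contain no whitespace.
sub escape_message_ids {
    my ($html) = @_;
    $html =~ s{<([^<>\s\@]+\@[^<>\s\@]+)>}{&lt;$1&gt;}g;
    return $html;
}

# Single-quoted heredoc so that @mail is not interpolated.
my $html = <<'EOF';
<html>
<div>
<CAEr2S5-HuU1MjnUQtqS6Ri-i3ZaYcTm_+cjf6mkmOgwGJHjPJA@mail.gmail.com>
</div>
</html>
EOF

my $escaped = escape_message_ids($html);
print $escaped;
```

Real tags such as `<html>` and `<div>` contain no `@` and are left untouched. A pattern this loose would also escape a genuine tag that happens to carry an unquoted `@` with no spaces, so it may need tightening for other inputs.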

