This queue is for tickets about the HTML-Parser CPAN distribution.

Report information

The Basics
Id: 17962
Status: resolved
Priority: Low/Low
Queue: HTML-Parser

People
Owner: Nobody in particular
Requestors: LGODDARD [...] cpan.org
Cc:
AdminCc:

BugTracker
Severity: Critical
Broken in: 3.19
Fixed in: (no value)

Subject: Mis-represents data.
Please see the Data::Dumper output below for an HTML::TokeParser token: compare $VAR1->[1]->{href} and $VAR1->[3]. The latter is correct. This is with the latest binary for Win32 ActivePerl, which I admit is an old version. I have no VC++ here, so I can't say whether this is still a bug in the current release.

$VAR1 = [
  'a',
  {
    'href' => '/index.php?currpage=2&days=1&jobtype=0&keywords=PERL〈=en&orderby=4&task=JobSearch&xc=0'
  },
  [
    'href'
  ],
  '<a href="/index.php?currpage=2&days=1&jobtype=0&keywords=PERL&lang=en&orderby=4&task=JobSearch&xc=0">'
];
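Until the decoding question is settled, one possible workaround is to read the attribute from the raw tag text (the last element of the token) instead of the decoded attribute hash. The sketch below is only an illustration: the token is built by hand rather than produced by HTML::TokeParser, `raw_attr` is a hypothetical helper that is not part of any module, and it only handles double-quoted attribute values.

```perl
use strict;
use warnings;

# Hand-built stand-in for a get_tag() token:
# [ tag name, decoded attribute hash, attribute order, raw tag text ].
my $token = [
    'a',
    { href => "/index.php?keywords=PERL\x{2329}=en" },
    ['href'],
    '<a href="/index.php?keywords=PERL&lang=en">',
];

# Hypothetical helper: return a double-quoted attribute value exactly as it
# appears in the raw tag text, with no entity decoding applied.
sub raw_attr {
    my ($tag_text, $name) = @_;
    return $tag_text =~ /\b\Q$name\E\s*=\s*"([^"]*)"/i ? $1 : undef;
}

my $raw = raw_attr($token->[3], 'href');
print "decoded and raw href differ\n" if $token->[1]{href} ne $raw;
```

Because nothing is decoded, a correctly written `&amp;` in the source stays as `&amp;` in the returned value, so the caller has to decide how to handle it.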
Can't tell if there is anything wrong without a test case that includes the HTML that you parse. Please provide a minimal program that demonstrates the bug.
From: lgoddard@cpan.org
On Sun Mar 12 17:34:10 2006, GAAS wrote:
> Can't tell if there is anything wrong without a test case that includes
> the HTML that you parse. Please provide a minimal program that
> demonstrates the bug.
I've attached a full example with perl code, raw HTML data, and the URI of the (dynamic) data source. Hope that helps. lee
use strict;
use warnings;
use Data::Dumper;
use HTML::TokeParser;

# Parse the HTML that follows __DATA__ and dump every <a> start-tag token.
my $p = HTML::TokeParser->new(*DATA);
while (my $token = $p->get_tag('a')) {
    warn Dumper $token;
}

__DATA__
<!-- On 13 March 2006 12:04 from http://www.theitjobboard.co.uk/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobSearch&xc=0 --> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <head> <title>perl html jobs</title> <meta name="description" content="The IT Job Board - Search for thousands of perl html jobs from the IT Job Board" /> <meta name="keywords" content="IT jobs, jobs, perl html jobs" /> <style> <!-- --> </style> <style> <!-- @import url(/css/style.css); --> </style> <script type="text/javascript" src="/scripts/createlinks.js"></script> <script type="text/javascript" src="/scripts/websurveyorpopup.js"></script> <siteconinfo><!-- SMWH-IS-3 --></siteconinfo> </head> <body onload="javascript: createlinks('en');"><div id="container"><div id="top"><div id="logo"><a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=Home&xc=0"><img src="/images/en/logo_small.gif" border="0" alt="perl html jobs" /></a><div><br /><p style="font-size:14px;"><strong>34362&nbsp;Live IT Jobs</strong></p></div></div><div id="searchform"><form name="searchform" action="/index.php" method="get" id="searchboxform"><div id="left"><div id="srchkeywordsnote"><STRONG></STRONG>&nbsp;</div><div id="srchkeywords"><span class="label">Keywords</span><input type="text" name="keywords" value="PERL AND HTML" tabindex="1" id="keywords" /></div><div id="srchcountry"><span class="label">Country</span><select name="locations[]" id="locations_box" tabindex="2"><option value="">All locations</option><option value="180">UK</option><option value="124">Netherlands</option><option value="93">Germany</option><option value="69">France</option><option 
value="308">Switzerland</option><option value="13">Belgium</option><option value="170">Republic Of Ireland</option><option value="3">Austria</option><option value="301">Rest Of The World</option><option value="254">Other European</option></select></div><div id="srchjobtype"><span class="label">Job Type</span><select name="jobtype" tabindex="3"><option value="0">Any</option><option value="1">Contract</option><option value="2">Permanent</option></select></div><div id="srchposted"><span class="srchpostedlabel">Posted within</span><select name="days" tabindex="4"><option value="0">All </option><option value="7">7 days</option><option value="6">6 days</option><option value="5">5 days</option><option value="4">4 days</option><option value="3">3 days</option><option value="2">2 days</option><option value="1" selected="selected">1 day</option></select></div><div id="srchsortby"><span class="label">Sort by</span><select name="orderby" tabindex="6"><option value="1">Relevance</option><option value="2">Date posted</option><option value="3">Salary</option><option value="4" selected="selected">Non Agency/Agency</option></select></div><div id="srchsubmit"><input type="submit" name="submit" value="Search" tabindex="7" class="button" id="submit" /></div></div><input type="hidden" name="task" value="JobSearch" id="task" /><input type="hidden" name="xc" value="0" id="xc" /><input type="hidden" name="lang" value="en" id="lang" /></form><div id="right"><div id="righttop"><div><a href="/index.php?task=help&lang=en&xc=0&id=search_tips&nocache=1142247679" title="Search tips" id="pop_search">Search tips</a></div><div><a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=WizardJobSearch&xc=0" title="Advanced Search" id="pop_wizard">Advanced Search</a></div></div><div id="rightbottom"><div id="otherjobs"><span class="rightlabel"><strong>Other Job Boards</strong></span><select name="otherjob" onchange="javascript:if (this.selectedIndex!=0) 
document.location=this.options[this.selectedIndex].value;"><option value="#">Please select ---></option><option value="http://www.itjobboard.nl/">IT Job Board.nl</option><option value="http://www.theunixjobboard.com/">The UNIX Job Board</option><option value="http://www.theitjobboard.be/">IT Job Board.be</option></select></div></div></div></div></div><div id="bodycontainer"><div class="hrspacer"></div><div id="responselabel"></div><div id="response"><div id="responsestat"><p style="display:inline;font-weight:normal;font-size:12px;">You are viewing jobs&nbsp;<strong>1</strong>&nbsp;to&nbsp;<strong>20</strong>&nbsp;of&nbsp;<strong>24</strong>&nbsp;</p><h1>perl html jobs</h1></div></div><div id="emaillink"><div id="distributemycv"><p><a href="javascript:window.open('http://apply.theitjobboard.com/index.php?brand=21&amp;jb=itjb&amp;xc=0&amp;lang=1',%20'cva',%20'width=500,%20height=398');%20void(0);" title="Apply to jobs before they are advertised! Distribute your CV to make sure that you are considered for all relevant advertised and non-advertised jobs. Simply fill out your details and if your profile matches our clients needs your CV will be sent directly to the top specialist IT recruitment consultants in your area.">Distribute My CV</a></p></div><div id="emailmejobslikethis"><p><a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JBE&xc=0&jbereferrer=jobsearchresults" title="Email me jobs. Sign up to Jobs by Email Simply type in your email address and keywords to receive daily email updates of the latest and most relevant IT vacancies Edit your existing criteria Use the Edit function to edit your Jobs by email criteria Unsubscribe your email address If you do not wish to receive any more jobs by email from The IT Job Board. 
please use the unsubscribe function from our Email me jobs page."><img src="/images/en/envelope.gif" border="0" /></a> <a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JBE&xc=0&jbereferrer=jobsearchresults" title="Email me jobs. Sign up to Jobs by Email Simply type in your email address and keywords to receive daily email updates of the latest and most relevant IT vacancies Edit your existing criteria Use the Edit function to edit your Jobs by email criteria Unsubscribe your email address If you do not wish to receive any more jobs by email from The IT Job Board. please use the unsubscribe function from our Email me jobs page.">email me jobs like this</a></p></div></div><div class="hrspacer"></div><div id="results"><div class="jobdet"><p>1. <a href="/it-jobs-details/all/PERL_AND_HTML/1824553/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1824553&pagename=">Software Engineer - Amsterdam</a><br />Software Engineer PLEASE QUOTE THE VACANCY REFERENCE NUMBER IN YOUR APPLICATION Vacancy Reference Number: 0209-001 The RIPE Network Coordination Centre (RIPE NCC) is one of five Regional Internet Regi...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:01:07&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong>&nbsp;&nbsp;Direct Client:&nbsp;<span style="font-weight:bold;background-color:#FEFECC;">RIPE NCC</span><br />SALARY:<strong>&nbsp; &#128; 41372 and &#128; 51718...&nbsp;</strong>LOCATION:<strong>&nbsp;Amsterdam</strong><br /></p></div><div class="jobdet"><p>2. 
<a href="/it-jobs-details/all/PERL_AND_HTML/1840072/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1840072&pagename=">Software Engineers (Java, C#, SQL, XML / HTML, PHP, Python, Perl)</a><br />Job Title: Software Engineers (Java, C#, SQL, XML / HTML, PHP, Python, Perl) Location: Edinburgh, Scotland, UK Salary: &#163;25,000 - &#163;34,000 Job Type: Perm Job Description: 3MRT works in the fi...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:01:07&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong>&nbsp;&nbsp;Direct Client:&nbsp;<span style="font-weight:bold;background-color:#FEFECC;">3MRT LTD</span><br />SALARY:<strong>&nbsp;&#163;25000 - &#163;34000&nbsp;</strong>LOCATION:<strong>&nbsp;Edinburgh, Scotland, UK</strong><br /></p></div><div class="jobdet"><p>3. <a href="/it-jobs-details/all/PERL_AND_HTML/1993112/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1993112&pagename=">Anti-Spam Engine and Tools Developer</a><br />My Buckinghamshire based client requires a Anti-Spam Engine and Tools Developer to be responsible for defining clear engineering level requirements for projects while dev...<br />LAST POSTED:<strong>&nbsp;13/03/2006 10:22:54&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&#163;35 - &#163;45 per an...&nbsp;</strong>LOCATION:<strong>&nbsp;England - Thames Valley</strong><br /></p></div><div class="jobdet"><p>4. <a href="/it-jobs-details/all/PERL_AND_HTML/1992981/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1992981&pagename=">Software Developer Technical Lead, Work with Complex Interactive Technologies!</a><br />My client is a market leader in providing interactive games software. They have enjoyed a profitable and strong year since its inception over 10 years ago. 
They are urg...<br />LAST POSTED:<strong>&nbsp;13/03/2006 10:22:42&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&#163;42-45000 plus company benefits...&nbsp;</strong>LOCATION:<strong>&nbsp;UK</strong><br /></p></div><div class="jobdet"><p>5. <a href="/it-jobs-details/all/PERL_AND_HTML/1992925/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1992925&pagename=">Identity Managment - Advanced Security Support Engineer - Berkshire</a><br />Identity Management - Advanced Security Support Engineer - Berkshire required for global security solution market leader. Acting as a part of a global support team suppor...<br />LAST POSTED:<strong>&nbsp;13/03/2006 10:15:26&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;Upto 45K + Benefits + Onca...&nbsp;</strong>LOCATION:<strong>&nbsp;England - Thames Valley</strong><br /></p></div><div class="jobdet"><p>6. <a href="/it-jobs-details/all/PERL_AND_HTML/1992763/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1992763&pagename=">Java Web Developer</a><br /><P>WORK ANYWHERE! </P>Required Skills:* Minimum 3-5 years experience in web development* Java, JSP, Servlets, UNIX, SQL, XML a must* Relational Databases ...<br />LAST POSTED:<strong>&nbsp;13/03/2006 10:14:38&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&nbsp;</strong>LOCATION:<strong>&nbsp;England - London</strong><br /></p></div><div class="jobdet"><p>7. <a href="/it-jobs-details/all/PERL_AND_HTML/1992678/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1992678&pagename=">Software Engineer (M/V)</a><br /><p>OmschrijvingThe Software Engineering Department (SED) is responsible for the development and maintenance of software within the company. 
The department also pr...<br />LAST POSTED:<strong>&nbsp;13/03/2006 10:13:16&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&nbsp;</strong>LOCATION:<strong>&nbsp;Noord-Holland</strong><br /></p></div><div class="jobdet"><p>8. <a href="/it-jobs-details/all/PERL_AND_HTML/1989063/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1989063&pagename=">Google Internal Applications Java Engineer - EU Headquarters</a><br /><p>Internal Applications Java Engineer &#150; EU HeadquartersThis position is located in the European Headquarters in Dublin, Ireland.We are looking for proactive engi...<br />LAST POSTED:<strong>&nbsp;13/03/2006 10:11:30&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&nbsp;</strong>LOCATION:<strong>&nbsp;Rest Of The World</strong><br /></p></div><div class="jobdet"><p>9. <a href="/it-jobs-details/all/PERL_AND_HTML/1765403/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1765403&pagename=">SQL Server Development &amp; Business System Support - Oxford</a><br /> Computer Futures Solutions are seeking a strong graduate with a 2:1 or above from a reputable university to join their technical Software Support team. Technologies will include MS SQL Server, Delphi...<br />LAST POSTED:<strong>&nbsp;13/03/2006 10:04:30&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&#163;20000 - &#163;27000&nbsp;</strong>LOCATION:<strong>&nbsp;Oxford, Oxfordshire - Th...</strong><br /></p></div><div class="jobdet"><p>10. 
<a href="/it-jobs-details/all/PERL_AND_HTML/1992349/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1992349&pagename=">Google Wireless Software Engineer in Test - London</a><br />Wireless Software Engineer in TestPositions based in London, UK.The world-class Engineering team at Google aims to solve the mostfascinating and difficult problems in com...<br />LAST POSTED:<strong>&nbsp;13/03/2006 10:04:25&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;Negotiable&nbsp;</strong>LOCATION:<strong>&nbsp;England - London</strong><br /></p></div><div class="jobdet"><p>11. <a href="/it-jobs-details/all/PERL_AND_HTML/1992258/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1992258&pagename=">Software Developer - Perl OO Unix Sybase</a><br />This is a development role within Market Data systems for a top-tier Asset Management bank. You must have Object Oriented (OO), Perl, Unix and Sybase with strong HTML or ...<br />LAST POSTED:<strong>&nbsp;13/03/2006 10:02:24&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&#163;60000 - &#163;65000 ...&nbsp;</strong>LOCATION:<strong>&nbsp;England - London - City</strong><br /></p></div><div class="jobdet"><p>12. <a href="/it-jobs-details/all/PERL_AND_HTML/1984249/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1984249&pagename=">Technical Consultant</a><br /><P>Technical Consultant, Bracknell, &#163;50 - &#163;60k</P><P>This person on a daily basis works with our elite customers to help design and implement thei...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:54:31&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&nbsp;</strong>LOCATION:<strong>&nbsp;England - Thames Valley</strong><br /></p></div><div class="jobdet"><p>13. 
<a href="/it-jobs-details/all/PERL_AND_HTML/1992034/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1992034&pagename=">Support / Developer, Unix, Perl, Shell, HTML, VB - Investment Bank</a><br /><P>Support / Developer, Unix, Perl, Shell, HTML, VB - Investment Banking, London.</P><P>The main function of the role will include:supporting Data...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:44:46&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&#163;40 - &#163;50k basic + bonu...&nbsp;</strong>LOCATION:<strong>&nbsp;England - London</strong><br /></p></div><div class="jobdet"><p>14. <a href="/it-jobs-details/all/PERL_AND_HTML/1979323/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1979323&pagename=">Software Architect - C / Perl</a><br /><P>My Award Winning Communications client based in Berkshire have a fantastic opportunity for a Software Architect to join their team. They are expanding rapidly ...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:43:20&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&nbsp;</strong>LOCATION:<strong>&nbsp;England - Thames Valley</strong><br /></p></div><div class="jobdet"><p>15. <a href="/it-jobs-details/all/PERL_AND_HTML/1973126/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1973126&pagename=">Technical Architect - C/Perl/SMS/MMS</a><br />My client is a Pan-European telco company providing services to all the leading mobile technology companies. Their clients range from advertising agencies, to broadcaster...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:34:35&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&nbsp;</strong>LOCATION:<strong>&nbsp;England - London</strong><br /></p></div><div class="jobdet"><p>16. 
<a href="/it-jobs-details/all/PERL_AND_HTML/1991714/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1991714&pagename=">PHP / SQL / HTML / Perl Developer</a><br />Our client is a well established company operating in the financial services sector. The majority of the company revenue is generated through existing links to external w...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:21:53&nbsp;</strong>TYPE:<strong>&nbsp;Contract</strong><br />SALARY:<strong>&nbsp;&#163;20000 - &#163;35000 per...&nbsp;</strong>LOCATION:<strong>&nbsp;England - North West</strong><br /></p></div><div class="jobdet"><p>17. <a href="/it-jobs-details/all/PERL_AND_HTML/1991683/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1991683&pagename=">Software Engineer (M/V)</a><br /><p>OmschrijvingThe Software Engineering Department (SED) is responsible for the development and maintenance of software within the company. The department also pr...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:20:55&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&nbsp;</strong>LOCATION:<strong>&nbsp;Noord-Holland</strong><br /></p></div><div class="jobdet"><p>18. <a href="/it-jobs-details/all/PERL_AND_HTML/1974113/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1974113&pagename=">Google Internal Systems Software Engineer in Test - EU Headquarters</a><br />Internal Systems Software Engineer in TestThis position is located in the EU Headquarters in Dublin, Ireland. We are looking for software QA engineers to work with the te...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:10:22&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&nbsp;</strong>LOCATION:<strong>&nbsp;Rest Of The World</strong><br /></p></div><div class="jobdet"><p>19. 
<a href="/it-jobs-details/all/PERL_AND_HTML/1974110/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1974110&pagename=">Google Intranet Web Tools Engineer - EU Headquarters</a><br />Intranet Web Tools Engineer This position is located in our European Operations HQ in Dublin, Ireland.Google&#039;s IT Services team is seeking a Web Application Engineer tode...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:10:21&nbsp;</strong>TYPE:<strong>&nbsp;Permanent</strong><br />SALARY:<strong>&nbsp;&nbsp;</strong>LOCATION:<strong>&nbsp;Rest Of The World</strong><br /></p></div><div class="jobdet"><p>20. <a href="/it-jobs-details/all/PERL_AND_HTML/1917959/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&aid=1917959&pagename=">Web developer</a><br /><P>This position will be responsible for designing, developing and managing the corporate web site infrastructure for the company. Reporting to the Internet Manag...<br />LAST POSTED:<strong>&nbsp;13/03/2006 09:06:17&nbsp;</strong>TYPE:<strong>&nbsp;Contract</strong><br />SALARY:<strong>&nbsp;&nbsp;</strong>LOCATION:<strong>&nbsp;England - East Anglia</strong><br /></p></div><div id="prevnext"><div id="prevnextpage">Pages:1 | <a href="/index.php?currpage=2&days=1&jobtype=0&keywords=PERL+AND+HTML&lang=en&orderby=4&task=JobSearch&xc=0">2</a></div><div id="prevnextlabel"><a href="/index.php?currpage=2&days=1&jobtype=0&keywords=PERL+AND+HTML&lang=en&orderby=4&task=JobSearch&xc=0" title="Next">Next</a></div></div><div class="hrspacer"></div></div><div id="nonagencyjobs"><div id="ads">Relevant non-agency jobs</div><div class="naj">3MRT LTD<br /><a href="/it-jobs-details/all/PERL_AND_HTML/1840072/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&cid=2417662&aid=1840072&pagename=">Software Engineers (Java, C#, SQL, XML / HTML, PHP, Python, Perl)</a></div><div class="naj">RIPE NCC<br /><a 
href="/it-jobs-details/all/PERL_AND_HTML/1824553/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobDetails&xc=0&cid=2204064&aid=1824553&pagename=">Software Engineer - Amsterdam</a></div><div id="viewfeatured"><a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=FeaturedClients&xc=0">Click here to view all our Featured Employers </a></div></div><div class="clear"></div></div></div><div id="footer"><a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=Home&xc=0">Jobs Home</a>&nbsp;&nbsp; | &nbsp;&nbsp;<a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JBE&xc=0&jbereferrer=navigation">Email Me IT Jobs</a>&nbsp;&nbsp;<a href="javascript:window.open('http://apply.theitjobboard.com/index.php?brand=21&amp;jb=itjb&amp;xc=0&amp;lang=1',%20'cva',%20'width=500,%20height=398');%20void(0);">Upload CV</a>&nbsp;&nbsp;<a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=Preferences&xc=0">My Preferences</a>&nbsp;&nbsp;<a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=FeaturedClients&xc=0">Companies Hiring Directly</a>&nbsp;&nbsp; | &nbsp;&nbsp;<a href="/advertise/home/index.php?lang=en&xc=0">Advertise Your&nbsp;IT Jobs</a><br /><a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=AboutUs&xc=0"><FONT size=1>About The IT Job Board</FONT></a>&nbsp;&nbsp;<a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=ContactUs&xc=0"><FONT size=1>Contact The IT Job Board</FONT></a>&nbsp;&nbsp;<a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=Help&xc=0&type=privacy" id="pop_privacy"><FONT size=1>Privacy and Registration</FONT></a>&nbsp;&nbsp;<a href="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=Help&xc=0&type=tandc" 
id="pop_tandc"><FONT size=1>Terms&nbsp;and&nbsp;Conditions</FONT></a>&nbsp;&nbsp;<a href="/jobs/it-jobs-in-world.html?lang=en"><FONT size=1>Browse IT Jobs</FONT></a></div><form name="admin" action="/login.php" method="post"><input type="image" name="access" src="/image.php?image=tiny.gif" border="0" width="1" height="1" accesskey="p" id="access" /><input type="hidden" name="refer" value="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobSearch&xc=0" id="refer" /></form><form name="admin" action="/logoff.php" method="post"><input type="image" name="access" src="/image.php?image=tiny.gif" border="0" width="1" height="1" accesskey="o" id="access" /><input type="hidden" name="refer" value="/index.php?currpage=1&days=1&jobtype=0&keywords=perl+html&lang=en&orderby=4&task=JobSearch&xc=0" id="refer" /></form></body></html>
The reason "&lang" is expanded is that it's an official HTML entity name; see http://www.w3.org/TR/REC-html40/sgml/entities.html#h-24.3.1

Browsers have traditionally expanded entities even when the trailing ";" is missing, but there seems to be an exception for the non-Latin-1 entities. I tested this piece of HTML in Firefox and Konqueror:

  <html>
  <body>
  <a href="foo?a=1&eth=1&times=3&lang=4&Gamma=5&lang;=6">foo &lang;&lang=</a>
  </body>
  </html>

Both expand "&eth", "&times" and "&lang;" into the corresponding character but leave "&lang" and "&Gamma" alone. Strangely enough, Firefox does expand "&lang" outside of the attribute, so it actually plays by even more rules.

HTML is such a mess!
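The attribute-value behaviour observed in the browsers above can be modelled with a few lines of plain Perl. This is a toy sketch, not how HTML::Parser or any browser is implemented: the entity table holds only the four names from the test page, `decode_like_browser` is a made-up name, and the rule it applies (terminated entities always expand, unterminated ones only when the character is in the Latin-1 range) is simply the pattern reported in this ticket.

```perl
use strict;
use warnings;

# Toy entity table (assumption: only the names used in the test page,
# with their HTML 4 code points).
my %ENTITY = (
    eth   => 0x00F0,    # Latin-1
    times => 0x00D7,    # Latin-1
    lang  => 0x2329,    # outside Latin-1
    Gamma => 0x0393,    # outside Latin-1
);

# Expand "&name;" whenever the name is known; expand "&name" without the
# semicolon only when the character lies in the Latin-1 range.
sub decode_like_browser {
    my ($text) = @_;
    $text =~ s{&([A-Za-z][A-Za-z0-9]*)(;?)}{
        my ($name, $semi) = ($1, $2);
        my $cp = $ENTITY{$name};
        (defined $cp && ($semi || $cp <= 0xFF)) ? chr($cp) : "&$name$semi";
    }ge;
    return $text;
}

binmode STDOUT, ':encoding(UTF-8)';
print decode_like_browser('foo?a=1&eth=1&times=3&lang=4&Gamma=5&lang;=6'), "\n";
```

With this rule "&lang=4" and "&Gamma=5" pass through unchanged, while "&eth", "&times" and "&lang;" are replaced by their characters.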
Subject: Re: [rt.cpan.org #17962] Mis-represents data.
Date: Tue, 21 Mar 2006 15:38:06 +0100
To: bug-HTML-Parser@rt.cpan.org
From: Lee Goddard <lee@leegoddard.net>
Gisle_Aas via RT wrote:
> <URL: http://rt.cpan.org/Ticket/Display.html?id=17962 >
>
> The reason "&lang" is expanded is that it's an official HTML entity
> name; see http://www.w3.org/TR/REC-html40/sgml/entities.html#h-24.3.1
>
> Browsers have traditionally expanded entities even when the trailing ";"
> is missing, but there seems to be an exception for the non-Latin-1
> entities. I tested this piece of HTML in Firefox and Konqueror:
>
>   <html>
>   <body>
>   <a href="foo?a=1&eth=1&times=3&lang=4&Gamma=5&lang;=6">foo &lang;&lang=</a>
>   </body>
>   </html>
>
> Both expand "&eth", "&times" and "&lang;" into the corresponding
> character but leave "&lang" and "&Gamma" alone. Strangely enough,
> Firefox does expand "&lang" outside of the attribute, so it actually
> plays by even more rules.
>
> HTML is such a mess!
HTML: it's getting better all the time (couldn't get much worse), to coin a phrase... If only everyone would agree with the standard.

I don't have the energy to track down the URI spec today, but logically (HTML/logic: ha!) the semicolon in &lang; above ought to be URI-encoded, right? Otherwise it might be interpreted as a new-style delimiter, just as the ampersand was the old-style delimiter. What should happen when those two appear together, I dunno. Ho hum.

Any thoughts on how you might deal with the mess? My vote is to not look for entities in URIs...

Cheers
lee
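On the page author's side, the ambiguity disappears if query values are percent-encoded and the ampersands that separate parameters are written as "&amp;" inside the attribute. The following is a minimal sketch in core Perl, assuming ASCII input; `uri_escape_component`, `html_attr_escape` and `build_href` are illustrative names, not functions from an existing module.

```perl
use strict;
use warnings;

# Percent-encode everything except the RFC 3986 "unreserved" characters.
# Assumption: input is ASCII; real code would encode to UTF-8 bytes first.
sub uri_escape_component {
    my ($s) = @_;
    $s =~ s/([^A-Za-z0-9\-._~])/sprintf('%%%02X', ord($1))/ge;
    return $s;
}

# Escape the characters that are special inside a double-quoted HTML attribute.
sub html_attr_escape {
    my ($s) = @_;
    $s =~ s/&/&amp;/g;
    $s =~ s/</&lt;/g;
    $s =~ s/>/&gt;/g;
    $s =~ s/"/&quot;/g;
    return $s;
}

# Build an href value from a path and an ordered list of key/value pairs.
sub build_href {
    my ($path, @pairs) = @_;
    my @parts;
    while (my ($k, $v) = splice(@pairs, 0, 2)) {
        push @parts, uri_escape_component($k) . '=' . uri_escape_component($v);
    }
    return html_attr_escape($path . '?' . join('&', @parts));
}

print build_href('/index.php', keywords => 'PERL;HTML', lang => 'en'), "\n";
```

A link written this way carries "&amp;lang=en" in the markup, so a parser has no bare "&lang" to mistake for an entity, and a literal semicolon in a value travels as "%3B".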

