Skip Menu |
 
rt.cpan.org will be shut down on March 1st, 2021.

This queue is for tickets about the XML-LibXML CPAN distribution.

Report information
The Basics
Id: 53532
Status: open
Priority: 0/
Queue: XML-LibXML

People
Owner: Nobody in particular
Requestors: IKEGAMI [...] cpan.org
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in:
  • 1.66
  • 1.70
Fixed in: (no value)



MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 657
Download (untitled) / with headers
text/plain 657b
appendTextChild is sensitive to the internal format Perl is using to store the string containg its second argument. Show quoted text
---------- BEGIN CODE ---------- use strict; use warnings; use XML::LibXML qw( ); my $s = "abcd\x{f6}efgh"; if ($ARGV[0]) { # One internal format utf8::downgrade($s); } else { # Other internal format utf8::upgrade($s); } XML::LibXML::Element->new('foo')->appendTextChild('Node', $s);
---------- END CODE ----------
---------- BEGIN OUTPUT ----------
>perl test.pl 0
>perl test.pl 1
error : xmlEncodeEntitiesReentrant : char out of range
---------- END OUTPUT ---------- Interestingly, the resulting XML is identical.
MIME-Version: 1.0
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Type: multipart/mixed; boundary="----------=_1265122463-18223-60"
Message-ID: <rt-3.8.HEAD-18223-1265122463-404.53532-0-0 [...] rt.cpan.org>
X-RT-Original-Encoding: utf-8
Content-Length: 0
Content-Disposition: inline
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 532
Download (untitled) / with headers
text/plain 532b
More seriously, for other Latin-1 characters, the output gets mangled and no warning is emitted. # perl test.pl f6 0 <?xml version="1.0" encoding="UTF-8"?> <foo><Node>abcdöefgh</Node></foo> # perl test.pl f6 1 error : xmlEncodeEntitiesReentrant : char out of range <?xml version="1.0" encoding="UTF-8"?> <foo><Node>abcdöefgh</Node></foo> # perl test.pl e1 0 <?xml version="1.0" encoding="UTF-8"?> <foo><Node>abcdáefgh</Node></foo> # perl test.pl e1 1 <?xml version="1.0" encoding="UTF-8"?> <foo><Node>abcdᥦgh</Node></foo>
MIME-Version: 1.0
Subject: test.pl
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Type: text/x-perl-script; name="test.pl"
Content-Disposition: inline; filename="test.pl"
Content-Transfer-Encoding: binary
Content-Length: 449
Download test.pl
text/x-perl 449b
#!/usr/bin/perl use strict; use warnings; use XML::LibXML qw( ); my $s = "abcd" . chr(hex($ARGV[0])) . "efgh"; if ($ARGV[1]) { # One internal format utf8::downgrade($s); } else { # Other internal format utf8::upgrade($s); } my $e = XML::LibXML::Element->new('foo'); $e->appendTextChild('Node', $s); my $d = XML::LibXML::Document->new("1.0","UTF-8"); $d->setDocumentElement($e); $e->appendTextChild('Node', $s); print $d->toString();
MIME-Version: 1.0
In-Reply-To: <rt-3.8.HEAD-18223-1265122463-404.53532-0-0 [...] rt.cpan.org>
X-Mailer: MIME-tools 5.427 (Entity 5.427)
References: <rt-3.8.HEAD-18223-1265122463-404.53532-0-0 [...] rt.cpan.org>
Content-Type: multipart/mixed; boundary="----------=_1265123131-18076-69"
Message-ID: <rt-3.8.HEAD-18076-1265123131-696.53532-0-0 [...] rt.cpan.org>
X-RT-Original-Encoding: utf-8
Content-Length: 0
Content-Disposition: inline
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 1349
Download (untitled) / with headers
text/plain 1.3k
Please disregard previous comment + attachment, it confuses two issues that were to be reported seperately. test1.pl demonstrates how some Latin-1 input characters get silently corrupted: # perl test1.pl f6 0 <?xml version="1.0" encoding="UTF-8"?> <foo><Node>abcdöefgh</Node></foo> # perl test1.pl f6 1 error : xmlEncodeEntitiesReentrant : char out of range <?xml version="1.0" encoding="UTF-8"?> <foo><Node>abcdöefgh</Node></foo> # perl test1.pl e1 0 <?xml version="1.0" encoding="UTF-8"?> <foo><Node>abcdáefgh</Node></foo> # perl test1.pl e1 1 <?xml version="1.0" encoding="UTF-8"?> <foo><Node>abcdᥦgh</Node></foo> test2.pl demonstrates how invalid utf-8 output can result: #perl /srv/scratch/test2.pl f6 0 <?xml version="1.0" encoding="UTF-8"?> <foo><Node>abcdöefgh</Node></foo> #perl /srv/scratch/test2.pl f6 1 <?xml version="1.0" encoding="UTF-8"?> <foo><Node>abcd?efgh</Node></foo> #perl /srv/scratch/test2.pl f6 1 | hexdump -Cv 00000000 3c 3f 78 6d 6c 20 76 65 72 73 69 6f 6e 3d 22 31 |<?xml version="1| 00000010 2e 30 22 20 65 6e 63 6f 64 69 6e 67 3d 22 55 54 |.0" encoding="UT| 00000020 46 2d 38 22 3f 3e 0a 3c 66 6f 6f 3e 3c 4e 6f 64 |F-8"?>.<foo><Nod| 00000030 65 3e 61 62 63 64 f6 65 66 67 68 3c 2f 4e 6f 64 |e>abcd.efgh</Nod| 00000040 65 3e 3c 2f 66 6f 6f 3e 0a |e></foo>.| 00000049
MIME-Version: 1.0
Subject: test2.pl
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Type: text/x-perl-script; name="test2.pl"
Content-Disposition: inline; filename="test2.pl"
Content-Transfer-Encoding: binary
Content-Length: 416
Download test2.pl
text/x-perl 416b
#!/usr/bin/perl use strict; use warnings; use XML::LibXML qw( ); my $s = "abcd" . chr(hex($ARGV[0])) . "efgh"; if ($ARGV[1]) { # One internal format utf8::downgrade($s); } else { # Other internal format utf8::upgrade($s); } my $e = XML::LibXML::Element->new('foo'); my $d = XML::LibXML::Document->new("1.0","UTF-8"); $d->setDocumentElement($e); $e->appendTextChild('Node', $s); print $d->toString();
Subject: test1.pl
MIME-Version: 1.0
Content-Type: text/x-perl-script; name="test1.pl"
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline; filename="test1.pl"
Content-Transfer-Encoding: binary
Content-Length: 415
Download test1.pl
text/x-perl 415b
#!/usr/bin/perl use strict; use warnings; use XML::LibXML qw( ); my $s = "abcd" . chr(hex($ARGV[0])) . "efgh"; if ($ARGV[1]) { # One internal format utf8::downgrade($s); } else { # Other internal format utf8::upgrade($s); } my $e = XML::LibXML::Element->new('foo'); $e->appendTextChild('Node', $s); my $d = XML::LibXML::Document->new("1.0","UTF-8"); $d->setDocumentElement($e); print $d->toString();
MIME-Version: 1.0
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline
Content-Type: text/plain; charset="UTF-8"
Message-ID: <rt-3.8.HEAD-2356-1290715764-1007.53532-0-0 [...] rt.cpan.org>
Content-Transfer-Encoding: binary
From: daniel.frett [...] ccci.org
X-RT-Original-Encoding: utf-8
Content-Length: 2043
Download (untitled) / with headers
text/plain 1.9k
This is actually documented behavior, in the XML::LibXML documentation under the "ENCODINGS SUPPORT IN XML::LIBXML" section it says: 3. DOM methods also accept binary strings in the original encoding of the document to which the node belongs (UTF-8 is assumed if the node is not attached to any document). Exploiting this feature is NOT RECOMMENDED since it is considered a bad practice. my $doc = XML::LibXML:Document->new('1.0','iso-8859-2'); my $text = $doc->createTextNode($some_latin2_encoded_byte_string); # WORKS, BUT NOT RECOMMENDED! I personally would prefer if XML::LibXML would by default treat all strings as character strings and automatically convert them, especially since the documented functionality is considered "bad practice". But because it is documented behavior, this will break backwards compatability. Maybe an alternative to help move people away from the bad practice of setting byte strings directly and encourage the use of character strings would be to add a global flag that can turn on/off treating all strings as character strings. Then through a deprecation schedule over several versions, start off with the option disabled by default, then enable it option by default, and (maybe?) at some point remove the byte string support. On Fri Jan 08 23:47:13 2010, ikegami wrote: Show quoted text
> appendTextChild is sensitive to the internal format Perl is using to > store the string containg its second argument. > > ---------- BEGIN CODE ---------- > use strict; > use warnings; > > use XML::LibXML qw( ); > > my $s = "abcd\x{f6}efgh"; > > if ($ARGV[0]) { > # One internal format > utf8::downgrade($s); > } else { > # Other internal format > utf8::upgrade($s); > } > > XML::LibXML::Element->new('foo')->appendTextChild('Node', $s); > ---------- END CODE ---------- > > ---------- BEGIN OUTPUT ----------
> >perl test.pl 0
>
> >perl test.pl 1
> error : xmlEncodeEntitiesReentrant : char out of range > ---------- END OUTPUT ---------- > > Interestingly, the resulting XML is identical.
From ikesuke [...] gmail.com Thu Nov 25 16: 49:11 2010
MIME-Version: 1.0
X-Spam-Status: No, score=-5.269 tagged_above=-99.9 required=10 tests=[AWL=0.841, BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_HI=-5, SPF_NEUTRAL=0.779, T_TO_NO_BRKTS_FREEMAIL=0.01] autolearn=ham
In-Reply-To: <rt-3.8.HEAD-2356-1290715764-115.53532-6-0 [...] rt.cpan.org>
X-Spam-Flag: NO
References: <RT-Ticket-53532 [...] rt.cpan.org> <rt-3.8.HEAD-2356-1290715764-115.53532-6-0 [...] rt.cpan.org>
X-Virus-Scanned: Debian amavisd-new at bestpractical.com
Message-ID: <AANLkTikG7tHx5Ork2k0fiint6ygb3psHYfaoQtQuHuY+ [...] mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
X-RT-Original-Encoding: utf-8
X-Spam-Score: -5.269
Authentication-Results: hipster.bestpractical.com (amavisd-new); dkim=pass header.i= [...] gmail.com
Authentication-Results: hipster.bestpractical.com (amavisd-new); domainkeys=pass header.sender=ikesuke [...] gmail.com
Received: from localhost (localhost [127.0.0.1]) by hipster.bestpractical.com (Postfix) with ESMTP id 20578241217 for <cpan-bug+XML-LibXML [...] hipster.bestpractical.com>; Thu, 25 Nov 2010 16:49:11 -0500 (EST)
Received: from hipster.bestpractical.com ([127.0.0.1]) by localhost (hipster.bestpractical.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id v7qfMDIPY6+H for <cpan-bug+XML-LibXML [...] hipster.bestpractical.com>; Thu, 25 Nov 2010 16:49:09 -0500 (EST)
Received: from la.mx.develooper.com (x1.develooper.com [207.171.7.70]) by hipster.bestpractical.com (Postfix) with SMTP id 3459024120F for <bug-XML-LibXML [...] rt.cpan.org>; Thu, 25 Nov 2010 16:49:09 -0500 (EST)
Received: (qmail 6708 invoked by uid 103); 25 Nov 2010 21:49:08 -0000
Received: from x16.dev (10.0.100.26) by x1.dev with QMQP; 25 Nov 2010 21:49:08 -0000
Received: from mail-fx0-f50.google.com (HELO mail-fx0-f50.google.com) (209.85.161.50) by 16.mx.develooper.com (qpsmtpd/0.80) with ESMTP; Thu, 25 Nov 2010 13:49:07 -0800
Received: by fxm14 with SMTP id 14so1139431fxm.9 for <bug-XML-LibXML [...] rt.cpan.org>; Thu, 25 Nov 2010 13:49:03 -0800 (PST)
Received: by 10.223.87.3 with SMTP id u3mr1288531fal.131.1290721743595; Thu, 25 Nov 2010 13:49:03 -0800 (PST)
Received: by 10.223.78.206 with HTTP; Thu, 25 Nov 2010 13:49:03 -0800 (PST)
Delivered-To: cpan-bug+XML-LibXML [...] hipster.bestpractical.com
Subject: Re: [rt.cpan.org #53532] appendTextChild is sensitive to internal format of text
Domainkey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type :content-transfer-encoding; b=aLpM4EbAlPztL2u6+ADp4t6Bf7LQZGifiKizfujnL6892WiLVJXS7rw1a+Sk+7NZHN 1Qvk1mCphwx4xlaicB66VEjU200zxVsptLH0ynYwFj4Ds+jRCd3VC8JrMJSS1S42xxZB YyYY4M4EO9BZ6/O7z6yHp9fvlu/ZZmNH6fWxM=
Return-Path: <ikesuke [...] gmail.com>
Dkim-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:sender:received :in-reply-to:references:date:x-google-sender-auth:message-id:subject :from:to:content-type:content-transfer-encoding; bh=ZFG8Gtg636Qsbaq9+ZE4rDtlLpDKyqbUkAainlKDPeI=; b=vd19gsMYOb090tplOutSLrAbUr7ux74X8Smfi8D/DzUR80A0p9ik7oq0Z6jTiE9gwU MeysPwHEj+GnA6CYr8kc4YRwMZyUYUk/sVE0rwqZniDxUAGsi8Pp+8yGFyx2hbySxgkb lgT4DXVLSn3C4WT39y7btp0uPXCZJd9s2hix4=
X-Spam-Check-BY: 16.mx.develooper.com
X-Original-To: cpan-bug+XML-LibXML [...] hipster.bestpractical.com
X-RT-Mail-Extension: xml-libxml
X-Google-Sender-Auth: HSvPsSNi44Eu4ArtWV7tUCSuiys
Sender: ikesuke [...] gmail.com
Date: Thu, 25 Nov 2010 16:49:03 -0500
X-Spam-Level:
To: bug-XML-LibXML [...] rt.cpan.org
Content-Transfer-Encoding: quoted-printable
From: Eric Brine <ikegami [...] adaelis.com>
RT-Message-ID: <rt-3.8.HEAD-2361-1290721751-1819.53532-0-0 [...] rt.cpan.org>
Content-Length: 637
Download (untitled) / with headers
text/plain 637b
On Thu, Nov 25, 2010 at 3:09 PM, Daniel Frett via RT <bug-XML-LibXML@rt.cpan.org> wrote: Show quoted text
> especially since the documented functionality is considered > "bad practice".
It's bad practice because it makes the following two statements not equivalent even though "é" is Unicode character E9: use utf8; ->createTextNode("abcdéf"); use utf8; ->createTextNode("abcd\x{E9}f"); Workaround for Perl 5.12+: use utf8; ->createTextNode("abcd\N{U+E9}f"); Show quoted text
> But because it is documented behavior, this will break backwards > compatability.
That's unfortunate. Your solution sounds perfectly reasonable, though. Thanks for having a look, Eric
MIME-Version: 1.0
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline
Content-Type: text/plain; charset="UTF-8"
Message-ID: <rt-3.8.HEAD-12439-1311192717-146.53532-0-0 [...] rt.cpan.org>
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 67
Resolving as rejected per the discussion. Regards, -- Shlomi Fish
From ikesuke [...] gmail.com Wed Jul 20 17: 17:32 2011
CC: IKEGAMI [...] cpan.org
MIME-Version: 1.0
X-Spam-Status: No, score=-5.044 tagged_above=-99.9 required=10 tests=[AWL=1.065, BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-5, SPF_NEUTRAL=0.779, T_TO_NO_BRKTS_FREEMAIL=0.01] autolearn=ham
In-Reply-To: <rt-3.8.HEAD-12439-1311192718-728.53532-6-0 [...] rt.cpan.org>
X-Spam-Flag: NO
References: <RT-Ticket-53532 [...] rt.cpan.org> <rt-3.8.HEAD-12439-1311192718-728.53532-6-0 [...] rt.cpan.org>
X-Virus-Checked: Checked by ClamAV on 16.mx.develooper.com
X-Virus-Scanned: Debian amavisd-new at bestpractical.com
Message-ID: <CALJW-qFPnh3QZwQSv9AoCAwD3Sb14P9Xu2tH1BbAbJkwESAB2w [...] mail.gmail.com>
Content-Type: multipart/alternative; boundary=bcaec55407cc1f6b1704a886c299
X-Spam-Score: -5.044
Authentication-Results: hipster.bestpractical.com (amavisd-new); dkim=pass header.i= [...] gmail.com
Received: from localhost (localhost [127.0.0.1]) by hipster.bestpractical.com (Postfix) with ESMTP id C6CCC2405A6 for <cpan-bug+XML-LibXML [...] hipster.bestpractical.com>; Wed, 20 Jul 2011 17:17:32 -0400 (EDT)
Received: from hipster.bestpractical.com ([127.0.0.1]) by localhost (hipster.bestpractical.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id VfHzNZU6GdSr for <cpan-bug+XML-LibXML [...] hipster.bestpractical.com>; Wed, 20 Jul 2011 17:17:31 -0400 (EDT)
Received: from la.mx.develooper.com (x1.develooper.com [207.171.7.70]) by hipster.bestpractical.com (Postfix) with SMTP id 401FF24058D for <bug-XML-LibXML [...] rt.cpan.org>; Wed, 20 Jul 2011 17:17:31 -0400 (EDT)
Received: (qmail 27343 invoked by uid 103); 20 Jul 2011 21:17:30 -0000
Received: from x16.dev (10.0.100.26) by x1.dev with QMQP; 20 Jul 2011 21:17:30 -0000
Received: from mail-fx0-f41.google.com (HELO mail-fx0-f41.google.com) (209.85.161.41) by 16.mx.develooper.com (qpsmtpd/0.80/v0.80-19-gf52d165) with ESMTP; Wed, 20 Jul 2011 14:17:27 -0700
Received: by fxg9 with SMTP id 9so2386430fxg.0 for <bug-XML-LibXML [...] rt.cpan.org>; Wed, 20 Jul 2011 14:17:23 -0700 (PDT)
Received: by 10.205.80.3 with SMTP id zs3mr2464066bkb.273.1311196642765; Wed, 20 Jul 2011 14:17:22 -0700 (PDT)
Received: by 10.204.42.129 with HTTP; Wed, 20 Jul 2011 14:17:22 -0700 (PDT)
Delivered-To: cpan-bug+XML-LibXML [...] hipster.bestpractical.com
Subject: Re: [rt.cpan.org #53532] appendTextChild is sensitive to internal format of text
Return-Path: <ikesuke [...] gmail.com>
Dkim-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type; bh=xAJZt0K21iAqh0ovWjgtHg0kG8/U+dPhBuLfthQaq+Q=; b=dEjg0N9uxZtgSoBbeS+PacWTuREocanlZ8jYJk6pwWpREpQ6BNg5/xNaaUVKLX6RZa cx8t6HBBZN2XZq7Ywjsa8MgP2sk4NApyj0kAi7ZmAm4SZCXM3lNd6lK2MiqkyIKANrsI Ql4p9H2Q6qaBVX4yP/q65kYP09vqjsv6guyp4=
X-Spam-Check-BY: 16.mx.develooper.com
X-Original-To: cpan-bug+XML-LibXML [...] hipster.bestpractical.com
X-RT-Mail-Extension: xml-libxml
X-Google-Sender-Auth: dev8BtrTpO4NkQFybGkxz_nYHKg
Sender: ikesuke [...] gmail.com
Date: Wed, 20 Jul 2011 17:17:22 -0400
X-Spam-Level:
To: bug-XML-LibXML [...] rt.cpan.org
From: Eric Brine <ikegami [...] adaelis.com>
RT-Message-ID: <rt-3.8.HEAD-12439-1311196653-820.53532-0-0 [...] rt.cpan.org>
Content-Length: 0
Content-Type: text/plain; charset="utf-8"
X-RT-Original-Encoding: utf-8
Content-Length: 361
Download (untitled) / with headers
text/plain 361b
On Wed, Jul 20, 2011 at 4:11 PM, Shlomi Fish via RT < bug-XML-LibXML@rt.cpan.org> wrote: Show quoted text
> <URL: https://rt.cpan.org/Ticket/Display.html?id=53532 > > > Resolving as rejected per the discussion. >
The *ticket* shouldn't be rejected. The only thing that was rejected was *changing createTextNode*. That does not preclude adding a function that actually works.
Content-Type: text/html; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
X-RT-Original-Encoding: utf-8
Content-Length: 747
From ikesuke [...] gmail.com Wed Jul 20 17: 26:29 2011
CC: IKEGAMI [...] cpan.org
MIME-Version: 1.0
X-Spam-Status: No, score=-5.1 tagged_above=-99.9 required=10 tests=[AWL=1.009, BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-5, SPF_NEUTRAL=0.779, T_TO_NO_BRKTS_FREEMAIL=0.01] autolearn=ham
In-Reply-To: <CALJW-qFPnh3QZwQSv9AoCAwD3Sb14P9Xu2tH1BbAbJkwESAB2w [...] mail.gmail.com>
X-Spam-Flag: NO
References: <RT-Ticket-53532 [...] rt.cpan.org> <rt-3.8.HEAD-12439-1311192718-728.53532-6-0 [...] rt.cpan.org> <CALJW-qFPnh3QZwQSv9AoCAwD3Sb14P9Xu2tH1BbAbJkwESAB2w [...] mail.gmail.com>
X-Virus-Checked: Checked by ClamAV on 16.mx.develooper.com
X-Virus-Scanned: Debian amavisd-new at bestpractical.com
Message-ID: <CALJW-qHcCoo0ZL9eTo34-w2Bo0mKaKctr=CPxk+6xmd1CBwGrw [...] mail.gmail.com>
Content-Type: multipart/alternative; boundary=0016e6d99e93e2ce5504a886e17f
X-Spam-Score: -5.1
Authentication-Results: hipster.bestpractical.com (amavisd-new); dkim=pass header.i= [...] gmail.com
Received: from localhost (localhost [127.0.0.1]) by hipster.bestpractical.com (Postfix) with ESMTP id 43F992405A6 for <cpan-bug+XML-LibXML [...] hipster.bestpractical.com>; Wed, 20 Jul 2011 17:26:29 -0400 (EDT)
Received: from hipster.bestpractical.com ([127.0.0.1]) by localhost (hipster.bestpractical.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id hKheqXqXZMTg for <cpan-bug+XML-LibXML [...] hipster.bestpractical.com>; Wed, 20 Jul 2011 17:26:25 -0400 (EDT)
Received: from la.mx.develooper.com (x1.develooper.com [207.171.7.70]) by hipster.bestpractical.com (Postfix) with SMTP id EC47324058D for <bug-XML-LibXML [...] rt.cpan.org>; Wed, 20 Jul 2011 17:26:24 -0400 (EDT)
Received: (qmail 28172 invoked by uid 103); 20 Jul 2011 21:26:24 -0000
Received: from x16.dev (10.0.100.26) by x1.dev with QMQP; 20 Jul 2011 21:26:24 -0000
Received: from mail-ew0-f50.google.com (HELO mail-ew0-f50.google.com) (209.85.215.50) by 16.mx.develooper.com (qpsmtpd/0.80/v0.80-19-gf52d165) with ESMTP; Wed, 20 Jul 2011 14:26:19 -0700
Received: by ewy10 with SMTP id 10so821881ewy.9 for <bug-XML-LibXML [...] rt.cpan.org>; Wed, 20 Jul 2011 14:26:16 -0700 (PDT)
Received: by 10.204.83.217 with SMTP id g25mr1755687bkl.221.1311197175663; Wed, 20 Jul 2011 14:26:15 -0700 (PDT)
Received: by 10.204.42.129 with HTTP; Wed, 20 Jul 2011 14:26:15 -0700 (PDT)
Delivered-To: cpan-bug+XML-LibXML [...] hipster.bestpractical.com
Subject: Re: [rt.cpan.org #53532] appendTextChild is sensitive to internal format of text
Return-Path: <ikesuke [...] gmail.com>
Dkim-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type; bh=D/sQYWJy/RrhdHFndu/mas/cOT3O/5lv6l8GjyjOpiw=; b=A+jWyX9EkiZ0oOYh02KS7kylTcBhXGRkcSKB9m1Jp3Cdq4hYGU4wDwdRVXAlhefey/ 7NyjF8llddhJFJtghejs3Pwu2cbzrsQtQtAIzksvFawLFZ2zn6KlOC9kF8Jp0YRXWbJW pwFO09dIUnnC1KtabxYSPHaqThP3Qw6x4OqjU=
X-Spam-Check-BY: 16.mx.develooper.com
X-Original-To: cpan-bug+XML-LibXML [...] hipster.bestpractical.com
X-RT-Mail-Extension: xml-libxml
X-Google-Sender-Auth: cN6FLOzE-FlYSlizejFWQNPJPYQ
Sender: ikesuke [...] gmail.com
Date: Wed, 20 Jul 2011 17:26:15 -0400
X-Spam-Level:
To: bug-XML-LibXML [...] rt.cpan.org
From: Eric Brine <ikegami [...] adaelis.com>
RT-Message-ID: <rt-3.8.HEAD-12441-1311197189-1644.53532-0-0 [...] rt.cpan.org>
Content-Length: 0
Content-Type: text/plain; charset="utf-8"
X-RT-Original-Encoding: utf-8
Content-Length: 732
Download (untitled) / with headers
text/plain 732b
On Wed, Jul 20, 2011 at 5:17 PM, Eric Brine <ikegami@adaelis.com> wrote: Show quoted text
> On Wed, Jul 20, 2011 at 4:11 PM, Shlomi Fish via RT < > bug-XML-LibXML@rt.cpan.org> wrote: >
>> <URL: https://rt.cpan.org/Ticket/Display.html?id=53532 > >> >> Resolving as rejected per the discussion. >>
> > The *ticket* shouldn't be rejected. The only thing that was rejected was > *changing createTextNode*. That does not preclude adding a function that > actually works. >
Possible ways forward: - Adding a note in the documentation that utf8::upgrade must be called on text passed to createTextNode. - Adding new function that always treats the argument as text. - Adding a config option that makes createTextNode always treat the argument as text.
Content-Type: text/html; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
X-RT-Original-Encoding: utf-8
Content-Length: 1409


This service is sponsored and maintained by Best Practical Solutions and runs on Perl.org infrastructure.

Please report any issues with rt.cpan.org to rt-cpan-admin@bestpractical.com.