Skip Menu |
 
rt.cpan.org will be shut down on March 1st, 2021.

This queue is for tickets about the Data-Visitor CPAN distribution.

Report information
The Basics
Id: 103101
Status: resolved
Priority: 0/
Queue: Data-Visitor

People
Owner: DOY [...] cpan.org
Requestors: zefram [...] fysh.org
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in: (no value)
Fixed in: 0.31



MIME-Version: 1.0
X-Spam-Status: No, score=-2.283 tagged_above=-99.9 required=10 tests=[AWL=-0.372, BAYES_00=-1.9, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01] autolearn=ham
Content-Disposition: inline
X-Spam-Flag: NO
content-type: text/plain; charset="utf-8"
Message-ID: <20150325113915.GE11492 [...] fysh.org>
X-Virus-Scanned: Debian amavisd-new at bestpractical.com
X-Spam-Score: -2.283
Received: from localhost (localhost [127.0.0.1]) by hipster.bestpractical.com (Postfix) with ESMTP id 593832403E5 for <cpan-bug+Data-Visitor [...] hipster.bestpractical.com>; Wed, 25 Mar 2015 07:39:26 -0400 (EDT)
Received: from hipster.bestpractical.com ([127.0.0.1]) by localhost (hipster.bestpractical.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id bkAEyMvlhu5n for <cpan-bug+Data-Visitor [...] hipster.bestpractical.com>; Wed, 25 Mar 2015 07:39:25 -0400 (EDT)
Received: from la.mx.develooper.com (x1.develooper.com [207.171.7.70]) by hipster.bestpractical.com (Postfix) with SMTP id 05D922400AC for <bug-Data-Visitor [...] rt.cpan.org>; Wed, 25 Mar 2015 07:39:24 -0400 (EDT)
Received: (qmail 20084 invoked by alias); 25 Mar 2015 11:39:24 -0000
Received: from river.fysh.org (HELO river.fysh.org) (5.135.154.127) by la.mx.develooper.com (qpsmtpd/0.28) with ESMTP; Wed, 25 Mar 2015 04:39:22 -0700
Received: from zefram by river.fysh.org with local (Exim 4.80 #2 (Debian)) id 1YajeV-0007aO-Fm; Wed, 25 Mar 2015 11:39:15 +0000
Delivered-To: cpan-bug+Data-Visitor [...] hipster.bestpractical.com
Subject: encoding error at configure time
Return-Path: <zefram [...] fysh.org>
X-RT-Mail-Extension: data-visitor
X-Original-To: cpan-bug+Data-Visitor [...] hipster.bestpractical.com
X-Spam-Check-BY: la.mx.develooper.com
Date: Wed, 25 Mar 2015 11:39:15 +0000
X-Spam-Level:
To: bug-Data-Visitor [...] rt.cpan.org
From: Zefram <zefram [...] fysh.org>
X-RT-Original-Encoding: ascii
X-RT-Interface: Email
Content-Length: 2301
Download (untitled) / with headers
text/plain 2.2k
$ /opt/perl-5.20.2/bin/perl Makefile.PL Checking if your kit is complete... Looks good Generating a Unix-style Makefile Writing Makefile for Data::Visitor "\x{00c3}" does not map to ascii at /opt/perl-5.20.2/lib/site_perl/5.20.2/ExtUtils/MakeMaker.pm line 1183. "\x{00bc}" does not map to ascii at /opt/perl-5.20.2/lib/site_perl/5.20.2/ExtUtils/MakeMaker.pm line 1183. "\x{00c3}" does not map to ascii at /opt/perl-5.20.2/lib/site_perl/5.20.2/ExtUtils/MakeMaker.pm line 1183. "\x{00bc}" does not map to ascii at /opt/perl-5.20.2/lib/site_perl/5.20.2/ExtUtils/MakeMaker.pm line 1183. "\x{00c3}" does not map to ascii at /opt/perl-5.20.2/lib/site_perl/5.20.2/ExtUtils/MakeMaker.pm line 1183. "\x{00bc}" does not map to ascii at /opt/perl-5.20.2/lib/site_perl/5.20.2/ExtUtils/MakeMaker.pm line 1183. "\x{00c3}" does not map to ascii at /opt/perl-5.20.2/lib/site_perl/5.20.2/ExtUtils/MakeMaker.pm line 1183. "\x{00bc}" does not map to ascii at /opt/perl-5.20.2/lib/site_perl/5.20.2/ExtUtils/MakeMaker.pm line 1183. Writing MYMETA.yml and MYMETA.json The problematic characters here arise from the author name (Marcel's surname) specified in Makefile.PL. There are two problems here. Firstly, EU:MM is trying to write this non-ASCII text to files for which it declares :encoding(locale), so there is no guarantee that non-ASCII text is encodable. I don't know who to blame between EU:MM (using an encoding that's inadequate for the job) and Data-Visitor's Makefile.PL (including non-portable characters in the data that's to be stored by a low-powered mechanism). Secondly, the string that Makefile.PL passes to EU:MM doesn't correctly represent Marcel's name, it's *the UTF-8 encoding of* Marcel's name. The Makefile.PL source specifies the UTF-8 octets in the form of octal escapes. The result is that EU:MM attempts to encode text that's already encoded. On my system, apparently after giving the "does not map to ascii" warning it falls back to using UTF-8 encoding, and so the MYMETA files end up with the name double-UTF8-encoded (octets 0xc3 0x83 0xc2 0xbc in the place of the single non-ASCII character of the real name). I'm guessing that whoever packages this distro has a Latin-1 locale, and so ends up with no warnings and a single layer of UTF-8 encoding in the MYMETA files. -zefram
MIME-Version: 1.0
In-Reply-To: <20150325113915.GE11492 [...] fysh.org>
X-Mailer: MIME-tools 5.504 (Entity 5.504)
Content-Disposition: inline
X-RT-Interface: Web
References: <20150325113915.GE11492 [...] fysh.org>
Content-Type: text/plain; charset="utf-8"
Message-ID: <rt-4.0.18-17693-1427302388-662.103101-0-0 [...] rt.cpan.org>
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
X-RT-Encrypt: 0
X-RT-Sign: 0
Content-Length: 370
Download (untitled) / with headers
text/plain 370b
On 2015-03-25 04:39:27, zefram@fysh.org wrote: Show quoted text
> The problematic characters here arise from the author name (Marcel's > surname) specified in Makefile.PL. There are two problems here.
Vanilla (and modern) EUMM does support wide characters properly, so given that Module::Install will likely never be fully fixed, it would be sensible to move off of it. I shall do so.
MIME-Version: 1.0
X-Spam-Status: No, score=-4.266 tagged_above=-99.9 required=10 tests=[AWL=1.645, BAYES_00=-1.9, FROM_OUR_RT=-4, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01] autolearn=ham
In-Reply-To: <rt-4.0.18-17693-1427302389-1736.103101-6-0 [...] rt.cpan.org>
Content-Disposition: inline
X-Spam-Flag: NO
X-RT-Interface: API
References: <RT-Ticket-103101 [...] rt.cpan.org> <20150325113915.GE11492 [...] fysh.org> <rt-4.0.18-17693-1427302389-1736.103101-6-0 [...] rt.cpan.org>
X-Virus-Scanned: Debian amavisd-new at bestpractical.com
Message-ID: <20150325171950.GA6924 [...] fysh.org>
content-type: text/plain; charset="utf-8"
X-RT-Original-Encoding: utf-8
X-Spam-Score: -4.266
Received: from localhost (localhost [127.0.0.1]) by hipster.bestpractical.com (Postfix) with ESMTP id 50C35240346 for <cpan-bug+Data-Visitor [...] hipster.bestpractical.com>; Wed, 25 Mar 2015 13:20:01 -0400 (EDT)
Received: from hipster.bestpractical.com ([127.0.0.1]) by localhost (hipster.bestpractical.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 3TFIhMmbxp9v for <cpan-bug+Data-Visitor [...] hipster.bestpractical.com>; Wed, 25 Mar 2015 13:20:00 -0400 (EDT)
Received: from la.mx.develooper.com (x1.develooper.com [207.171.7.70]) by hipster.bestpractical.com (Postfix) with SMTP id 220C32401F3 for <bug-Data-Visitor [...] rt.cpan.org>; Wed, 25 Mar 2015 13:19:59 -0400 (EDT)
Received: (qmail 16619 invoked by alias); 25 Mar 2015 17:19:59 -0000
Received: from river.fysh.org (HELO river.fysh.org) (5.135.154.127) by la.mx.develooper.com (qpsmtpd/0.28) with ESMTP; Wed, 25 Mar 2015 10:19:57 -0700
Received: from zefram by river.fysh.org with local (Exim 4.80 #2 (Debian)) id 1Yaoy6-0001wC-OB; Wed, 25 Mar 2015 17:19:50 +0000
Delivered-To: cpan-bug+Data-Visitor [...] hipster.bestpractical.com
Subject: Re: [rt.cpan.org #103101] encoding error at configure time
Return-Path: <zefram [...] fysh.org>
X-Spam-Check-BY: la.mx.develooper.com
X-Original-To: cpan-bug+Data-Visitor [...] hipster.bestpractical.com
X-RT-Mail-Extension: data-visitor
Date: Wed, 25 Mar 2015 17:19:50 +0000
X-Spam-Level:
To: Karen Etheridge via RT <bug-Data-Visitor [...] rt.cpan.org>
From: Zefram <zefram [...] fysh.org>
RT-Message-ID: <rt-4.0.18-17693-1427304002-1145.103101-0-0 [...] rt.cpan.org>
Content-Length: 480
Download (untitled) / with headers
text/plain 480b
Karen Etheridge via RT wrote: Show quoted text
>Vanilla (and modern) EUMM does support wide characters properly, so >given that Module::Install will likely never be fully fixed, it would >be sensible to move off of it. I shall do so.
I don't see where Module::Install gets involved in Data-Visitor. That looks like it's using EU:MM directly. Did you get confused with [rt.cpan.org #103112], which is about the same problem showing up via Module::Install in Module-Install's test suite? -zefram
MIME-Version: 1.0
In-Reply-To: <rt-4.0.18-17693-1427304002-1145.103101-0-0 [...] rt.cpan.org>
X-Mailer: MIME-tools 5.504 (Entity 5.504)
Content-Disposition: inline
X-RT-Interface: Web
References: <RT-Ticket-103101 [...] rt.cpan.org> <20150325113915.GE11492 [...] fysh.org> <rt-4.0.18-17693-1427302389-1736.103101-6-0 [...] rt.cpan.org> <20150325171950.GA6924 [...] fysh.org> <rt-4.0.18-17693-1427304002-1145.103101-0-0 [...] rt.cpan.org>
Content-Type: text/plain; charset="utf-8"
Message-ID: <rt-4.0.18-14032-1427316145-288.103101-0-0 [...] rt.cpan.org>
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
X-RT-Encrypt: 0
X-RT-Sign: 0
Content-Length: 593
Download (untitled) / with headers
text/plain 593b
On 2015-03-25 10:20:02, zefram@fysh.org wrote: Show quoted text
> Karen Etheridge via RT wrote:
> >Vanilla (and modern) EUMM does support wide characters properly, so > >given that Module::Install will likely never be fully fixed, it would > >be sensible to move off of it. I shall do so.
> > I don't see where Module::Install gets involved in Data-Visitor. > That looks like it's using EU:MM directly. Did you get confused with > [rt.cpan.org #103112], which is about the same problem showing up via > Module::Install in Module-Install's test suite?
Looks like it! Sorry, too many tabs open at one time :D
MIME-Version: 1.0
In-Reply-To: <20150325113915.GE11492 [...] fysh.org>
X-Mailer: MIME-tools 5.504 (Entity 5.504)
Content-Disposition: inline
X-RT-Interface: Web
References: <20150325113915.GE11492 [...] fysh.org>
Content-Type: text/plain; charset="utf-8"
Message-ID: <rt-4.0.18-14796-1596482964-1612.103101-0-0 [...] rt.cpan.org>
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
X-RT-Encrypt: 0
X-RT-Sign: 0
Content-Length: 45
This is now solved, with a package retooling.


This service is sponsored and maintained by Best Practical Solutions and runs on Perl.org infrastructure.

Please report any issues with rt.cpan.org to rt-cpan-admin@bestpractical.com.