Skip Menu |
 

This queue is for tickets about the Encode CPAN distribution.

Report information
The Basics
Id: 59668
Status: resolved
Priority: 0/
Queue: Encode

People
Owner: Nobody in particular
Requestors: tom [...] trh.demon.nl
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in: (no value)
Fixed in: (no value)



From tom [...] trh.demon.nl Fri Jul 23 20: 46:29 2010
MIME-Version: 1.0 (Apple Message framework v753.1)
X-Spam-Status: No, score=-10.599 tagged_above=-99.9 required=10 tests=[BAYES_00=-2.599, RCVD_IN_DNSWL_HI=-8] autolearn=ham
X-Mailer: Apple Mail (2.753.1)
X-Spam-Flag: NO
content-type: text/plain; charset="utf-8"; delsp="yes"; format="flowed"
Message-ID: <1E9E5F4E-3F8F-4665-9299-644B6963F9E7 [...] trh.demon.nl>
X-Virus-Scanned: Debian amavisd-new at bestpractical.com
X-Virus-Scanned: by XS4ALL Virus Scanner
X-Spam-Score: -10.599
Received: from localhost (localhost [127.0.0.1]) by hipster.bestpractical.com (Postfix) with ESMTP id 0B54E240AEB for <cpan-bug+Encode [...] hipster.bestpractical.com>; Fri, 23 Jul 2010 20:46:29 -0400 (EDT)
Received: from hipster.bestpractical.com ([127.0.0.1]) by localhost (hipster.bestpractical.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id DyW3olNjVcAg for <cpan-bug+Encode [...] hipster.bestpractical.com>; Fri, 23 Jul 2010 20:46:26 -0400 (EDT)
Received: from la.mx.develooper.com (x1.develooper.com [207.171.7.70]) by hipster.bestpractical.com (Postfix) with SMTP id 70241240AE4 for <bug-Encode [...] rt.cpan.org>; Fri, 23 Jul 2010 20:46:26 -0400 (EDT)
Received: (qmail 27701 invoked by uid 103); 24 Jul 2010 00:48:17 -0000
Received: from x16.dev (10.0.100.26) by x1.dev with QMQP; 24 Jul 2010 00:48:17 -0000
Received: from smtp-vbr6.xs4all.nl (HELO smtp-vbr6.xs4all.nl) (194.109.24.26) by 16.mx.develooper.com (qpsmtpd/0.80) with ESMTP; Fri, 23 Jul 2010 17:48:15 -0700
Received: from [192.168.2.243] (trh.demon.nl [83.160.194.18]) by smtp-vbr6.xs4all.nl (8.13.8/8.13.8) with ESMTP id o6O0mBQR071525 for <bug-Encode [...] rt.cpan.org>; Sat, 24 Jul 2010 02:48:11 +0200 (CEST) (envelope-from tom [...] trh.demon.nl)
Delivered-To: cpan-bug+Encode [...] hipster.bestpractical.com
Subject: nextstep encoding is broken - missing ASCII characters
Return-Path: <tom [...] trh.demon.nl>
X-RT-Mail-Extension: encode
X-Original-To: cpan-bug+Encode [...] hipster.bestpractical.com
X-Spam-Check-BY: 16.mx.develooper.com
Date: Sat, 24 Jul 2010 02:47:46 +0200
X-Spam-Level:
To: bug-Encode [...] rt.cpan.org
Content-Transfer-Encoding: 7bit
From: Tom Hageman <tom [...] trh.demon.nl>
X-RT-Original-Encoding: US-ASCII
Content-Length: 2944
Download (untitled) / with headers
text/plain 2.8k
Hi Dan, When I tried to use the nextstep encoding (in order to convert some old NeXT-era .rtf files to Mac OS X) it failed with a lot of error messages like: $./recode.pl ~/@basil/Mailboxes/Dev/EnhanceMail.mbox/NEXTTOYOU- Enhancemen.attach/index.rtf test-recode-NEXTTOYOU-Enhancemen.rtf nextstep "\x7B" does not map to Unicode at ./recode.pl line 8. nextstep "\x5C" does not map to Unicode at ./recode.pl line 8. nextstep "\x72" does not map to Unicode at ./recode.pl line 8. nextstep "\x74" does not map to Unicode at ./recode.pl line 8. nextstep "\x66" does not map to Unicode at ./recode.pl line 8. nextstep "\x30" does not map to Unicode at ./recode.pl line 8. [...] This suggests that the nextstep encoding does not define a mapping for the regular ASCII characters (range \x20 - \x7f). A quick look at http://cpansearch.perl.org/src/DANKOGAI/Encode-2.39/ ucm/nextstep.ucm ($Id: nextstep.ucm,v 2.0 2004/05/16 20:55:28 dankogai Exp $) seems to confirm that hunch. Tested with Perl 5.8.6 (shipped with Mac OS X 10.4.11 PPC) Encode- Show quoted text
>VERSION 2.08, Encode::Byte->VERSION 2.00, but apparently still an
issue in the most recent version, see above. test script (straightforward adaptation from example in the POD): === recode.pl === #! /usr/bin/perl # via PerlIO my $infile = $ARGV[0]; my $outfile = $ARGV[1]; open my $in, "<:encoding(nextstep)", $infile or die; open my $out, ">:encoding(MacRoman)", $outfile or die; while(<$in>) { print $out $_; } === === input === {\rtf0\ansi{\fonttbl\f0\fswiss Helvetica;} \margl120 \margr120 {\colortbl;\red204\green0\blue17;\red0\green0\blue0;\red88\green88 \blue88;\red85\green19\blue134;} \pard\tx533\tx1067\tx1601\tx2135\tx2668\tx3202\tx3736\tx4270\tx4803 \tx5337\f0\b0\i0\ulnone\fs24\fc0\cf0 Hi Tom,\ [...] === === output === \x7B\x5C\x72\x74\x66\x30\x5C\x61\x6E\x73\x69\x7B\x5C\x66\x6F\x6E\x74 \x74\x62\x6C\x5C\x66\x30\x5C\x66\x73\x77\x69\x73\x73\x20\x48\x65\x6C \x76\x65\x74\x69\x63\x61\x3B\x7D \x5C\x6D\x61\x72\x67\x6C\x31\x32\x30 \x5C\x6D\x61\x72\x67\x72\x31\x32\x30 \x7B\x5C\x63\x6F\x6C\x6F\x72\x74\x62\x6C\x3B\x5C\x72\x65\x64\x32\x30 \x34\x5C\x67\x72\x65\x65\x6E\x30\x5C\x62\x6C\x75\x65\x31\x37\x3B\x5C \x72\x65\x64\x30\x5C\x67\x72\x65\x65\x6E\x30\x5C\x62\x6C\x75\x65\x30 \x3B\x5C\x72\x65\x64\x38\x38\x5C\x67\x72\x65\x65\x6E\x38\x38\x5C\x62 \x6C\x75\x65\x38\x38\x3B\x5C\x72\x65\x64\x38\x35\x5C\x67\x72\x65\x65 \x6E\x31\x39\x5C\x62\x6C\x75\x65\x31\x33\x34\x3B\x7D \x5C\x70\x61\x72\x64\x5C\x74\x78\x35\x33\x33\x5C\x74\x78\x31\x30\x36 \x37\x5C\x74\x78\x31\x36\x30\x31\x5C\x74\x78\x32\x31\x33\x35\x5C\x74 \x78\x32\x36\x36\x38\x5C\x74\x78\x33\x32\x30\x32\x5C\x74\x78\x33\x37 \x33\x36\x5C\x74\x78\x34\x32\x37\x30\x5C\x74\x78\x34\x38\x30\x33\x5C \x74\x78\x35\x33\x33\x37\x5C\x66\x30\x5C\x62\x30\x5C\x69\x30\x5C\x75 \x6C\x6E\x6F\x6E\x65\x5C\x66\x73\x32\x34\x5C\x66\x63\x30\x5C\x63\x66 \x30\x20\x48\x69\x20\x54\x6F\x6D\x2C\x5C [...] === Best regards, Tom Hageman.
MIME-Version: 1.0
In-Reply-To: <1E9E5F4E-3F8F-4665-9299-644B6963F9E7 [...] trh.demon.nl>
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline
References: <1E9E5F4E-3F8F-4665-9299-644B6963F9E7 [...] trh.demon.nl>
Content-Type: text/plain; charset="UTF-8"
Message-ID: <rt-3.8.HEAD-24889-1284833038-450.59668-0-0 [...] rt.cpan.org>
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 3229
Download (untitled) / with headers
text/plain 3.1k
Bingo. Fixed in my repository. 2.40 coming soon. Dan the Maintainer Thereof On Fri Jul 23 20:48:21 2010, tom@trh.demon.nl wrote: Show quoted text
> Hi Dan, > > When I tried to use the nextstep encoding (in order to convert some > old NeXT-era .rtf files to Mac OS X) it failed with a lot of error > messages like: > > $./recode.pl ~/@basil/Mailboxes/Dev/EnhanceMail.mbox/NEXTTOYOU- > Enhancemen.attach/index.rtf test-recode-NEXTTOYOU-Enhancemen.rtf > nextstep "\x7B" does not map to Unicode at ./recode.pl line 8. > nextstep "\x5C" does not map to Unicode at ./recode.pl line 8. > nextstep "\x72" does not map to Unicode at ./recode.pl line 8. > nextstep "\x74" does not map to Unicode at ./recode.pl line 8. > nextstep "\x66" does not map to Unicode at ./recode.pl line 8. > nextstep "\x30" does not map to Unicode at ./recode.pl line 8. > [...] > > This suggests that the nextstep encoding does not define a mapping > for the regular ASCII characters (range \x20 - \x7f). > > A quick look at http://cpansearch.perl.org/src/DANKOGAI/Encode-2.39/ > ucm/nextstep.ucm ($Id: nextstep.ucm,v 2.0 2004/05/16 20:55:28 > dankogai Exp $) seems to confirm that hunch. > > Tested with Perl 5.8.6 (shipped with Mac OS X 10.4.11 PPC) Encode-
> >VERSION 2.08, Encode::Byte->VERSION 2.00, but apparently still an
> issue in the most recent version, see above. > > test script (straightforward adaptation from example in the POD): > === recode.pl === > #! /usr/bin/perl > # via PerlIO > > my $infile = $ARGV[0]; > my $outfile = $ARGV[1]; > open my $in, "<:encoding(nextstep)", $infile or die; > open my $out, ">:encoding(MacRoman)", $outfile or die; > while(<$in>) { print $out $_; } > === > > === input === > {\rtf0\ansi{\fonttbl\f0\fswiss Helvetica;} > \margl120 > \margr120 > {\colortbl;\red204\green0\blue17;\red0\green0\blue0;\red88\green88 > \blue88;\red85\green19\blue134;} > \pard\tx533\tx1067\tx1601\tx2135\tx2668\tx3202\tx3736\tx4270\tx4803 > \tx5337\f0\b0\i0\ulnone\fs24\fc0\cf0 Hi Tom,\ > [...] > === > > === output === > \x7B\x5C\x72\x74\x66\x30\x5C\x61\x6E\x73\x69\x7B\x5C\x66\x6F\x6E\x74 > \x74\x62\x6C\x5C\x66\x30\x5C\x66\x73\x77\x69\x73\x73\x20\x48\x65\x6C > \x76\x65\x74\x69\x63\x61\x3B\x7D > \x5C\x6D\x61\x72\x67\x6C\x31\x32\x30 > \x5C\x6D\x61\x72\x67\x72\x31\x32\x30 > \x7B\x5C\x63\x6F\x6C\x6F\x72\x74\x62\x6C\x3B\x5C\x72\x65\x64\x32\x30 > \x34\x5C\x67\x72\x65\x65\x6E\x30\x5C\x62\x6C\x75\x65\x31\x37\x3B\x5C > \x72\x65\x64\x30\x5C\x67\x72\x65\x65\x6E\x30\x5C\x62\x6C\x75\x65\x30 > \x3B\x5C\x72\x65\x64\x38\x38\x5C\x67\x72\x65\x65\x6E\x38\x38\x5C\x62 > \x6C\x75\x65\x38\x38\x3B\x5C\x72\x65\x64\x38\x35\x5C\x67\x72\x65\x65 > \x6E\x31\x39\x5C\x62\x6C\x75\x65\x31\x33\x34\x3B\x7D > \x5C\x70\x61\x72\x64\x5C\x74\x78\x35\x33\x33\x5C\x74\x78\x31\x30\x36 > \x37\x5C\x74\x78\x31\x36\x30\x31\x5C\x74\x78\x32\x31\x33\x35\x5C\x74 > \x78\x32\x36\x36\x38\x5C\x74\x78\x33\x32\x30\x32\x5C\x74\x78\x33\x37 > \x33\x36\x5C\x74\x78\x34\x32\x37\x30\x5C\x74\x78\x34\x38\x30\x33\x5C > \x74\x78\x35\x33\x33\x37\x5C\x66\x30\x5C\x62\x30\x5C\x69\x30\x5C\x75 > \x6C\x6E\x6F\x6E\x65\x5C\x66\x73\x32\x34\x5C\x66\x63\x30\x5C\x63\x66 > \x30\x20\x48\x69\x20\x54\x6F\x6D\x2C\x5C > [...] > === > > Best regards, > Tom Hageman. >


This service is sponsored and maintained by Best Practical Solutions and runs on Perl.org infrastructure.

Please report any issues with rt.cpan.org to rt-cpan-admin@bestpractical.com.