Skip Menu |
 

This queue is for tickets about the Encode CPAN distribution.

Report information
The Basics
Id: 41167
Status: open
Priority: 0/
Queue: Encode

People
Owner: Nobody in particular
Requestors: MARKOV [...] cpan.org
Cc:
AdminCc:

Bug Information
Severity: Normal
Broken in: 2.26
Fixed in: (no value)



MIME-Version: 1.0
X-Mailer: MIME-tools 5.426 (Entity 5.426)
Charset: utf8
X-RT-Original-Encoding: utf-8
Content-Type: multipart/mixed; boundary="----------=_1227476524-3816-12"
Content-Length: 0
Content-Type: text/plain
Content-Disposition: inline
Content-Transfer-Encoding: binary
Content-Length: 601
Download (untitled) / with headers
text/plain 601b
IANA defines a long list of character-sets which can be used for any internet communication, and to my surprise many of this list are not understood by Encode::Alias. My MailBox email processing automatically translates incoming message bodies into "Perl's internal representation", and this sometimes fails. Not too often, but sometimes. The IANA list can be found at http://www.iana.org/assignments/character-sets The attached script shows the missing aliases. Maybe you can add at least the simpelest definitions. Especially UNICODE-1-1-UTF-7, which simply is UTF7 and used in some email RFCs.
Subject: check-iana-charsets
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----------=_1227476422-3816-10"
X-Mailer: MIME-tools 5.426 (Entity 5.426)
Charset: utf8
Content-Length: 0
Content-Type: text/plain
Content-Disposition: inline
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: iso-8859-1
Content-Length: 0
Content-Type: application/octet-stream; name="check-iana-charsets"
Content-Disposition: inline; filename="check-iana-charsets"
Content-Transfer-Encoding: base64
Content-Length: 0
Subject: check-iana-output
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----------=_1227476524-3816-11"
X-Mailer: MIME-tools 5.426 (Entity 5.426)
Charset: utf8
Content-Length: 0
Content-Type: text/plain
Content-Disposition: inline
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: iso-8859-1
Content-Length: 0
Content-Type: application/octet-stream; name="check-iana-output"
Content-Disposition: inline; filename="check-iana-output"
Content-Transfer-Encoding: base64
Content-Length: 0
MIME-Version: 1.0
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline
Charset: utf8
Content-Type: text/plain
Message-ID: <rt-3.6.HEAD-29719-1232576703-1275.41167-0-0 [...] rt.cpan.org>
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 774
Download (untitled) / with headers
text/plain 774b
Seems like the attachment is missing. Would you send it to me again? Dan the Maintainer Thereof On Sun Nov 23 16:42:10 2008, MARKOV wrote: Show quoted text
> IANA defines a long list of character-sets which can be used for any > internet communication, and to my surprise many of this list are not > understood by Encode::Alias. > > My MailBox email processing automatically translates incoming message > bodies into "Perl's internal representation", and this sometimes fails. > Not too often, but sometimes. > > The IANA list can be found at > http://www.iana.org/assignments/character-sets > > The attached script shows the missing aliases. Maybe you can add at > least the simpelest definitions. Especially UNICODE-1-1-UTF-7, which > simply is UTF7 and used in some email RFCs.
MIME-Version: 1.0
X-Spam-Status: No, hits=0.0 required=8.0 tests=
In-Reply-To: <rt-3.6.HEAD-29719-1232576703-1275.41167-6-0 [...] rt.cpan.org>
Content-Disposition: inline
References: <RT-Ticket-41167 [...] rt.cpan.org> <rt-3.6.HEAD-29719-1232576703-1275.41167-6-0 [...] rt.cpan.org>
X-Virus-Checked: Checked by ClamAV on 16.mx.develooper.com
Message-ID: <20090121222948.GA30636 [...] earth.overmeer.net>
Content-Type: multipart/mixed; boundary="Qxx1br4bt0+wmkIi"
Received: from la.mx.develooper.com (x1.develooper.com [63.251.223.170]) by diesel.bestpractical.com (Postfix) with SMTP id 006074D8040 for <bug-Encode [...] rt.cpan.org>; Wed, 21 Jan 2009 17:30:09 -0500 (EST)
Received: (qmail 19495 invoked by uid 103); 21 Jan 2009 22:30:09 -0000
Received: from x16.dev (10.0.100.26) by x1.dev with QMQP; 21 Jan 2009 22:30:09 -0000
Received: from mail.overmeer.net (HELO earth.overmeer.net) (194.109.195.227) by 16.mx.develooper.com (qpsmtpd/0.43rc1) with ESMTP; Wed, 21 Jan 2009 14:29:55 -0800
Received: by earth.overmeer.net (Postfix, from userid 500) id 39C609AA26; Wed, 21 Jan 2009 23:29:48 +0100 (CET)
Delivered-To: cpan-bug+Encode [...] diesel.bestpractical.com
Subject: Re: [rt.cpan.org #41167]
User-Agent: Mutt/1.5.9i
Return-Path: <markov [...] overmeer.net>
X-Spam-Check-BY: 16.mx.develooper.com
X-Original-To: bug-Encode [...] rt.cpan.org
Date: Wed, 21 Jan 2009 23:29:48 +0100
X-Spam-Level: *
To: DANKOGAI via RT <bug-Encode [...] rt.cpan.org>
From: Mark Overmeer <mark [...] overmeer.net>
RT-Message-ID: <rt-3.6.HEAD-29719-1232577030-1953.41167-0-0 [...] rt.cpan.org>
Content-Length: 0
Content-Type: text/plain; charset="utf-8"
Content-Disposition: inline
X-RT-Original-Encoding: utf-8
Content-Length: 1293
Download (untitled) / with headers
text/plain 1.2k
* DANKOGAI via RT (bug-Encode@rt.cpan.org) [090121 22:25]: Show quoted text
> <URL: https://rt.cpan.org/Ticket/Display.html?id=41167 > > Seems like the attachment is missing. Would you send it to me again?
Here it is. Hope it works this time. Show quoted text
> Dan the Maintainer Thereof > > On Sun Nov 23 16:42:10 2008, MARKOV wrote:
> > IANA defines a long list of character-sets which can be used for any > > internet communication, and to my surprise many of this list are not > > understood by Encode::Alias. > > > > My MailBox email processing automatically translates incoming message > > bodies into "Perl's internal representation", and this sometimes fails. > > Not too often, but sometimes. > > > > The IANA list can be found at > > http://www.iana.org/assignments/character-sets > > > > The attached script shows the missing aliases. Maybe you can add at > > least the simpelest definitions. Especially UNICODE-1-1-UTF-7, which > > simply is UTF7 and used in some email RFCs.
-- Regards, MarkOv ------------------------------------------------------------------------ Mark Overmeer MSc MARKOV Solutions Mark@Overmeer.net solutions@overmeer.net http://Mark.Overmeer.net http://solutions.overmeer.net
Content-Type: text/plain; charset="utf-8"
content-disposition: attachment; filename="check-iana-charsets"
X-RT-Original-Encoding: utf-8
Content-Length: 493

Message body is not shown because sender requested not to inline it.

Content-Type: text/plain; charset="utf-8"
content-disposition: attachment; filename="check-iana-output"
X-RT-Original-Encoding: utf-8
Content-Length: 56308
Download check-iana-output
text/plain 54.9k

Message body is not shown because sender requested not to inline it.

MIME-Version: 1.0
X-Spam-Status: No, hits=0.0 required=8.0 tests=
In-Reply-To: <rt-3.6.HEAD-25318-1247449390-1322.41167-10-0 [...] rt.cpan.org>
Content-Disposition: inline
References: <rt-3.6.HEAD-25318-1247449390-1322.41167-10-0 [...] rt.cpan.org>
Message-ID: <20090713071842.GN10519 [...] moon.overmeer.net>
Content-Type: text/plain; charset="utf-8"
X-RT-Original-Encoding: utf-8
Received: from la.mx.develooper.com (x1.develooper.com [207.171.7.70]) by diesel.bestpractical.com (Postfix) with SMTP id CEA6B4D8016 for <bug-Encode [...] rt.cpan.org>; Mon, 13 Jul 2009 03:18:53 -0400 (EDT)
Received: (qmail 30228 invoked by uid 103); 13 Jul 2009 07:18:52 -0000
Received: from x16.dev (10.0.100.26) by x1.dev with QMQP; 13 Jul 2009 07:18:52 -0000
Received: from mail.overmeer.net (HELO moon.overmeer.net) (194.109.195.227) by 16.mx.develooper.com (qpsmtpd/0.80) with ESMTP; Mon, 13 Jul 2009 00:18:47 -0700
Received: by moon.overmeer.net (Postfix, from userid 1000) id 96221C1B8; Mon, 13 Jul 2009 09:18:42 +0200 (CEST)
Delivered-To: cpan-bug+Encode [...] diesel.bestpractical.com
Subject: Re: [rt.cpan.org #41167] Resolved:
User-Agent: Mutt/1.5.19 (2009-01-05)
Return-Path: <markov [...] overmeer.net>
X-Spam-Check-BY: 16.mx.develooper.com
X-Original-To: bug-Encode [...] rt.cpan.org
Date: Mon, 13 Jul 2009 09:18:42 +0200
X-Spam-Level: *
To: DANKOGAI via RT <bug-Encode [...] rt.cpan.org>
From: Mark Overmeer <website [...] craneveer.nl>
RT-Message-ID: <rt-3.6.HEAD-25318-1247469542-1937.41167-0-0 [...] rt.cpan.org>
Content-Length: 774
Download (untitled) / with headers
text/plain 774b
* DANKOGAI via RT (bug-Encode@rt.cpan.org) [090713 01:43]: Show quoted text
> <URL: https://rt.cpan.org/Ticket/Display.html?id=41167 > > According to our records, your request has been resolved. If you have any > further questions or concerns, please respond to this message.
My script still finds 702 character-sets (mostly special abbreviations) which are defined by IANA but not understood in the Encode release 2.35. Most of them are simple to add aliases. -- Regards, MarkOv ------------------------------------------------------------------------ Mark Overmeer MSc MARKOV Solutions Mark@Overmeer.net solutions@overmeer.net http://Mark.Overmeer.net http://solutions.overmeer.net


This service is sponsored and maintained by Best Practical Solutions and runs on Perl.org infrastructure.

Please report any issues with rt.cpan.org to rt-cpan-admin@bestpractical.com.