Skip Menu |
 

This queue is for tickets about the Encode CPAN distribution.

Report information
The Basics
Id: 28780
Status: resolved
Priority: 0/
Queue: Encode

People
Owner: DANKOGAI [...] cpan.org
Requestors: aletellier [...] epsiloninteractive.com
Cc:
AdminCc:

Bug Information
Severity: Critical
Broken in: 2.23
Fixed in: (no value)



Subject: issue converting EUC-KR to ISO-2022-KR
MIME-Version: 1.0
X-Mailer: MIME-tools 5.418 (Entity 5.418)
X-RT-Original-Encoding: utf-8
Content-Type: multipart/mixed; boundary="----------=_1186728731-28001-1"
Content-Length: 0
Content-Type: text/plain; charset="utf8"
Content-Disposition: inline
Content-Transfer-Encoding: binary
Content-Length: 272
Download (untitled) / with headers
text/plain 272b
Converting from EUC-KR to ISO-2022-KR always adds $)C at the beginning of the line, . $)C24788876 992268 aldo letellier >H3gGO=J can reproduce using piconv too piconv -f EUC-KR -t ISO-2022-KR korean_euc_kr.txt > korean_iso_2022_kr.piconv.txt Attached files Aldo
Subject: korean_euc_kr.txt
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----------=_1186728544-27973-11"
X-Mailer: MIME-tools 5.418 (Entity 5.418)
Content-Length: 0
Content-Type: text/plain; charset="utf8"
Content-Disposition: inline
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 0
Content-Type: text/plain; charset="utf-8"; name="korean_euc_kr.txt"
Content-Disposition: inline; filename="korean_euc_kr.txt"
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: iso-8859-1
Content-Length: 328
Download korean_euc_kr.txt
text/plain 328b
PROFILE_ID PLIST_ID PROFILE_KEY EMAIL_ADDR KOREAN_TEXT 24788877 992268 schoi@office.bigfootinteractive.com schoi@office.bigfootinteractive.com ¾È³çÇÏ½Ê 24788878 992268 1 sshama@bigfootinteractive.com 24788876 992268 aletellier@office.bigfootinteractive.com aletellier@office.bigfootinteractive.com ¾È³çÇϽÊ
Subject: korean_iso_2022_kr.piconv.txt
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----------=_1186728730-28001-0"
X-Mailer: MIME-tools 5.418 (Entity 5.418)
Content-Length: 0
Content-Type: text/plain; charset="utf8"
Content-Disposition: inline
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 0
Content-Type: text/plain; charset="utf-8"; name="korean_iso_2022_kr.piconv.txt"
Content-Disposition: inline; filename="korean_iso_2022_kr.piconv.txt"
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8-strict
Content-Length: 332
$)CPROFILE_ID PLIST_ID PROFILE_KEY EMAIL_ADDR KOREAN_TEXT $)C24788877 992268 schoi@office.bigfootinteractive.com schoi@office.bigfootinteractive.com >H3gGO=J $)C24788878 992268 1 sshama@bigfootinteractive.com $)C24788876 992268 aletellier@office.bigfootinteractive.com aletellier@office.bigfootinteractive.com >H3gGO=J
MIME-Version: 1.0
X-Mailer: MIME-tools 5.418 (Entity 5.418)
Content-Disposition: inline
Message-Id: <rt-3.6.HEAD-9097-1187239470-218.28780-0-0 [...] rt.cpan.org>
Content-Type: text/plain; charset="utf8"
Content-Transfer-Encoding: binary
From: aletellier [...] epsiloninteractive.com
X-RT-Original-Encoding: utf-8
X-RT-Original-Encoding: utf-8
Content-Length: 711
Download (untitled) / with headers
text/plain 711b
Narrow the issue to this "Encode::KR::2022_KR::euc_iso" function call, the line: substr( $$r_str, 0, 0 ) = $ESC{'2022_KR'}; Inserts the escape sequence in the beg of the string? It should insert in the beg of each korean word. ..... sub euc_iso { no warnings qw(uninitialized); my $r_str = shift; substr( $$r_str, 0, 0 ) = $ESC{'2022_KR'}; # put the designator at the beg. $$r_str =~ s{ # move KS X 1001 characters in GR to GL ($RE{EUC_C}+) # and enclose them with SO and SI }{ my $str = $1; $str =~ tr/\xA1-\xFE/\x21-\x7E/; "\x0e" . $str . "\x0f"; }geox; $$r_str; }


This service is sponsored and maintained by Best Practical Solutions and runs on Perl.org infrastructure.

Please report any issues with rt.cpan.org to rt-cpan-admin@bestpractical.com.