Subject: | Worksheet names with html characters not getting decoded |
Date: | Wed, 21 Oct 2009 18:37:05 +0530 |
To: | bug-Spreadsheet-XLSX@rt.cpan.org |
From: | mrinal roushan <mrinalroushan@gmail.com> |
Hi,
I am using Spreadsheet::XLSX 0.12 and I encountered the following problem :
If the xlsx file being processed has html characters(&,<,>,'') in the worksheet names, this causes Spreadsheet::XLSX to dump the sheet names without decoding the characters to their original form and this appears as &,<,>," in the dumped output. Therefore I added a line after 119 in Spreadsheet::XLSX as
$sheet -> {Name} = Spreadsheet::XLSX::Utility2007::unescape_HTML($sheet->{Name});
This then results in decoding the sheet name correctly.
I am using the following test script to get the dumped output :
my $converter = Text::IConv->new("utf-8","windows-1251");
my $ref = Spreadsheet::XLSX->new("file_name",$converter);
print Dumper $ref;
--
Regards
Mrinal Roushan