Skip Menu |
 

This queue is for tickets about the Regexp-Grammars CPAN distribution.

Report information
The Basics
Id: 54819
Status: rejected
Priority: 0/
Queue: Regexp-Grammars

People
Owner: Nobody in particular
Requestors: dolmen [...] cpan.org
Cc:
AdminCc:

Bug Information
Severity: Critical
Broken in: 1.002
Fixed in: (no value)



Subject: Perl memory corruption and crash on Ubuntu 9.10 amd64
MIME-Version: 1.0
X-Mailer: MIME-tools 5.427 (Entity 5.427)
X-RT-Original-Encoding: utf-8
Content-Type: multipart/mixed; boundary="----------=_1266658208-14820-209"
Content-Length: 0
Content-Type: text/plain; charset="UTF-8"
Content-Disposition: inline
Content-Transfer-Encoding: binary
Content-Length: 320
Download (untitled) / with headers
text/plain 320b
The attached program (make-Changes-rss.pl) makes Perl corrupt its memory and finally crash. The fully detailed report, including coredump, is on Ubuntu LaunchPad: https://bugs.launchpad.net/ubuntu/+source/perl/+bug/524817 The R::G demo files works fine on this platform. -- Olivier Mengué - http://o.mengue.free.fr/
Subject: Changes
MIME-Version: 1.0
Content-Type: application/octet-stream; name="Changes"
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline; filename="Changes"
Content-Transfer-Encoding: base64
Content-Length: 2821
Download Changes
application/octet-stream 2.7k

Message body not shown because it is not plain text.

Subject: make-Changes-rss.pl
MIME-Version: 1.0
Content-Type: application/x-perl; name="make-Changes-rss.pl"
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline; filename="make-Changes-rss.pl"
Content-Transfer-Encoding: base64
Content-Length: 1210
Download make-Changes-rss.pl
text/x-perl 1.1k
#!/usr/bin/perl use utf8; use v5.10.0; use strict; my $dist = 'POE-Component-Schedule'; my $Changes = do { open my $fd, "<:utf8", "Changes"; local $/ = undef; <$fd>; }; #print $Changes; my $parser = do { use Regexp::Grammars; qr{ \A \R* <Header> <[Release]>+ (?: # Should be at end of input... # \s* \Z #| # If not, report the fact but don't fail... <warning: Expected end-of-input> <warning: (?{ "Extra junk at index $INDEX: $CONTEXT" })> ) <token: Header> \S \V+ ( \R \V+ )* <token: Release> \R+ <debug: on> <Version> \h+ <Date> (?: <Time> <TimeZone>? )? \h+ <Author_Id> \h+ \( <Author_Name> \) \R <[Changes]>+ <debug: off> <token: Version> \d+\.\d+(_\d+)? <token: Date> \d{4}-\d{2}-\d{2} <token: Time> \d{2}:\d{2} <token: TimeZone> Z | [+-]\d{2}:\d{2} <token: Author_Id> \w+ <token: Author_Name> [^)]+ <token: Changes> (?:\t|[ ]{8}) (\S\V*) \R (?: (?:\t[ ]{2}|\h{10}) \h* (\V+) \R )* }xms; }; $Changes =~ $parser or die "format invalide !\n"; use YAML 0.71 (); print YAML::Dump(\%/), "\n"; use Data::Dumper; print Dumper(\%/), "\n";
MIME-Version: 1.0
X-Spam-Status: No, hits=0.0 required=8.0 tests=DKIM_SIGNED,DKIM_VALID,FREEMAIL_FROM,SPF_PASS
In-Reply-To: <rt-3.8.HEAD-14820-1266658234-182.54819-4-0 [...] rt.cpan.org>
References: <RT-Ticket-54819 [...] rt.cpan.org> <rt-3.8.HEAD-14820-1266658234-182.54819-4-0 [...] rt.cpan.org>
Message-ID: <832f158a1002200310p4c000abdmc851be18999fa8e2 [...] mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
X-RT-Original-Encoding: utf-8
Received: from la.mx.develooper.com (x1.develooper.com [207.171.7.70]) by diesel.bestpractical.com (Postfix) with SMTP id 478354D801D for <bug-Regexp-Grammars [...] rt.cpan.org>; Sat, 20 Feb 2010 11:38:43 -0500 (EST)
Received: (qmail 24804 invoked by uid 103); 20 Feb 2010 11:12:02 -0000
Received: from x16.dev (10.0.100.26) by x1.dev with QMQP; 20 Feb 2010 11:12:02 -0000
Received: from mail-vw0-f50.google.com (HELO mail-vw0-f50.google.com) (209.85.212.50) by 16.mx.develooper.com (qpsmtpd/0.80) with ESMTP; Sat, 20 Feb 2010 03:11:27 -0800
Received: by vws5 with SMTP id 5so217215vws.9 for <bug-Regexp-Grammars [...] rt.cpan.org>; Sat, 20 Feb 2010 03:11:16 -0800 (PST)
Received: by 10.220.126.157 with SMTP id c29mr8128858vcs.92.1266664276089; Sat, 20 Feb 2010 03:11:16 -0800 (PST)
Delivered-To: cpan-bug+Regexp-Grammars [...] diesel.bestpractical.com
Subject: Re: [rt.cpan.org #54819] Perl memory corruption and crash on Ubuntu 9.10 amd64
Domainkey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:from:date :x-google-sender-auth:message-id:subject:to:content-type :content-transfer-encoding; b=Da16B5tZ8fNqwCQEkVIPBz+90pjoy9CujpT4bJG3W2v64ymiBGibp6GzG6PtMQMlMi I98vZ9qc7pruB/o/8LOab+iO+k+byeIypppvSI8P3W8sFs9pmYkwi0V8BiXL8Jky+PGB 7jdYKqKV4xcWmuFu2G1QHe9B9rUQd2YhQHT2Y=
Return-Path: <thoughtstream [...] gmail.com>
Dkim-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:sender:received:in-reply-to :references:from:date:x-google-sender-auth:message-id:subject:to :content-type:content-transfer-encoding; bh=nVwRyZYnoP/8BXQ/dQHbRWKHTTbrrsmS5OyYUtnym24=; b=KTMIVF3OPCjMkCPznXl/DLp9TaMz21EHBLUw701g3rgwxgmPnQEk2aQqxWfU1Vnmy6 dYdBiL7RpMZ4hbRcmiXcdOdPMhokltSyHzD3NLFF3HtKZee4idOap96a9BJUgONKdYyl uNZ/EvAvcfZ9qh9ukdzPLjwX1I2EwH3GIBEiY=
X-Spam-Check-BY: 16.mx.develooper.com
X-Original-To: bug-Regexp-Grammars [...] rt.cpan.org
X-Google-Sender-Auth: 81d56f5afd484f97
Date: Sat, 20 Feb 2010 22:10:56 +1100
Sender: thoughtstream [...] gmail.com
X-Spam-Level: *
To: bug-Regexp-Grammars [...] rt.cpan.org
Content-Transfer-Encoding: quoted-printable
From: Damian Conway <damian [...] conway.org>
RT-Message-ID: <rt-3.8.HEAD-14824-1266683937-1992.54819-0-0 [...] rt.cpan.org>
Content-Length: 500
Download (untitled) / with headers
text/plain 500b
2010/2/20 Olivier 'dolmen' Mengué via RT <bug-Regexp-Grammars@rt.cpan.org>: Show quoted text
> The attached program (make-Changes-rss.pl) makes Perl corrupt its memory > and finally crash.
The problem is not specifically in Regex::Grammars, but seems to be in Perl's Unicode support itself. Replacing the '<:utf8' in the open command with '<' resolves the issue. Debugging then indicates that your <Time> rule is missing the leading 'T', and that your grammar works perfectly once that rule is corrected. Damian
MIME-Version: 1.0
In-Reply-To: <rt-3.8.HEAD-14824-1266683937-1992.54819-0-0 [...] rt.cpan.org>
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline
References: <RT-Ticket-54819 [...] rt.cpan.org> <rt-3.8.HEAD-14820-1266658234-182.54819-4-0 [...] rt.cpan.org> <832f158a1002200310p4c000abdmc851be18999fa8e2 [...] mail.gmail.com> <rt-3.8.HEAD-14824-1266683937-1992.54819-0-0 [...] rt.cpan.org>
Content-Type: text/plain; charset="UTF-8"
Message-ID: <rt-3.8.HEAD-12689-1268480427-158.54819-0-0 [...] rt.cpan.org>
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 378
Download (untitled) / with headers
text/plain 378b
Le Sam 20 Fév 2010 11:38:59, damian@conway.org a écrit : Show quoted text
> The problem is not specifically in Regex::Grammars, but seems to be in > Perl's Unicode support itself. Replacing the '<:utf8' in the open > command with '<' resolves the issue.
I've reported the Perl bug here: http://rt.perl.org/rt3//Public/Bug/Display.html?id=72996 -- Olivier Mengué - http://o.mengue.free.fr/
MIME-Version: 1.0
In-Reply-To: <rt-3.8.HEAD-12689-1268480427-158.54819-0-0 [...] rt.cpan.org>
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline
References: <RT-Ticket-54819 [...] rt.cpan.org> <rt-3.8.HEAD-14820-1266658234-182.54819-4-0 [...] rt.cpan.org> <832f158a1002200310p4c000abdmc851be18999fa8e2 [...] mail.gmail.com> <rt-3.8.HEAD-14824-1266683937-1992.54819-0-0 [...] rt.cpan.org> <rt-3.8.HEAD-12689-1268480427-158.54819-0-0 [...] rt.cpan.org>
Content-Type: text/plain; charset="UTF-8"
Message-ID: <rt-3.8.HEAD-12687-1268482702-1762.54819-0-0 [...] rt.cpan.org>
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 476
Download (untitled) / with headers
text/plain 476b
Le Sam 13 Mar 2010 06:40:28, DOLMEN a écrit : Show quoted text
> Le Sam 20 Fév 2010 11:38:59, damian@conway.org a écrit :
> > The problem is not specifically in Regex::Grammars, but seems to be in > > Perl's Unicode support itself. Replacing the '<:utf8' in the open > > command with '<' resolves the issue.
As a workaround, I'm loading the text as ':raw', feed it to Regexp::Grammars, and give the result tree to Data::Recursive::Encode. -- Olivier Mengué - http://o.mengue.free.fr/
MIME-Version: 1.0
X-Mailer: MIME-tools 5.427 (Entity 5.427)
Content-Disposition: inline
Content-Type: text/plain; charset="UTF-8"
Message-ID: <rt-3.8.HEAD-12422-1342528234-1029.54819-0-0 [...] rt.cpan.org>
Content-Transfer-Encoding: binary
X-RT-Original-Encoding: utf-8
Content-Length: 65
Marking as rejected as the bug is not in Regexp::Grammars itself.


This service is sponsored and maintained by Best Practical Solutions and runs on Perl.org infrastructure.

Please report any issues with rt.cpan.org to rt-cpan-admin@bestpractical.com.