[23940] in Perl-Users-Digest

home help back first fref pref prev next nref lref last post

Perl-Users Digest, Issue: 6141 Volume: 10

daemon@ATHENA.MIT.EDU (Perl-Users Digest)
Mon Feb 16 09:10:41 2004

Date: Mon, 16 Feb 2004 06:10:12 -0800 (PST)
From: Perl-Users Digest <Perl-Users-Request@ruby.OCE.ORST.EDU>
To: Perl-Users@ruby.OCE.ORST.EDU (Perl-Users Digest)

Perl-Users Digest           Mon, 16 Feb 2004     Volume: 10 Number: 6141

Today's topics:
    Re: Reformatting text file <nospam@xx.com>
    Re: Reformatting text file <noreply@gunnar.cc>
    Re: Reformatting text file <nospam@xx.com>
    Re: Reformatting text file <gnari@simnet.is>
    Re: Reformatting text file <tadmc@augustmail.com>
    Re: Reformatting text file <tadmc@augustmail.com>
        Regular Expresions & pattern matching (mis)understandin <robert.h.stelmack*REMOVE*@boeing.com>
    Re: Regular Expresions & pattern matching (mis)understa <no@spam.for.me.invalid>
    Re: Regular Expresions & pattern matching (mis)understa <bernard.el-haginDODGE_THIS@lido-tech.net>
    Re: Regular Expresions & pattern matching (mis)understa <noreply@gunnar.cc>
    Re: Regular Expresions & pattern matching (mis)understa <tadmc@augustmail.com>
    Re: Replacing unicode characters <twhu@lucent.com>
    Re: restarting <> (stdio) <spamhole@klaas.ca>
    Re: restarting <> (stdio) <webstuff@fluidic.com>
    Re: restarting <> (stdio) (Charles DeRykus)
    Re: Sending HASH over TCP <troc@pobox.com>
        Statistics for comp.lang.perl.misc <gbacon@hiwaay.net>
    Re: Trent Curry, wsanford@wallysanford.com, and falsely (Anno Siegel)
    Re: Trent Curry, wsanford@wallysanford.com, and falsely <noreply@gunnar.cc>
    Re: Why is Perl losing ground? <ebohlman@earthlink.net>
    Re: Why is Perl losing ground? <ebohlman@earthlink.net>
    Re: XML::Parser Style => Object <mirod@xmltwig.com>
        Digest Administrivia (Last modified: 6 Apr 01) (Perl-Users-Digest Admin)

----------------------------------------------------------------------

Date: Mon, 16 Feb 2004 00:02:15 -0800
From: Eric <nospam@xx.com>
Subject: Re: Reformatting text file
Message-Id: <40307907.BF1D7B02@xx.com>

ARRRRGGGGGHHH...

      $line =~ m/([\w\d\/]+)\,([\w\s]+)/;
      $callsign = $1;
      $location = $2;
      $line = "$callsign \=Q$location\n";

I've been trying everything I can think of but sure need assistance.

Thank you.

Eric


------------------------------

Date: Mon, 16 Feb 2004 09:24:25 +0100
From: Gunnar Hjalmarsson <noreply@gunnar.cc>
Subject: Re: Reformatting text file
Message-Id: <c0pv6t$1a2kep$1@ID-184292.news.uni-berlin.de>

Eric wrote:
> ARRRRGGGGGHHH...
> 
>       $line =~ m/([\w\d\/]+)\,([\w\s]+)/;
>       $callsign = $1;
>       $location = $2;
>       $line = "$callsign \=Q$location\n";
> 
> I've been trying everything I can think of but sure need assistance.

You said that the fields in the input file are quoted, didn't you?

     $line =~ /"([\w\d\/]+)","([\w\s]+)"/;

-- 
Gunnar Hjalmarsson
Email: http://www.gunnar.cc/cgi-bin/contact.pl



------------------------------

Date: Mon, 16 Feb 2004 00:46:22 -0800
From: Eric <nospam@xx.com>
Subject: Re: Reformatting text file
Message-Id: <4030835E.11A87FFA@xx.com>

Gunnar Hjalmarsson wrote:
> 
> You said that the fields in the input file are quoted, didn't you?
> 
>      $line =~ /"([\w\d\/]+)","([\w\s]+)"/;

Gunnar:

THANK YOU - THANK YOU - THANK YOU !!!  I took out the quotes when trying
the other stuff and forgot to put them back in.

Now, it is going on 1AM (local) and I can finally move on to the rest of
what has to be done with all the data.

I hope you have a fantastic day!!!

Regards+++++++++++++

Eric


------------------------------

Date: Mon, 16 Feb 2004 08:46:56 -0000
From: "gnari" <gnari@simnet.is>
Subject: Re: Reformatting text file
Message-Id: <c0q00a$bqv$1@news.simnet.is>

"Eric" <nospam@xx.com> wrote in message news:40303DC6.D35F88AA@xx.com...

[match problem]

> FYI, the "Callsign" field can contain one or more letters, numbers or
> forward slashes.

> The "Location" field may contain only letters OR
> letters & numbers.
what do you mean ?
is that not the same as just letters and numbers?
or do you mean letters or (letters followed by numbers?)

>...
>       $line =~ m/"([\w\d/]+)","([\w\s]+)"/; <--- questionable code

\w matches numbers too , so skip the \d

you are using / as a delimiter in the regexp, so the one in the first
[] set is terminating the expression. you can do either
    m/[\w\d\/]+/
or use another delimiter, like ! :
    m![\w/]+/

so your code might be:

    $line =~ m!"([\w/]+)","([\w]+)"!;

>        the following format:
>                callsign =Qlocation
>
>       $line =~ $1 =Q$2\n;  <- Major brainfreeze here
indeed

$line="$1 =Q$2\n";

gnari






------------------------------

Date: Mon, 16 Feb 2004 07:50:17 -0600
From: Tad McClellan <tadmc@augustmail.com>
Subject: Re: Reformatting text file
Message-Id: <slrnc31ikp.3rn.tadmc@magna.augustmail.com>

Eric <nospam@xx.com> wrote:
> ARRRRGGGGGHHH...
> 
>       $line =~ m/([\w\d\/]+)\,([\w\s]+)/;
>       $callsign = $1;
>       $location = $2;
>       $line = "$callsign \=Q$location\n";
> 
> I've been trying everything I can think of but sure need assistance.


Assistance with what?

What is it supposed to do?

What is it doing instead?


Many readers won't take the time to figure all that out, they'll just
move on to the next article where the author has made it easier for
them to answer the question.


-- 
    Tad McClellan                          SGML consulting
    tadmc@augustmail.com                   Perl programming
    Fort Worth, Texas


------------------------------

Date: Mon, 16 Feb 2004 07:52:12 -0600
From: Tad McClellan <tadmc@augustmail.com>
Subject: Re: Reformatting text file
Message-Id: <slrnc31ioc.3rn.tadmc@magna.augustmail.com>

Eric <nospam@xx.com> wrote:


>       $line =~ m/([\w\d\/]+)\,([\w\s]+)/;
>       $callsign = $1;
>       $location = $2;


You should never use the dollar-digit variables unless you
have first ensured that the match _succeeded_.

If the match fails, the dollar-digit variables will have
"stale" values from some previous match that _did_ succeed.


-- 
    Tad McClellan                          SGML consulting
    tadmc@augustmail.com                   Perl programming
    Fort Worth, Texas


------------------------------

Date: Mon, 16 Feb 2004 12:08:17 GMT
From: "Robert Stelmack" <robert.h.stelmack*REMOVE*@boeing.com>
Subject: Regular Expresions & pattern matching (mis)understanding
Message-Id: <Ht6F1t.G6y@news.boeing.com>

I have embedded some marked text in a large file to indicate chapters and
page numbers.  I want to read that file and strip out the page number to be
displayed with the line of text or following text so as to have a page
number reference for the displayed text.  I have read and reread the perlre
and looked for examples on the Internet, but I must be missing a basic
concept. I also have O'Reill's Programming Perl book, but the examples are
sometimes hard to apply to what I am trying to do. Here is what I tried to
get to work (with various syntax changes):

#!/usr/bin/perl
$_= "He had come to pass his experience along to me - if <page>10</page>I
cared to have it.";
$PAGE =~ /[<page>][0-9][<\/page>]/;
s,[<page>][0-9][</page>],,g;
printf "(p.$PAGE) contains [$_]:\n";

The output I expected was:

(p.10) contains [He had come to pass his experience along to me - if I cared
to have it.]

 ...but instead it displayed:

bash-2.05b$ test.cgi
(p.) contains [He had come to pass his experience along to me - if
<page>10</page>I cared to have it.]:


I really want to get my head around pattern matching and binding since all
my working code looks too much like my old FORTRAN programs.

Cheers,

Bob




------------------------------

Date: Mon, 16 Feb 2004 12:51:22 GMT
From: Nils Petter Vaskinn <no@spam.for.me.invalid>
Subject: Re: Regular Expresions & pattern matching (mis)understanding
Message-Id: <pan.2004.02.16.12.48.04.58185@spam.for.me.invalid>

On Mon, 16 Feb 2004 12:08:17 +0000, Robert Stelmack wrote:


> The output I expected was:
> 
> (p.10) contains [He had come to pass his experience along to me - if I cared
> to have it.]
> 
> ...but instead it displayed:
> 
> bash-2.05b$ test.cgi
> (p.) contains [He had come to pass his experience along to me - if
> <page>10</page>I cared to have it.]:

Completely untested and probably wrong, but it might give you a kick in
the right direction:

/^(.*)<page>([0-9]+)</page>(.*)/;
$page = $2;
$rest = $1 . $3;
print "(p.$page) contains $rest\n";

> I really want to get my head around pattern matching and binding since all
> my working code looks too much like my old FORTRAN programs.

Look for capturing in the regexp chapter of your favorite perl book

-- 
NPV

"the large print giveth, and the small print taketh away"
                                Tom Waits - Step right up



------------------------------

Date: Mon, 16 Feb 2004 14:00:02 +0100
From: Bernard El-Hagin <bernard.el-haginDODGE_THIS@lido-tech.net>
Subject: Re: Regular Expresions & pattern matching (mis)understanding
Message-Id: <slrnc31fg9.10c.bernard.el-haginDODGE_THIS@gdndev25.lido-tech>

On 2004-02-16, Robert Stelmack <robert.h.stelmack*REMOVE*@boeing.com> wrote:
> I have embedded some marked text in a large file to indicate chapters and
> page numbers.  I want to read that file and strip out the page number to be
> displayed with the line of text or following text so as to have a page
> number reference for the displayed text.  I have read and reread the perlre
> and looked for examples on the Internet, but I must be missing a basic
> concept. I also have O'Reill's Programming Perl book, but the examples are
> sometimes hard to apply to what I am trying to do. Here is what I tried to
> get to work (with various syntax changes):


You have made several insanely basic mistakes in your code. I don't
understand how you could have made them if you'd "read and reread the perlre".


> #!/usr/bin/perl


If you'd used warnings you would have got a clue.


> $_= "He had come to pass his experience along to me - if <page>10</page>I
> cared to have it.";
> $PAGE =~ /[<page>][0-9][<\/page>]/;
> s,[<page>][0-9][</page>],,g;
> printf "(p.$PAGE) contains [$_]:\n";
>
> The output I expected was:
>
> (p.10) contains [He had come to pass his experience along to me - if I cared
> to have it.]


Don't you also expect the colon to be there at the end?


> ...but instead it displayed:
>
> bash-2.05b$ test.cgi
> (p.) contains [He had come to pass his experience along to me - if
><page>10</page>I cared to have it.]:
>
> I really want to get my head around pattern matching and binding since all
> my working code looks too much like my old FORTRAN programs.


Then read and reread perlre.


In the meantime, try:


--------
$_= "He had come to pass his experience along to me - if <page>10</page>I
cared to have it.";

s/(.*)<page>(\d+)<\/page>(.*)/(p.$2) contains [$1$3]:\n/s;

print;
--------


-- 
Cheers,
Bernard


------------------------------

Date: Mon, 16 Feb 2004 13:58:47 +0100
From: Gunnar Hjalmarsson <noreply@gunnar.cc>
Subject: Re: Regular Expresions & pattern matching (mis)understanding
Message-Id: <c0qf9u$18h9mi$1@ID-184292.news.uni-berlin.de>

Robert Stelmack wrote:
> 
> #!/usr/bin/perl
> $_= "He had come to pass his experience along to me -
> if <page>10</page>I cared to have it.";
> $PAGE =~ /[<page>][0-9][<\/page>]/;

Several mistakes in that line.

- You need parentheses around $PAGE to enforce list context.
- You need the '=' operator to assign the captured string.
- Brackets are for character classes, and shall not be used around
   <page> etc.
- You need [0-9]+ so that it matches one or more digits.
- You need to surround the latter with parentheses to capture it.

This is what I suppose you mean:

     ($PAGE) = /<page>([0-9]+)<\/page>/;

> s,[<page>][0-9][</page>],,g;

This is what I suppose you mean:

     s,<page>[0-9]+</page>,,;

(not sure why you are using the /g modifier)

> printf "(p.$PAGE) contains [$_]:\n";

No need to use printf(). print() is sufficient.

     print "(p.$PAGE) contains [$_]:\n";

But the $PAGE variable is redundant. Instead you can do:

     s,<page>([0-9]+)</page>,,;
     print "(p.$1) contains [$_]:\n";

HTH

-- 
Gunnar Hjalmarsson
Email: http://www.gunnar.cc/cgi-bin/contact.pl



------------------------------

Date: Mon, 16 Feb 2004 07:39:28 -0600
From: Tad McClellan <tadmc@augustmail.com>
Subject: Re: Regular Expresions & pattern matching (mis)understanding
Message-Id: <slrnc31i0g.3rn.tadmc@magna.augustmail.com>

Robert Stelmack <robert.h.stelmack*REMOVE*@boeing.com> wrote:

> I have read and reread the perlre


Have you read the regex _tutorial_ too?

   perldoc perlretut


> but I must be missing a basic
> concept. 


You are missing multiple concepts simultaneously. See below.


> I also have O'Reill's Programming Perl book, 


The best book for extracting the text-processing power from regexes is:

   "Mastering Regular Expressions" (2nd edition) O'Reilly


> #!/usr/bin/perl


You should ask for all the help you can get:

   use strict;
   use warnings;

Have you seen the Posting Guidelines that are posted here frequently?


> $_= "He had come to pass his experience along to me - if <page>10</page>I
> cared to have it.";


It is absolutely essential that we have exactly the same string as
you if we are to help you with matching that string.

Consider wrapping long strings yourself (in valid Perl) so your
newsreader won't break stuff for you. 


> $PAGE =~ /[<page>][0-9][<\/page>]/;


1) you are trying to match the pattern against the string contained
   in $PAGE, but the string is really in $_. 
   (perl would have warned you about that if you had asked it to...)

2) a "character class" matches a _single_ character.
   [<page>]  is exactly equivalent to   [aegp<>]  since the
   listed characters are the same.

3) your pattern will match only single-digit numbers, you need to
   allow multiple digit characters between the "tags".

4) you need "capturing parenthesis" around the page number digits
   if you want access to them later.

5) you don't need the m// at all if you are going to s/// with
   the same pattern. s/// does nothing if it the match fails.

6) the \d shortcut char class matches the same chars as [0-9].


> s,[<page>][0-9][</page>],,g;


After this statement all of the "tags" will be gone, and it will
be "too late" to apply further processing to them (such as print()).


> printf "(p.$PAGE) contains [$_]:\n";


You should use print() unless you make use of the formatting
that printf() provides.


> The output I expected was:
> 
> (p.10) contains [He had come to pass his experience along to me - if I cared
> to have it.]


----------------------------------
#!/usr/bin/perl
use strict;
use warnings;

$_= "He had come to pass his experience along to me - if "
  . "<page>10</page>I cared to have it.";

while ( s,<page>(\d+)</page>,,g ) {
   print "(p.$1) contains [$_]:\n";
}
----------------------------------


-- 
    Tad McClellan                          SGML consulting
    tadmc@augustmail.com                   Perl programming
    Fort Worth, Texas


------------------------------

Date: Mon, 16 Feb 2004 08:58:48 -0500
From: "Tulan W. Hu" <twhu@lucent.com>
Subject: Re: Replacing unicode characters
Message-Id: <c0qiap$1al@netnews.proxy.lucent.com>

"Bart Lateur" <bart.lateur@pandora.be> wrote in message
> >I tried the above and got the following error message
> >"\x{2019}" does not map to iso-8859-1 at utf.pl line 8, <$FILE> line 161.
> >but the pre-5.8 code just removes the characters for me.
>
> Don't use Latin-1 for the encoding, try cp1252 (AKA Windows) instead.
> That turns out to be chr(0x92) ("right single quotation mark"). For the
> whole list, see
>
> <http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT>
>
> If you don't want the Windows character set, I'd replace all "single
> quotation marks" with apostrophes, ("'", chr(39)), and all "double
> quotation marks" with quotes ('"', chr(34)).

you are right. I don't want the window only character set.
I have no control on the input files because I got them from a vendor.
I have to convert those utf8 files to latin1 for other programs.
Is 'iso-8859-1' the same as 'latin1'?




------------------------------

Date: 16 Feb 2004 06:01:23 GMT
From: Klaas <spamhole@klaas.ca>
Subject: Re: restarting <> (stdio)
Message-Id: <Xns9490E05FC9A72nothing@209.98.50.131>

After careful consideration, J muttered:

> I'm still learning Perl, so i'm sorry if I get the lingo completely
> wrong. 
> 
> I have a script that take a filename as an arguement.  I process each
> line of the file using the while( <> ) statement.  That works
> perfectly. 
> 
> However, now I need to start at the beginning of the input file.  I
> can't figure out how to do this to save my life.  Any suggestions?

Let me preface with 'I am a perl newbie'.

If you save @ARGV and reset it after <> returns false, the next time <> is 
encountered it will re-process @ARGV.

-Mike


------------------------------

Date: Mon, 16 Feb 2004 06:40:19 GMT
From: J <webstuff@fluidic.com>
Subject: Re: restarting <> (stdio)
Message-Id: <0001HW.BC55D0070075896BF03865B0@news-server.tampabay.rr.com>

On Mon, 16 Feb 2004 1:01:23 -0500, Klaas wrote
(in message <Xns9490E05FC9A72nothing@209.98.50.131>):

> After careful consideration, J muttered:
> 
>> I'm still learning Perl, so i'm sorry if I get the lingo completely
>> wrong. 
>> 
>> I have a script that take a filename as an arguement.  I process each
>> line of the file using the while( <> ) statement.  That works
>> perfectly. 
>> 
>> However, now I need to start at the beginning of the input file.  I
>> can't figure out how to do this to save my life.  Any suggestions?
> 
> Let me preface with 'I am a perl newbie'.
> 
> If you save @ARGV and reset it after <> returns false, the next time <> is 
> encountered it will re-process @ARGV.
> 
> -Mike

thank you so much.  That worked perfectly.

J



------------------------------

Date: Mon, 16 Feb 2004 06:53:05 GMT
From: ced@bcstec.ca.boeing.com (Charles DeRykus)
Subject: Re: restarting <> (stdio)
Message-Id: <Ht60GI.FA6@news.boeing.com>

In article <0001HW.BC55ADB00070B816F03865B0@news-server.tampabay.rr.com>,
J  <webstuff@fluidic.com> wrote:
>I'm still learning Perl, so i'm sorry if I get the lingo completely wrong.
>
>I have a script that take a filename as an arguement.  I process each line of 
>the file using the while( <> ) statement.  That works perfectly.
>
>However, now I need to start at the beginning of the input file.  I can't 
>figure out how to do this to save my life.  Any suggestions?
>

If you're talking about re-reading the file with <>,
something like this perhaps:

my $arg = $ARGV[0];
while ( <> ) {
    ...
}
 ...
$ARGV[0] = $arg;
while ( <>) {
 ...

check perlop for details about ARGV and friends...

hth,
--
Charles DeRykus


------------------------------

Date: Mon, 16 Feb 2004 07:16:43 GMT
From: Rocco Caputo <troc@pobox.com>
Subject: Re: Sending HASH over TCP
Message-Id: <slrnc30ror.eu.troc@eyrie.homenet>

On Sun, 15 Feb 2004 19:19:07 -0800, Brad Walton wrote:
> I was actually using that up until I had to make this change. Thanks, it is
> a very easy-to-use module and made accomplishing my task (at that time) very
> simple. Unfortunately, I could not get it to communicate with a new program
> that was introduced to my project, which is not written in Perl and required
> a TCP connection.

Have you looked at POE, then?  It deals well with TCP connections, and
POE::Filter::Reference can serialize Perl data structures in a number of
ways.

-- 
Roccco aputo - rcaputo@pobox.com - http://poe.perl.org/


------------------------------

Date: Mon, 16 Feb 2004 13:55:52 -0000
From: Greg Bacon <gbacon@hiwaay.net>
Subject: Statistics for comp.lang.perl.misc
Message-Id: <1031iv83q7gr527@corp.supernews.com>

Following is a summary of articles spanning a 7 day period,
beginning at 09 Feb 2004 13:56:22 GMT and ending at
16 Feb 2004 13:53:15 GMT.

Notes
=====

    - A line in the body of a post is considered to be original if it
      does *not* match the regular expression /^\s{0,3}(?:>|:|\S+>|\+\+)/.
    - All text after the last cut line (/^-- $/) in the body is
      considered to be the author's signature.
    - The scanner prefers the Reply-To: header over the From: header
      in determining the "real" email address and name.
    - Original Content Rating (OCR) is the ratio of the original content
      volume to the total body volume.
    - Find the News-Scan distribution on the CPAN!
      <URL:http://www.perl.com/CPAN/modules/by-module/News/>
    - Please send all comments to Greg Bacon <gbacon@cs.uah.edu>.
    - Copyright (c) 2004 Greg Bacon.
      Verbatim copying and redistribution is permitted without royalty;
      alteration is not permitted.  Redistribution and/or use for any
      commercial purpose is prohibited.

Excluded Posters
================

perlfaq-suggestions\@(?:.*\.)?perl\.com
faq\@(?:.*\.)?denver\.pm\.org
comdog\@panix\.com

Totals
======

Posters:  256
Articles: 899 (408 with cutlined signatures)
Threads:  168
Volume generated: 1795.0 kb
    - headers:    821.6 kb (15,120 lines)
    - bodies:     902.1 kb (29,072 lines)
    - original:   524.7 kb (18,536 lines)
    - signatures: 70.5 kb (1,461 lines)

Original Content Rating: 0.582

Averages
========

Posts per poster: 3.5
    median: 2.0 posts
    mode:   1 post - 117 posters
    s:      8.2 posts
Posts per thread: 5.4
    median: 4.0 posts
    mode:   2 posts - 28 threads
    s:      7.0 posts
Message size: 2044.6 bytes
    - header:     935.8 bytes (16.8 lines)
    - body:       1027.5 bytes (32.3 lines)
    - original:   597.6 bytes (20.6 lines)
    - signature:  80.3 bytes (1.6 lines)

Top 20 Posters by Number of Posts
=================================

         (kb)   (kb)  (kb)  (kb)
Posts  Volume (  hdr/ body/ orig)  Address
-----  --------------------------  -------

  106   255.7 ( 98.7/129.7/ 54.5)  Ben Morrow <usenet@morrow.me.uk>
   36    99.3 ( 43.3/ 51.3/ 44.2)  tadmc@augustmail.com
   29    47.2 ( 27.4/ 17.9/  9.4)  Gunnar Hjalmarsson <noreply@gunnar.cc>
   25    53.0 ( 22.4/ 24.9/  9.1)  Uri Guttman <uri@stemsystems.com>
   22    41.5 ( 20.5/ 21.0/ 12.0)  "gnari" <gnari@simnet.is>
   21    52.7 ( 19.2/ 30.0/ 18.1)  Tore Aursand <tore@aursand.no>
   21    36.7 ( 15.7/ 21.0/ 10.0)  Anno Siegel <anno4000@lublin.zrz.tu-berlin.de>
   16    38.1 ( 12.9/ 25.2/ 19.0)  thumb_42@yahoo.com
   15    39.1 ( 21.3/ 13.9/  7.5)  jwillmore@adelphia.net
   13    24.7 (  9.9/ 12.3/  6.7)  Michele Dondi <bik.mido@tiscalinet.it>
   13    21.8 ( 10.2/ 11.5/  6.6)  "David K. Wall" <dwall@fastmail.fm>
   12    17.2 ( 11.3/  5.8/  3.4)  Bart Lateur <bart.lateur@pandora.be>
   12    18.3 (  9.5/  8.7/  4.8)  David O in Seattle <me@privacy.net>
   11    19.7 (  9.4/  7.5/  2.8)  David Dyer-Bennet <dd-b@dd-b.net>
   11    36.5 ( 10.1/ 25.7/ 13.9)  Brian McCauley <nobull@mail.com>
   11    18.5 ( 10.2/  8.4/  4.5)  "Matt Garrish" <matthew.garrish@sympatico.ca>
   10    16.9 (  8.0/  7.7/  5.0)  Walter Roberson <roberson@ibd.nrc-cnrc.gc.ca>
    8    25.7 ( 10.4/ 15.3/  7.3)  Chris <ceo@nospan.on.net>
    8    23.0 (  7.6/ 13.6/  7.6)  tassilo.parseval@post.rwth-aachen.de
    7    11.8 (  6.7/  5.1/  3.0)  Brad Baxter <bmb@ginger.libs.uga.edu>

These posters accounted for 45.3% of all articles.

Top 20 Posters by Number of Followups
=====================================

             (kb)   (kb)  (kb)  (kb)
Followups  Volume (  hdr/ body/ orig)  Address
---------  --------------------------  -------

      106   255.7 ( 98.7/129.7/ 54.5)  Ben Morrow <usenet@morrow.me.uk>
       34    99.3 ( 43.3/ 51.3/ 44.2)  tadmc@augustmail.com
       29    47.2 ( 27.4/ 17.9/  9.4)  Gunnar Hjalmarsson <noreply@gunnar.cc>
       25    53.0 ( 22.4/ 24.9/  9.1)  Uri Guttman <uri@stemsystems.com>
       22    41.5 ( 20.5/ 21.0/ 12.0)  "gnari" <gnari@simnet.is>
       21    36.7 ( 15.7/ 21.0/ 10.0)  Anno Siegel <anno4000@lublin.zrz.tu-berlin.de>
       20    52.7 ( 19.2/ 30.0/ 18.1)  Tore Aursand <tore@aursand.no>
       15    38.1 ( 12.9/ 25.2/ 19.0)  thumb_42@yahoo.com
       14    39.1 ( 21.3/ 13.9/  7.5)  jwillmore@adelphia.net
       13    21.8 ( 10.2/ 11.5/  6.6)  "David K. Wall" <dwall@fastmail.fm>
       12    17.2 ( 11.3/  5.8/  3.4)  Bart Lateur <bart.lateur@pandora.be>
       12    24.7 (  9.9/ 12.3/  6.7)  Michele Dondi <bik.mido@tiscalinet.it>
       11    36.5 ( 10.1/ 25.7/ 13.9)  Brian McCauley <nobull@mail.com>
       11    19.7 (  9.4/  7.5/  2.8)  David Dyer-Bennet <dd-b@dd-b.net>
       10    18.3 (  9.5/  8.7/  4.8)  David O in Seattle <me@privacy.net>
       10    18.5 ( 10.2/  8.4/  4.5)  "Matt Garrish" <matthew.garrish@sympatico.ca>
       10    16.9 (  8.0/  7.7/  5.0)  Walter Roberson <roberson@ibd.nrc-cnrc.gc.ca>
        8    23.0 (  7.6/ 13.6/  7.6)  tassilo.parseval@post.rwth-aachen.de
        7    25.7 ( 10.4/ 15.3/  7.3)  Chris <ceo@nospan.on.net>
        7    10.7 (  6.8/  3.9/  2.4)  Joe Smith <Joe.Smith@inwap.com>

These posters accounted for 51.9% of all followups.

Top 20 Posters by Volume
========================

  (kb)   (kb)  (kb)  (kb)
Volume (  hdr/ body/ orig)  Posts  Address
--------------------------  -----  -------

 255.7 ( 98.7/129.7/ 54.5)    106  Ben Morrow <usenet@morrow.me.uk>
  99.3 ( 43.3/ 51.3/ 44.2)     36  tadmc@augustmail.com
  53.0 ( 22.4/ 24.9/  9.1)     25  Uri Guttman <uri@stemsystems.com>
  52.7 ( 19.2/ 30.0/ 18.1)     21  Tore Aursand <tore@aursand.no>
  47.2 ( 27.4/ 17.9/  9.4)     29  Gunnar Hjalmarsson <noreply@gunnar.cc>
  41.5 ( 20.5/ 21.0/ 12.0)     22  "gnari" <gnari@simnet.is>
  39.1 ( 21.3/ 13.9/  7.5)     15  jwillmore@adelphia.net
  38.1 ( 12.9/ 25.2/ 19.0)     16  thumb_42@yahoo.com
  36.7 ( 15.7/ 21.0/ 10.0)     21  Anno Siegel <anno4000@lublin.zrz.tu-berlin.de>
  36.5 ( 10.1/ 25.7/ 13.9)     11  Brian McCauley <nobull@mail.com>
  25.7 ( 10.4/ 15.3/  7.3)      8  Chris <ceo@nospan.on.net>
  24.7 (  9.9/ 12.3/  6.7)     13  Michele Dondi <bik.mido@tiscalinet.it>
  23.0 (  7.6/ 13.6/  7.6)      8  tassilo.parseval@post.rwth-aachen.de
  21.8 ( 10.2/ 11.5/  6.6)     13  "David K. Wall" <dwall@fastmail.fm>
  19.7 (  9.4/  7.5/  2.8)     11  David Dyer-Bennet <dd-b@dd-b.net>
  18.5 ( 10.2/  8.4/  4.5)     11  "Matt Garrish" <matthew.garrish@sympatico.ca>
  18.3 (  9.5/  8.7/  4.8)     12  David O in Seattle <me@privacy.net>
  17.2 ( 11.3/  5.8/  3.4)     12  Bart Lateur <bart.lateur@pandora.be>
  16.9 (  8.0/  7.7/  5.0)     10  Walter Roberson <roberson@ibd.nrc-cnrc.gc.ca>
  13.7 (  7.5/  6.2/  2.6)      7  Ilya Zakharevich <nospam-abuse@ilyaz.org>

These posters accounted for 50.1% of the total volume.

Top 17 Posters by Volume of Original Content (min. ten posts)
=============================================================

        (kb)
Posts   orig  Address
-----  -----  -------

  106   54.5  Ben Morrow <usenet@morrow.me.uk>
   36   44.2  tadmc@augustmail.com
   16   19.0  thumb_42@yahoo.com
   21   18.1  Tore Aursand <tore@aursand.no>
   11   13.9  Brian McCauley <nobull@mail.com>
   22   12.0  "gnari" <gnari@simnet.is>
   21   10.0  Anno Siegel <anno4000@lublin.zrz.tu-berlin.de>
   29    9.4  Gunnar Hjalmarsson <noreply@gunnar.cc>
   25    9.1  Uri Guttman <uri@stemsystems.com>
   15    7.5  jwillmore@adelphia.net
   13    6.7  Michele Dondi <bik.mido@tiscalinet.it>
   13    6.6  "David K. Wall" <dwall@fastmail.fm>
   10    5.0  Walter Roberson <roberson@ibd.nrc-cnrc.gc.ca>
   12    4.8  David O in Seattle <me@privacy.net>
   11    4.5  "Matt Garrish" <matthew.garrish@sympatico.ca>
   12    3.4  Bart Lateur <bart.lateur@pandora.be>
   11    2.8  David Dyer-Bennet <dd-b@dd-b.net>

These posters accounted for 44.1% of the original volume.

Top 17 Posters by OCR (minimum of ten posts)
============================================

         (kb)    (kb)
OCR      orig /  body  Posts  Address
-----  --------------  -----  -------

0.863  ( 44.2 / 51.3)     36  tadmc@augustmail.com
0.755  ( 19.0 / 25.2)     16  thumb_42@yahoo.com
0.649  (  5.0 /  7.7)     10  Walter Roberson <roberson@ibd.nrc-cnrc.gc.ca>
0.605  ( 18.1 / 30.0)     21  Tore Aursand <tore@aursand.no>
0.595  (  3.4 /  5.8)     12  Bart Lateur <bart.lateur@pandora.be>
0.571  (  6.6 / 11.5)     13  "David K. Wall" <dwall@fastmail.fm>
0.568  ( 12.0 / 21.0)     22  "gnari" <gnari@simnet.is>
0.551  (  4.8 /  8.7)     12  David O in Seattle <me@privacy.net>
0.545  (  6.7 / 12.3)     13  Michele Dondi <bik.mido@tiscalinet.it>
0.543  ( 13.9 / 25.7)     11  Brian McCauley <nobull@mail.com>
0.542  (  4.5 /  8.4)     11  "Matt Garrish" <matthew.garrish@sympatico.ca>
0.539  (  7.5 / 13.9)     15  jwillmore@adelphia.net
0.528  (  9.4 / 17.9)     29  Gunnar Hjalmarsson <noreply@gunnar.cc>
0.475  ( 10.0 / 21.0)     21  Anno Siegel <anno4000@lublin.zrz.tu-berlin.de>
0.420  ( 54.5 /129.7)    106  Ben Morrow <usenet@morrow.me.uk>
0.378  (  2.8 /  7.5)     11  David Dyer-Bennet <dd-b@dd-b.net>
0.365  (  9.1 / 24.9)     25  Uri Guttman <uri@stemsystems.com>

Bottom 17 Posters by OCR (minimum of ten posts)
===============================================

         (kb)    (kb)
OCR      orig /  body  Posts  Address
-----  --------------  -----  -------

0.863  ( 44.2 / 51.3)     36  tadmc@augustmail.com
0.755  ( 19.0 / 25.2)     16  thumb_42@yahoo.com
0.649  (  5.0 /  7.7)     10  Walter Roberson <roberson@ibd.nrc-cnrc.gc.ca>
0.605  ( 18.1 / 30.0)     21  Tore Aursand <tore@aursand.no>
0.595  (  3.4 /  5.8)     12  Bart Lateur <bart.lateur@pandora.be>
0.571  (  6.6 / 11.5)     13  "David K. Wall" <dwall@fastmail.fm>
0.568  ( 12.0 / 21.0)     22  "gnari" <gnari@simnet.is>
0.551  (  4.8 /  8.7)     12  David O in Seattle <me@privacy.net>
0.545  (  6.7 / 12.3)     13  Michele Dondi <bik.mido@tiscalinet.it>
0.543  ( 13.9 / 25.7)     11  Brian McCauley <nobull@mail.com>
0.542  (  4.5 /  8.4)     11  "Matt Garrish" <matthew.garrish@sympatico.ca>
0.539  (  7.5 / 13.9)     15  jwillmore@adelphia.net
0.528  (  9.4 / 17.9)     29  Gunnar Hjalmarsson <noreply@gunnar.cc>
0.475  ( 10.0 / 21.0)     21  Anno Siegel <anno4000@lublin.zrz.tu-berlin.de>
0.420  ( 54.5 /129.7)    106  Ben Morrow <usenet@morrow.me.uk>
0.378  (  2.8 /  7.5)     11  David Dyer-Bennet <dd-b@dd-b.net>
0.365  (  9.1 / 24.9)     25  Uri Guttman <uri@stemsystems.com>

17 posters (6%) had at least ten posts.

Top 20 Threads by Number of Posts
=================================

Posts  Subject
-----  -------

   62  Why is Perl losing ground?
   27  Why references??
   26  Perl usage these days?
   24  Environment question
   23  RFC: utils.pm
   15  perl/korn scirpt for appending first variable to end of line.
   14  if for...
   13  Using Tie::IxHash order a hash reference
   13  Encrypt and Decrypt for text ...
   12  glob in perl
   12  Replacing unicode characters
   11  Comparing a string to filenames in a directory
   11  Regular Expression Help Needed
   10  hidden perl
   10  configure problem
   10  Is Windows running
   10  Anyone using Berkley XML DB w/Perl (of course)...?
   10  Reading html into a source file
    9  Choosing grouping of split items
    9  Sending HASH over TCP

These threads accounted for 36.8% of all articles.

Top 20 Threads by Volume
========================

  (kb)   (kb)  (kb)  (kb)
Volume (  hdr/ body/ orig)  Posts  Subject
--------------------------  -----  -------

 129.1 ( 55.9/ 66.7/ 40.2)     62  Why is Perl losing ground?
  79.7 ( 21.5/ 54.2/ 26.7)     23  RFC: utils.pm
  57.3 ( 24.8/ 30.3/ 18.6)     27  Why references??
  50.6 ( 27.2/ 20.9/ 13.7)     24  Environment question
  45.7 ( 24.0/ 20.8/ 12.4)     26  Perl usage these days?
  34.5 (  2.1/ 32.3/ 32.3)      2  Posting Guidelines for comp.lang.perl.misc ($Revision: 1.5 $)
  32.5 ( 10.7/ 21.0/  9.7)     10  Anyone using Berkley XML DB w/Perl (of course)...?
  30.6 ( 11.2/ 17.9/ 10.1)     12  Replacing unicode characters
  26.4 ( 16.9/  8.5/  3.4)     15  perl/korn scirpt for appending first variable to end of line.
  24.4 ( 12.8/  9.8/  5.2)     14  if for...
  24.0 ( 10.0/ 13.4/  6.5)      9  Sending HASH over TCP
  23.8 ( 10.5/ 12.7/  8.4)     11  Comparing a string to filenames in a directory
  23.4 ( 12.4/  8.8/  3.5)     13  Using Tie::IxHash order a hash reference
  21.1 (  6.9/ 13.5/  7.3)      8  Confused by hashes/data structures
  20.1 ( 10.5/  9.1/  4.3)     10  configure problem
  19.9 ( 10.1/  7.8/  4.0)     12  glob in perl
  19.6 ( 13.3/  5.8/  3.2)     13  Encrypt and Decrypt for text ...
  18.6 (  1.8/ 16.7/ 10.4)      2  When to "use strict" when teaching?
  18.6 ( 11.2/  6.9/  4.6)     11  Regular Expression Help Needed
  17.4 (  8.9/  8.0/  4.3)     10  Is Windows running

These threads accounted for 40.0% of the total volume.

Top 18 Threads by OCR (minimum of ten posts)
============================================

         (kb)    (kb)
OCR      orig /  body  Posts  Subject
-----  --------------  -----  -------

0.663  (  8.4/  12.7)     11  Comparing a string to filenames in a directory
0.658  (  4.6/   6.9)     11  Regular Expression Help Needed
0.656  ( 13.7/  20.9)     24  Environment question
0.629  (  3.4/   5.4)     10  Reading html into a source file
0.613  ( 18.6/  30.3)     27  Why references??
0.603  ( 40.2/  66.7)     62  Why is Perl losing ground?
0.595  ( 12.4/  20.8)     26  Perl usage these days?
0.564  ( 10.1/  17.9)     12  Replacing unicode characters
0.553  (  3.2/   5.8)     13  Encrypt and Decrypt for text ...
0.540  (  4.3/   8.0)     10  Is Windows running
0.534  (  5.2/   9.8)     14  if for...
0.520  (  3.1/   5.9)     10  hidden perl
0.508  (  4.0/   7.8)     12  glob in perl
0.493  ( 26.7/  54.2)     23  RFC: utils.pm
0.473  (  4.3/   9.1)     10  configure problem
0.463  (  9.7/  21.0)     10  Anyone using Berkley XML DB w/Perl (of course)...?
0.397  (  3.5/   8.8)     13  Using Tie::IxHash order a hash reference
0.396  (  3.4/   8.5)     15  perl/korn scirpt for appending first variable to end of line.

Bottom 18 Threads by OCR (minimum of ten posts)
===============================================

         (kb)    (kb)
OCR      orig /  body  Posts  Subject
-----  --------------  -----  -------

0.663  (  8.4 / 12.7)     11  Comparing a string to filenames in a directory
0.658  (  4.6 /  6.9)     11  Regular Expression Help Needed
0.656  ( 13.7 / 20.9)     24  Environment question
0.629  (  3.4 /  5.4)     10  Reading html into a source file
0.613  ( 18.6 / 30.3)     27  Why references??
0.603  ( 40.2 / 66.7)     62  Why is Perl losing ground?
0.595  ( 12.4 / 20.8)     26  Perl usage these days?
0.564  ( 10.1 / 17.9)     12  Replacing unicode characters
0.553  (  3.2 /  5.8)     13  Encrypt and Decrypt for text ...
0.540  (  4.3 /  8.0)     10  Is Windows running
0.534  (  5.2 /  9.8)     14  if for...
0.520  (  3.1 /  5.9)     10  hidden perl
0.508  (  4.0 /  7.8)     12  glob in perl
0.493  ( 26.7 / 54.2)     23  RFC: utils.pm
0.473  (  4.3 /  9.1)     10  configure problem
0.463  (  9.7 / 21.0)     10  Anyone using Berkley XML DB w/Perl (of course)...?
0.397  (  3.5 /  8.8)     13  Using Tie::IxHash order a hash reference
0.396  (  3.4 /  8.5)     15  perl/korn scirpt for appending first variable to end of line.

18 threads (10%) had at least ten posts.

Top 18 Targets for Crossposts
=============================

Articles  Newsgroup
--------  ---------

      15  comp.unix.solaris
      12  comp.lang.perl.modules
      10  comp.programming
      10  comp.unix.questions
      10  alt.php
      10  comp.lang.tcl
       8  comp.infosystems.www.authoring.html
       4  alt.html
       4  comp.sys.ibm.as400.misc
       3  comp.lang.c
       3  comp.lang.c++
       3  comp.lang.perl
       3  comp.lang.ada
       3  comp.lang.java.advocacy
       1  comp.answers
       1  news.answers
       1  alt.perl
       1  comp.lang.perl.moderated

Top 20 Crossposters
===================

Articles  Address
--------  -------

      16  Deja User <dejauser@safe-mail.net>
       8  Bruce Hartweg <bruce-news@hartweg.us>
       8  Gunnar Hjalmarsson <noreply@gunnar.cc>
       7  Barry Margolin <barmar@alum.mit.edu>
       5  Beardy <beardy@beardy.net>
       4  Michael Kelly <kelly@cs.dal.ca>
       4  Jack Klein <jackklein@spamcop.net>
       4  "Jean-Pierre Rosen" <rosen@adalog.fr>
       4  Programmer Dude <Chris@Sonnack.com>
       4  "John W. Krahn" <krahnj@acm.org>
       3  Ben Morrow <usenet@morrow.me.uk>
       3  jwillmore@adelphia.net
       2  funtoosh <a_gilotra@yahoo.com>
       2  thumb_42@yahoo.com
       2  Michel Rodriguez <mirod@xmltwig.com>
       2  Robert James Kaes <rjkaes@flarenet.com>
       2  Mark J Fenbers <Mark.Fenbers@noaa.gov>
       2  David O in Seattle <me@privacy.net>
       2  Tony Walton <tony.walton@s_u_n.com>
       2  Joe Smith <Joe.Smith@inwap.com>


------------------------------

Date: 16 Feb 2004 13:24:58 GMT
From: anno4000@lublin.zrz.tu-berlin.de (Anno Siegel)
Subject: Re: Trent Curry, wsanford@wallysanford.com, and falsely using existing email addresses
Message-Id: <c0qgba$for$1@mamenchi.zrz.TU-Berlin.DE>

Wally Sanford <wally@wallysanford.com> wrote in comp.lang.perl.misc:

> I see that someone has been posting to this and other groups since
> late 2003 using a false email address, one that happens to belong to
> me. I see that this person frequently engages in behaviors such as
> flaming, netcopping, and other generally obnoxious behavior.

Yes, we know the guy, by the name "Trent Curry" and perhaps a half or
full dozen others he(?) has been using.  His specialty is starting a
flame war, then posting supportive followups to himself under one
or more assumed identities.

> The problem I have with this is that I don't know how someone can
> distinguish my (hopefully polite and netiquitte-compliant) posts from
> this malefactor's.

If that is your only concern you don't have much to worry.  The guy has
tried his shenanigans three or four times on clpm, but is intellectually
unable to pull them through.  He soon forgets who he is posting as, which
leads to blunders like saying "I" when he ought to say "he" and similar
giveaways.

He is also unable to spell the word "just": one of three times it comes
out as "jsut".  Since "just" is indispensable in expressing his exasperation
this happens in about every post of his.  In other words, he is found
out before he has really started.

[...]

> I am expert in neither Perl nor Usenet, but I wonder if some of you
> who are might advise me how to contact this individual, and reason
> with him/her, or alternatively, if you know his/her physical address.

From his past behavior I wouldn't expect him to use your address again.
He seems to pick them up at random for use in one of his campaigns, which
lasts until the forgery has become absolutely undeniable.  The next
time he picks new ones.

Anno


------------------------------

Date: Mon, 16 Feb 2004 07:59:50 +0100
From: Gunnar Hjalmarsson <noreply@gunnar.cc>
Subject: Re: Trent Curry, wsanford@wallysanford.com, and falsely using existing email addresses
Message-Id: <c0pq75$1ab952$1@ID-184292.news.uni-berlin.de>

Wally Sanford wrote:
> I see that someone has been posting to this and other groups since 
> late 2003 using a false email address, one that happens to belong
> to me.

<snip>

> I realize that ... I may have no formal grounds to prevent such
> inappropriate use of my email address or domain,

I thought (and I surely hope) that abusing somebody's email address
that way is illegal.

The first thing I would do is trying to figure out which ISP he is
using to access Internet, and asking them to take actions. The 
"NNTP-Posting-Host:" header seems to contain that info.

-- 
Gunnar Hjalmarsson
Email: http://www.gunnar.cc/cgi-bin/contact.pl



------------------------------

Date: 16 Feb 2004 11:48:24 GMT
From: Eric Bohlman <ebohlman@earthlink.net>
Subject: Re: Why is Perl losing ground?
Message-Id: <Xns94913B8228361ebohlmanomsdevcom@130.133.1.17>

Ben Morrow <usenet@morrow.me.uk> wrote in
news:c0j3tk$6gt$1@wisteria.csv.warwick.ac.uk: 

> OK, yes, there's one *hell* of a lot of really badly written Perl out
> there. Perl is a tool which treats the programmer as a responsible
> adult, capable of making sensible decisions about the compromise
> between readability and brevity, between getting the thing working now
> and keeping it working later. All too many programmers abuse that
> responsibility, but that doesn't make Perl a bad language, it makes
> tham bad programmers.

Yep.  Larry, AFAICT, didn't intend Perl to be an "undisciplined" language.  
He intended it to be a "bring your own discipline (BYOD)" language.  And I 
think that's because he realized that:

1) Despite all the religious wars over which style of programming 
discipline is the Right One, what really matters in the real world of 
writing correct, scalable and maintainable programs is that you pick one 
and stick to it.

2) (This one is probably going to raise some hackles) Many problems are not 
technical problems and therefore one should not attempt to solve them by 
technological means.  In particular, programming discipline or the lack 
thereof is a social problem, not a technical one, and therefore requires a 
social solution rather than a technical solution.  Trying to design a 
language to make it impossible to use bad programming technique is trying 
to solve a social problem by forcing people to change to fit the 
technology.  That seldom works in the Real World; when people don't "buy 
into" a particular methodology, they find ways of getting around it.  The 
solution is to get the buy-in.  (Note, though, that if the design of a 
language makes it difficult or impossible to apply good programming 
techniques, *that* is a technical problem which can only be solved by 
changing the tool.  I don't think that's applicable to Perl, though.)

Gerald Weinberg's classic _The Psychology of Computer Programming_ has much 
to say about such matters.


------------------------------

Date: 16 Feb 2004 11:59:17 GMT
From: Eric Bohlman <ebohlman@earthlink.net>
Subject: Re: Why is Perl losing ground?
Message-Id: <Xns94913D5AA6EE6ebohlmanomsdevcom@130.133.1.4>

Ben Morrow <usenet@morrow.me.uk> wrote in
news:c0otk7$gfa$2@wisteria.csv.warwick.ac.uk: 

> 
> use63net@yahoo.com (Unknown Poster) wrote:
>> The large software projects that will be developed through a complete
>> OO process can't be done 20% the way Bob likes it, 25% Ann likes it, 
>> 15% Raoul likes it, and 40% the only way the project manager knows
>> how to do it.
> 
> Right... so for such a project, you write a set of coding guidelines
> that standardises how it's going to be done. And everyone follows
> them. Where's the problem?

Exactly.  Their success doesn't depend on Jason, Michelle, Roberto and 
*their* project manager, all in some entirely different organization, doing 
things the same way they do it.  Yet trying to enforce programming 
discipline through language design has that effect.  TMTOWTDI doesn't imply 
that *all* WTDIs are equally good in every situation.  It simply recognizes 
that which WTDI is the best one will vary from situation to situation, and 
gives those who WBDI the freedom to choose it.


------------------------------

Date: Mon, 16 Feb 2004 12:11:04 +0100
From: Michel Rodriguez <mirod@xmltwig.com>
Subject: Re: XML::Parser Style => Object
Message-Id: <c0q35k$qqg$1@news-reader5.wanadoo.fr>

Bart Lateur wrote:

>>I was under the impression that XML::Parser was more-or-less
>>deprecated now, in favour of SAX... is that not correct?
> 
> That was what Matt Sergeant, maintainer of most of the XML modules,
> seems to be wanting, yes. I'm not sure I agree.

I think XML::Parser should be deprecated for "regular users". It is not 
actively supported any more, and if you want to learn a streaming 
interface, you might as well learn SAX (although SAX is lower-level than 
XML::Parser if you can believe it).

XML::Parser is still useful as a basic XML parsing layer for other 
modules though, that give a higher-level API: XML::Simple (can also work 
on top of a SAX parser), XML::Twig, XML::DOM (although I would not 
recommend using XML::DOM, but rather XML::LibXML), XML::PYX...

In the last 2/3 years the perl XML community has largely shifted from 
XML::Parser (and expat), to XML::LibXML (based on libxml2), which is a 
lot more powerful, giving you XML parsing but also DOM, XPath, 
XInclude... The only problems with XML::LibXML is that as the library 
and the module are both still work in progress, they have sometimes 
stability issues, you have to find the proper combination of 
libxml2/XML::LibXML that works and stick to it for a while 
(<pet_peeve>and that said, XML::Parser has had its problems too: the 
interface to the DTD events for example has changed quite a few times in 
incompatible ways</pet_peeve> ;--)

A plus of XML::Parser is also that it comes installed with Activestate 
Perl, which might make it easier for people who don't have a compiler on 
their system (but note that XML::SAX::PurePerl does not need to be 
compiled and can work in such an environment too).

-- 
Michel Rodriguez
Perl &amp; XML
http://www.xmltwig.com



------------------------------

Date: 6 Apr 2001 21:33:47 GMT (Last modified)
From: Perl-Users-Request@ruby.oce.orst.edu (Perl-Users-Digest Admin) 
Subject: Digest Administrivia (Last modified: 6 Apr 01)
Message-Id: <null>


Administrivia:

#The Perl-Users Digest is a retransmission of the USENET newsgroup
#comp.lang.perl.misc.  For subscription or unsubscription requests, send
#the single line:
#
#	subscribe perl-users
#or:
#	unsubscribe perl-users
#
#to almanac@ruby.oce.orst.edu.  

NOTE: due to the current flood of worm email banging on ruby, the smtp
server on ruby has been shut off until further notice. 

To submit articles to comp.lang.perl.announce, send your article to
clpa@perl.com.

#To request back copies (available for a week or so), send your request
#to almanac@ruby.oce.orst.edu with the command "send perl-users x.y",
#where x is the volume number and y is the issue number.

#For other requests pertaining to the digest, send mail to
#perl-users-request@ruby.oce.orst.edu. Do not waste your time or mine
#sending perl questions to the -request address, I don't have time to
#answer them even if I did know the answer.


------------------------------
End of Perl-Users Digest V10 Issue 6141
***************************************


home help back first fref pref prev next nref lref last post