[15793] in Perl-Users-Digest
Perl-Users Digest, Issue: 3206 Volume: 9
daemon@ATHENA.MIT.EDU (Perl-Users Digest)
Tue May 30 11:10:34 2000
Date: Tue, 30 May 2000 08:10:11 -0700 (PDT)
From: Perl-Users Digest <Perl-Users-Request@ruby.OCE.ORST.EDU>
To: Perl-Users@ruby.OCE.ORST.EDU (Perl-Users Digest)
Message-Id: <959699411-v9-i3206@ruby.oce.orst.edu>
Content-Type: text
Perl-Users Digest Tue, 30 May 2000 Volume: 9 Number: 3206
Today's topics:
Re: oops Re: seeking method to encode email addresses i <nospam@devnull.com>
Re: oops Re: seeking method to encode email addresses i <godzilla@stomp.stomp.tokyo>
Re: Perl forking question (multiple times) <billy@arnis-bsl.com>
Re: Perl, system and Cron <elaine@chaos.wustl.edu>
Re: Perl, system and Cron <e8825393@stud2.tuwien.ac.at>
Re: Reading files into a list- faster? <aqumsieh@hyperchip.com>
Re: seeking method to encode email addresses in web pag <nospam@devnull.com>
Re: seeking method to encode email addresses in web pag <godzilla@stomp.stomp.tokyo>
Statistics for comp.lang.perl.misc <gbacon@cs.uah.edu>
which perl module for http perl script (Beetlejuice)
Digest Administrivia (Last modified: 16 Sep 99) (Perl-Users-Digest Admin)
----------------------------------------------------------------------
Date: 30 May 2000 14:06:58 GMT
From: The WebDragon <nospam@devnull.com>
Subject: Re: oops Re: seeking method to encode email addresses in web pageforms
Message-Id: <8h0hu2$kvb$0@216.155.32.164>
In article <393066CA.319D5EEE@stomp.stomp.tokyo>, "Godzilla!"
<godzilla@stomp.stomp.tokyo> wrote:
Apologies in advance folks, for stretching the linewrapping for this one post. It was too tedious to mark every line that wrapped too early in the
script enclosed.. :/
| Extracted from Web Dragon's recent
| series of articles.
|
| PERL 5 VERSION
| (NO PRINT AT ALL)
(no, because I don't dereference the HOHOL until later on in the output section)
| __________________
|
|
| my($gametype, $name, $rating, $mapcount);
| foreach my $filez (sort @fileslist) {
| my @filedata = ();
| my $inputfile = File::Spec->catfile( $inputDir, $filez);
| open(IN, "<$inputfile") or die("Can't open $inputfile : $!\n");
| while (<IN>) {
| s/\012//; #'scuse the silly linewrapping below
| if (/Name: +<a +href="http:[^"]+\?nalicity\//gi .. /\G([^"]+)"/g) {
| if (defined $1) {
| ($gametype, $name) = split /\//, $1;
| $name =~ s/\.[^.]*$//;
| };
| } elsif (/Rating: +\(1-10\) +([\d\.]+)/) {
| $rating = $1;
| $listing{$gametype}{$name} = $rating;
| ++$mapcount;
| last;
| }
| }
| close(IN);
| }
note that what I wind up with here contains more or less the following hash (larger or smaller depending on the number of inputfiles I need to scan
through.. this past week had some 40 reviews come in, for example), so assuming we had 9 map reviews this week, three from each gametype, we get:
%listing = (
'utdm' => {
'dm-test' => "3.5",
'dm-somethingelse' => "9.5",
'dm-jumkymap' => "2",
}
'utassault' => {
'as-disaster' => "8",
'as-thebase' => "6.5",
'as-spades' => "5",
}
'utdomination' => {
'dom-test' => "4",
'dom-nutrageous' => "7.5",
'dom-99bottles' => "9.5",
}
);
which is, aside from being quite pretty, also nicely elegant in complexity, allowing me to access and sort any level of data I need by simple
dereferencing. I've since changed this to ALSO include the name of the html file in the odd case that it is different from the name of the map. (see
below, and below script at bottom)
current format is: $listing{$gametype}{$mapname} = [$html_filename, $rating]
i.e.
%listing = (
$gametype =>{
$mapname => [ $html_filename, $rating ]
}
)
as you'll see in the script at the bottom, this solved a rather sticky problem that crept in.
| ========================================================
|
|
| PERL 4 VERSION
| (INCLUDES A PRETTY PRINT)
| _________________________
|
|
| $pretend_input = (same input as Web Dragon input)
| $pretend_input =~ s/<([^>]+)>//g;
| $pretend_input =~ s/\n\n|\n/¦/g;
| $pretend_input =~ s/¦ ¦|¦¦/¦/g;
| $pretend_input =~ s/:¦/:/g;
| $pretend_input =~ s/¦ N/¡ N/g;
| chop ($pretend_input);
| @Count_Games = split (/ ¡ /, $pretend_input);
| $count = ($#Count_Games + 1);
| print "
| There are $count distinct games included within
| these input data. Those $count games are: \n\n";
| foreach $element (@Count_Games)
| {
| local ($name, $author, $rating) = split (/¦/, $element);
| print " $name $author $rating \n";
| }
| exit;
With this script, however, the $rating variable also winds up with "(1-10)" in it, which is irrelevant to the rating score.
The current script has gotten quite a bit more sophisticated as I slowly learn and find ways to do things since starting with perl about a month ago.
here is the current script in its entirety.
=== begin reviewsweeps.pl ===
=== cut here ===
#!perl -w
#### Map Reviewlisting overview generator
#### by FuzzBuster@planetunreal.com, for the NaliCity Sunday Reviews (yes, that's me too :)
#### release version 0.8b
use strict;
#leave this on while testing, otherwise comment out as it slows things down
#use diagnostics -verbose;
# set up some compatibility and formatting ease.
use POSIX qw(strftime);
use File::Spec;
use CGI::Carp;
use CGI qw( :standard :html3 );
if ($CGI::VERSION < 2.66) {
confess ("You need to install CGI.pm version 2.66 or later to run this script.\n",
'The most recent version information and revision history is at http://stein.cshl.org/WWW/software/CGI/index.html#new')
};
use CGI::Pretty qw( :html3 );
my $input_dir = File::Spec->catfile(File::Spec->curdir(), 'reviews', '');
my $outfile = File::Spec->catfile(File::Spec->curdir(), 'maprev.html');
my $diagnostic = File::Spec->catfile(File::Spec->curdir(), 'diagnostics.txt');
# delete previous file so the parsing routine doesn't scan it, as it will create a new one from the other files anyway.
unlink $outfile;
unlink $diagnostic;
opendir(DIR, $input_dir) || die "can't opendir $input_dir: $!";
my @filestart = readdir(DIR);
closedir DIR;
open(DIAGNOSTIC, ">$diagnostic") or die("Can't open $diagnostic : $!");
print DIAGNOSTIC "Diagnostic output: \n";
print DIAGNOSTIC scalar(@filestart), " Files found - ";
print scalar(@filestart), " Files found - ";
#if the file doesn't end in *.htm*, skip and go to the next file
# let's get this outta the way first off. saves time by not scanning useless files.
my @fileslist = ();
foreach my $filetest (@filestart) {
if ($filetest =~ m|\.htm.?$|) {
push @fileslist, $filetest
}
}
print DIAGNOSTIC scalar(@fileslist), " are .htm(l) files - ";
print scalar(@fileslist), " are .htm(l) files - ";
# Now, loop through all the .htm files, using a special loop construct to exit the loop for that file,
# once the gametype, mapname, and rating are extracted, to save time.
# also caught a bug in the original script that would miss the fact that the review could be named
# dm6pack2v1.1.html and the map named dm6pack2.zip, which would cause the link to become broken.
# Next revision to include warnings if the filenames do NOT match, (even though we do the right thing)
# as the SQL database these eventually get stored in needs to have exact matches on the filenames
my %listing = ();
my($gametype, $name, $rating, $mapcount);
foreach my $filez (sort @fileslist) {
my @filedata = ();
my $inputfile = File::Spec->catfile( $input_dir, $filez);
open(IN, "<$inputfile") or die("Can't open $inputfile : $!\n");
while (<IN>) {
chomp;
s/\012//;
if (/Name: +<a +href="http:[^"]+\?nalicity\//gi .. /\G([^"]+)"/g) {
if (defined $1) {
($gametype, $name) = split /\//, $1;
$name =~ s/\.[^.]*$//;
};
} elsif (/Rating: +\(1-10\) +([\d\.]+)/) {
$rating = $1;
$listing{$gametype}{$name} = [$filez, $rating];
++$mapcount;
last; # OK we got what we needed, skip to end of file and open next or exit loop
}
}
close(IN);
}
print DIAGNOSTIC scalar(keys %listing), " gametypes were found. $mapcount Maps were rated.\n";
print scalar(keys %listing), " gametypes were found. $mapcount Maps were rated.\n";
print "See $diagnostic for details.\n";
#begin output section and create it as html, but as an external file rather than running this as a CGI script.
open(OUT, ">$outfile") or die("Cannot open $outfile, $!");
print OUT start_html();
print OUT h3({-align=>'center'}, " TIMESTAMP: ". strftime("%A %B %d %X %Y", gmtime). " Greenwich Mean Time ");
print OUT p({-align=>'center'}, b("$mapcount Unreal Tournament Reviews")),
p(' ');
# loop thru the HOHOL (Hash of Hash of Lists) appropriately, and format the resultant output
foreach my $gt (sort (keys %listing)) {
print DIAGNOSTIC "Gametype = $gt\n"; #diagnostics
print OUT p( b( uc($gt) ) ) . "\n<p>";
foreach (sort (keys %{$listing{$gt}})) {
my $uc_review = $_;
# 'fancy' substr requires perl versions later than 5.004, and makes filenames look nicer with uppercase.
substr $uc_review, 0, index($uc_review, "-") + 2, uc(substr $uc_review, 0, index($uc_review, "-") + 2) if $] > 5.004 ;
#
print DIAGNOSTIC "Mapname = $uc_review -- Filename = $listing{$gt}{$_}[0] -- Rating = $listing{$gt}{$_}[1]\n"; #diagnostics
print OUT "$uc_review " . '(' . '<a href="reviews/' . $listing{$gt}{$_}[0] . '" target="_new">' . $listing{$gt}{$_}[1] . '</a>), ' . "\n";
}
print DIAGNOSTIC "----\n"; # diagnostics
print OUT "</p>\n";
}; # end HOHOL loop
print OUT end_html();
close(OUT);
close(DIAGNOSTIC);
=== cut here ===
As you can see, I've learned a lot in only a month. :) (hearty thank-you's to all of you here in c.l.p.m. -- I'd never have gotten this far, this
fast, without help.)
Again, sorry for the stretched linewrapping. If anyone has a problem with it, I'll be happy to post the code from this script on my website if you
want to see it. We now return you to your regularly scheduled newsgroup and linewrapping.
--
send mail to mactech (at) webdragon (dot) net instead of the above address.
this is to prevent spamming. e-mail reply-to's have been altered
to prevent scan software from extracting my address for the purpose
of spamming me, which I hate with a passion bordering on obsession.
------------------------------
Date: Tue, 30 May 2000 07:36:15 -0700
From: "Godzilla!" <godzilla@stomp.stomp.tokyo>
Subject: Re: oops Re: seeking method to encode email addresses in web pageforms
Message-Id: <3933D1DF.9B6AC663@stomp.stomp.tokyo>
The WebDragon wrote:
> In article <393066CA.319D5EEE@stomp.stomp.tokyo>, "Godzilla!"
> <godzilla@stomp.stomp.tokyo> wrote:
> | Extracted from Web Dragon's recent
> | series of articles.
> |
> | PERL 5 VERSION
> | (NO PRINT AT ALL)
> (no, because I don't dereference the HOHOL until later on
> in the output section)
Precisely. This is why my Perl 4 code does four
times as much with one-fourth the amount of coding
as a Perl 5 version.
Godzilla!
------------------------------
Date: Tue, 30 May 2000 14:05:46 GMT
From: Ilja Tabachnik <billy@arnis-bsl.com>
Subject: Re: Perl forking question (multiple times)
Message-Id: <8h0hrb$98u$1@nnrp1.deja.com>
In article <39332596.C60A305D@nwu.edu>,
j-hernandez2@nwu.edu wrote:
> Hello,
>
> Right now my program is forking once, so that it runs 2 processes at the
> same time. I was wondering if I could then have those two processes fork
> themselves so that then there are 4 processes running (2^2) ? Is this
> possible?
Why not ?
> If so, is it wise, would it be too much server load for one
> task?
It is task/OS/hardware specific.
> Also, I am calling $SIG{CHLD} = 'IGNORE';
> at the top of my scipt as well as right before I fork. It's all
> inherited right, so I don't have to call it again when I fork a second
> time (within the first fork), right?
>
AFAIK setting $SIG{...} to IGNORE is inherited by child processes.
BTW, if you use it to avoid zombies, you may want to read 'perldoc perlipc'
(or http://www.cpan.org/doc/manual/html/pod/perlipc.html), section "Signals",
which contains examples on how to solve this problem.
Hope this helps.
Ilja.
Sent via Deja.com http://www.deja.com/
Before you buy.
------------------------------
Date: Tue, 30 May 2000 13:11:24 GMT
From: Elaine Ashton <elaine@chaos.wustl.edu>
Subject: Re: Perl, system and Cron
Message-Id: <B559363B.509D%elaine@chaos.wustl.edu>
in article 8h05lf$9p7$1@news.tuwien.ac.at, e8825393 at
e8825393@stud2.tuwien.ac.at quoth:
> thanx john - the problem is, that not I am opening a file but linux does on
> entering the quota-command to modify the quota.
> And the user-wrights-problem - if the script is started by cron from user
> root - does'nt it have the same rights as root ?
wrights? Orville and Wilbur or something else...? Look, if you want to
modify quotas you are going to have to run it out of roots crontab or have
it suid root. I would highly recommend reading some of the documentation for
cron and the perlvar pod. While I gather that your native language is not
English, you can certainly find Unix documentation in a language near if not
in your native language. In system V you might know that there is an
/etc/defualt/cron which even allows you to set the default path and other
behaviour....if you are root.
e.
------------------------------
Date: Tue, 30 May 2000 15:11:49 +0200
From: "e8825393" <e8825393@stud2.tuwien.ac.at>
Subject: Re: Perl, system and Cron
Message-Id: <8h0emm$c34$1@news.tuwien.ac.at>
i'm no guru, but what envirenment could be necessary besides "editor" and
"visual" to edit a temporarily opened file. I already created a /tmp
directory in my /root - Directory just if the cronjob cant access the real
/tmp/ .
Any suggestion warmly wellcome !
"Clayton Scott" <claytons@americasm01.nt.com> wrote in message
news:3933A733.F847625A@americasm01.nt.com...
> e8825393 wrote:
> >
> > thanx john - the problem is, that not I am opening a file but linux does
on
> > entering the quota-command to modify the quota.
> > And the user-wrights-problem - if the script is started by cron from
user
> > root - does'nt it have the same rights as root ?
>
> Yes, but it doesn't have the same Environment. If your script
> relies on any environment variables you'll have to set them from
> within your script.
>
> --
> Clayton Scott
------------------------------
Date: Tue, 30 May 2000 14:17:47 GMT
From: Ala Qumsieh <aqumsieh@hyperchip.com>
Subject: Re: Reading files into a list- faster?
Message-Id: <7a7lcc3ywi.fsf@Merlin.i-did-not-set--mail-host-address--so-shoot-me>
webqueen, queen of the web <webqueen@my-deja.com> writes:
> Is it more efficient to read a file into a list then operate on
> it, rather than reading it line-by-line?
It really depends on what you want to do. If the file is small, then I
don't see any significant problems with reading the whole file into an
array in one shot. When the file becomes large, though, it might have a
bigger impact on your script's memory consumption.
Also, if you want to process each line in the file, independent of other
lines, then it's best to read the file line by line, and do your
processing.
--Ala
------------------------------
Date: 30 May 2000 14:37:43 GMT
From: The WebDragon <nospam@devnull.com>
Subject: Re: seeking method to encode email addresses in web page forms
Message-Id: <8h0jnn$or4$0@216.155.32.164>
In article <392EB402.B812D4F1@stomp.stomp.tokyo>, "Godzilla!"
<godzilla@stomp.stomp.tokyo> wrote:
| The WebDragon wrote:
| >
| > "Godzilla!" <godzilla@stomp.stomp.tokyo> wrote:
|
| > [snip]
| > | TEST SCRIPT:
| > | ____________
| > |
| > [snip]
|
|
| > Fiddled with your script a bit, and have a few suggestions:
| > o don't use so much whitespace unless you *really* need to --
|
| These are my articles. I will format my articles
| in any fashion I choose, Adolf.
hmm.. let's see.. I make some helpful suggestions and you turn around
and insult me
| > I find this 'ajusted' script just as easy to read if not easier
|
| That's nice. Make all the adjustments you want.
Your disparaging tone does not reflect the nature or tone of my original
article at all. This is surprising.
| > o good placement of short comments makes short scripts
| > even shorter.. look below and think about it in the future. ;)
|
| These are my scripts, my efforts. I will write
| my scripts in any fashion I choose, Adolf.
more insults.
| > o always use #!perl -w switch
|
| I have no need for -w. I do not write errors
| into my code as a general rule and, I provide
| my own error checking codes, as recently
| exemplified in one of my articles.
no one deliberately writes errors into their code.. they HAPPEN. this
helps CATCH them.
| Training wheels are for sissified geeks.
well, I make no claims of being a 'sissified geek' however I'm happy to
ride with training wheels as it seems to be good programming practice,
and doesn't slow down the operation of the script *at all*.
I.e. I lose NOTHING by using them.
| > o always use strict; # good habits
|
| No, I will not do this, Adolf. I have
| no need for strict. I know how to write
| programs correctly.
More insults
| Training wheels are for sissified geeks.
This is starting to get personal now.
| > resultant 'adjusted' script:
|
| I will have to get back to you on your
| adjusted script once I have a fellow
| professor skilled in Egyptian Hieroglyphics
funny, I coded it all in ASCII. Did your system munge it into
hieroglyphics?
| translate it for me. For now, I will retain
| a copy of your script in my circular filing
| cabinet, right next to my desk.
yup the insults are definitely intentional.
| > note that I made good use of the 'format'
|
| yeah.. sure..
more disparaging tone
| > other than that, interesting trick with the sprintf...
| > I wasn't aware one could do that. :)
|
| Pay attention. You might just learn some
| really slick tricks from me, tricks nobody
| else seems to know, like how to write truly
| effective and imaginative programs which
| do things "experts" claim you cannot do.
Seems to me in my 35 years of experience that anyone that's so cocksure
of themselves that they think they never make any mistakes, is the one
most likely to do so, particularly when they least expect it.
if you don't back off on the personal insults, I suggest you get very
comfy with hearing from your uplink Sysadmin. I was nothing but helpful,
and I'm not going to reciprocate the insults; you ain't important enough
to push my buttons.
Have a nice day! :)
--
send mail to mactech (at) webdragon (dot) net instead of the above address.
this is to prevent spamming. e-mail reply-to's have been altered
to prevent scan software from extracting my address for the purpose
of spamming me, which I hate with a passion bordering on obsession.
------------------------------
Date: Tue, 30 May 2000 08:02:56 -0700
From: "Godzilla!" <godzilla@stomp.stomp.tokyo>
Subject: Re: seeking method to encode email addresses in web page forms
Message-Id: <3933D820.95E06CF1@stomp.stomp.tokyo>
The WebDragon wrote:
> hmm.. let's see.. I make some helpful suggestions and
> you turn around and insult me...
No, you made tactless unwarranted comments pertaining
to notions which are simply none of your business.
Your comments are exceptionally insulting. Use social
grace with me or I will blast you each time for your
intentional poorly veiled rudeness and insulting
commentary.
Play it straight with me and you won't have problems.
Play games and I will match you move-for-move and,
in the final end, call a checkmate along with taking
both your King and Queen, my old friend.
Practice social grace with me. I have dignity and
pride within an environment lacking these qualities.
Godzilla!
------------------------------
Date: Tue, 30 May 2000 13:39:14 GMT
From: Greg Bacon <gbacon@cs.uah.edu>
Subject: Statistics for comp.lang.perl.misc
Message-Id: <sj7h42kq5pj155@corp.supernews.com>
Following is a summary of articles spanning a 8 day period,
beginning at 22 May 2000 14:44:22 GMT and ending at
30 May 2000 13:52:45 GMT.
Notes
=====
- A line in the body of a post is considered to be original if it
does *not* match the regular expression /^\s{0,3}(?:>|:|\S+>|\+\+)/.
- All text after the last cut line (/^-- $/) in the body is
considered to be the author's signature.
- The scanner prefers the Reply-To: header over the From: header
in determining the "real" email address and name.
- Original Content Rating (OCR) is the ratio of the original content
volume to the total body volume.
- Find the News-Scan distribution on the CPAN!
<URL:http://www.perl.com/CPAN/modules/by-module/News/>
- Please send all comments to Greg Bacon <gbacon@cs.uah.edu>.
- Copyright (c) 2000 Greg Bacon.
Verbatim copying and redistribution is permitted without royalty;
alteration is not permitted. Redistribution and/or use for any
commercial purpose is prohibited.
Excluded Posters
================
perlfaq-suggestions\@(?:.*\.)?perl\.com
Totals
======
Posters: 589
Articles: 2114 (854 with cutlined signatures)
Threads: 494
Volume generated: 3686.9 kb
- headers: 1759.2 kb (34,091 lines)
- bodies: 1823.4 kb (62,076 lines)
- original: 1175.4 kb (43,799 lines)
- signatures: 102.2 kb (2,694 lines)
Original Content Rating: 0.645
Averages
========
Posts per poster: 3.6
median: 1 post
mode: 1 post - 323 posters
s: 9.1 posts
Posts per thread: 4.3
median: 3.0 posts
mode: 1 post - 112 threads
s: 5.5 posts
Message size: 1785.9 bytes
- header: 852.1 bytes (16.1 lines)
- body: 883.3 bytes (29.4 lines)
- original: 569.3 bytes (20.7 lines)
- signature: 49.5 bytes (1.3 lines)
Top 10 Posters by Number of Posts
=================================
(kb) (kb) (kb) (kb)
Posts Volume ( hdr/ body/ orig) Address
----- -------------------------- -------
113 186.7 (112.0/ 61.3/ 37.1) Tom Phoenix <rootbeer@redcat.com>
84 150.6 ( 59.2/ 81.8/ 42.2) Larry Rosler <lr@hpl.hp.com>
77 189.9 ( 70.3/119.5/ 94.2) "Godzilla!" <godzilla@stomp.stomp.tokyo>
53 87.6 ( 43.6/ 43.1/ 20.3) Jonathan Stowe <gellyfish@gellyfish.com>
51 75.0 ( 44.1/ 30.5/ 18.9) Bart Lateur <bart.lateur@skynet.be>
50 96.1 ( 42.3/ 45.9/ 26.3) Gwyn Judd <tjla@guvfybir.qlaqaf.bet>
43 92.1 ( 37.8/ 47.0/ 18.7) Dave Cross <dave@dave.org.uk>
42 66.3 ( 32.0/ 34.2/ 14.8) abigail@arena-i.com
35 74.5 ( 28.5/ 36.3/ 35.4) The WebDragon <nospam@devnull.com>
34 54.1 ( 24.5/ 29.3/ 17.6) Jeff Zucker <jeff@vpservices.com>
These posters accounted for 27.5% of all articles.
Top 10 Posters by Volume
========================
(kb) (kb) (kb) (kb)
Volume ( hdr/ body/ orig) Posts Address
-------------------------- ----- -------
189.9 ( 70.3/119.5/ 94.2) 77 "Godzilla!" <godzilla@stomp.stomp.tokyo>
186.7 (112.0/ 61.3/ 37.1) 113 Tom Phoenix <rootbeer@redcat.com>
150.6 ( 59.2/ 81.8/ 42.2) 84 Larry Rosler <lr@hpl.hp.com>
96.1 ( 42.3/ 45.9/ 26.3) 50 Gwyn Judd <tjla@guvfybir.qlaqaf.bet>
92.1 ( 37.8/ 47.0/ 18.7) 43 Dave Cross <dave@dave.org.uk>
87.6 ( 43.6/ 43.1/ 20.3) 53 Jonathan Stowe <gellyfish@gellyfish.com>
75.0 ( 44.1/ 30.5/ 18.9) 51 Bart Lateur <bart.lateur@skynet.be>
74.5 ( 28.5/ 36.3/ 35.4) 35 The WebDragon <nospam@devnull.com>
66.3 ( 32.0/ 34.2/ 14.8) 42 abigail@arena-i.com
54.1 ( 24.5/ 29.3/ 17.6) 34 Jeff Zucker <jeff@vpservices.com>
These posters accounted for 29.1% of the total volume.
Top 10 Posters by OCR (minimum of five posts)
==============================================
(kb) (kb)
OCR orig / body Posts Address
----- -------------- ----- -------
1.000 ( 2.8 / 2.8) 6 Guest <guest@yale.edu>
1.000 ( 1.1 / 1.1) 6 "Adolf" <adolftw@tcts1.seed.net.tw>
0.976 ( 35.4 / 36.3) 35 The WebDragon <nospam@devnull.com>
0.943 ( 1.4 / 1.5) 6 "A Pietro" <apietro@my-deja.com>
0.836 ( 20.5 / 24.5) 24 "Alan J. Flavell" <flavell@mail.cern.ch>
0.817 ( 1.3 / 1.5) 5 "Thomas Åhlen" <thomas2@dalnet.se>
0.788 ( 94.2 /119.5) 77 "Godzilla!" <godzilla@stomp.stomp.tokyo>
0.768 ( 3.7 / 4.9) 6 Mur <jboesNOjbSPAM@qtm.net.invalid>
0.739 ( 5.6 / 7.6) 8 Chris <csorensen@uptimeresources.net>
0.735 ( 2.5 / 3.4) 5 sergei_kucherov <sergei_kucherov@3com.com>
Bottom 10 Posters by OCR (minimum of five posts)
=================================================
(kb) (kb)
OCR orig / body Posts Address
----- -------------- ----- -------
0.426 ( 1.8 / 4.3) 5 Steve Leibel <stevel@coastside.net>
0.404 ( 3.8 / 9.5) 8 Bert IJff <bert@scanlaser.nl>
0.403 ( 1.8 / 4.4) 5 Tk Soh <r28629@email.sps.mot.com>
0.398 ( 18.7 / 47.0) 43 Dave Cross <dave@dave.org.uk>
0.382 ( 2.0 / 5.2) 5 zak@mighty.co.za
0.362 ( 0.9 / 2.4) 7 =?iso-8859-1?Q?Thorbj=F8rn?= Ravn Andersen <thunderbear@bigfoot.com>
0.358 ( 4.8 / 13.4) 22 Brad Baxter <bmb@dataserv.libs.uga.edu>
0.328 ( 3.5 / 10.5) 9 Sue Spence <sue@pennine.com>
0.317 ( 1.3 / 4.2) 11 Tom Briles <sariq@texas.net>
0.215 ( 2.0 / 9.1) 10 Carlos Ramirez <cramirez@gte.net>
79 posters (13%) had at least five posts.
Top 10 Threads by Number of Posts
=================================
Posts Subject
----- -------
52 Waxing Philosophical
49 Basic scripting question
46 seeking method to encode email addresses in web page forms
33 Can't Knit Two Files Together
32 file locking
26 join " ", do {$x++}, do {$x++}, do {$x++};
23 Perl unusable as a programming language
22 using param() and CGI.pm to store a persistent value
20 HALL OF SHAME :-)
17 Finding IP addresses+
These threads accounted for 15.1% of all articles.
Top 10 Threads by Volume
========================
(kb) (kb) (kb) (kb)
Volume ( hdr/ body/ orig) Posts Subject
-------------------------- ----- -------
103.7 ( 48.2/ 53.4/ 32.4) 52 Waxing Philosophical
99.4 ( 48.3/ 48.4/ 30.4) 46 seeking method to encode email addresses in web page forms
90.2 ( 47.5/ 37.7/ 24.9) 49 Basic scripting question
87.5 ( 27.8/ 57.7/ 40.2) 33 Can't Knit Two Files Together
58.5 ( 31.5/ 25.6/ 13.5) 32 file locking
52.2 ( 17.6/ 30.8/ 24.5) 22 using param() and CGI.pm to store a persistent value
51.9 ( 26.2/ 23.5/ 14.6) 23 Perl unusable as a programming language
41.0 ( 22.0/ 17.8/ 10.5) 26 join " ", do {$x++}, do {$x++}, do {$x++};
39.6 ( 19.9/ 19.1/ 11.3) 20 HALL OF SHAME :-)
39.6 ( 5.9/ 33.4/ 28.1) 7 HPUX and Perl
These threads accounted for 18.0% of the total volume.
Top 10 Threads by OCR (minimum of five posts)
==============================================
(kb) (kb)
OCR orig / body Posts Subject
----- -------------- ----- -------
0.921 ( 2.1/ 2.3) 6 Help with emailing files
0.841 ( 28.1/ 33.4) 7 HPUX and Perl
0.826 ( 8.8/ 10.7) 6 Chop
0.822 ( 4.8/ 5.9) 7 Visibility of package lexicals in debugger
0.816 ( 4.6/ 5.6) 6 Poll - IP voting timeouts......?
0.806 ( 11.1/ 13.8) 12 updated : Re: regexes *sigh* damn I hate these things
0.800 ( 3.5/ 4.4) 9 help
0.795 ( 24.5/ 30.8) 22 using param() and CGI.pm to store a persistent value
0.784 ( 8.0/ 10.3) 5 SDBM_File - %HASH not returning all records ?
0.778 ( 4.1/ 5.2) 5 Is anything planned re: Perl error reporting?
Bottom 10 Threads by OCR (minimum of five posts)
=================================================
(kb) (kb)
OCR orig / body Posts Subject
----- -------------- ----- -------
0.450 ( 4.7 / 10.4) 10 "or" vs. "||" - operator precedence question
0.450 ( 5.9 / 13.1) 13 Passing an array to a regexp
0.449 ( 1.3 / 2.9) 5 can't run the perl CGI script.
0.434 ( 1.9 / 4.4) 8 Building a hash array
0.427 ( 2.3 / 5.4) 7 Couple of Newbie RegExp questions
0.424 ( 1.9 / 4.4) 5 $ENV{'REMOTE_USER'};
0.391 ( 2.5 / 6.3) 10 How do I extract 2nd and 3rd characters from a string
0.357 ( 1.2 / 3.3) 5 Forum for 'how to do it' questions?
0.354 ( 1.0 / 2.9) 5 pl2exe missing from ActivePerl
0.345 ( 1.3 / 3.9) 5 ([^|]+) vs (.*?)
141 threads (28%) had at least five posts.
Top 10 Targets for Crossposts
=============================
Articles Newsgroup
-------- ---------
34 comp.lang.perl.moderated
23 comp.lang.perl
23 comp.lang.perl.modules
15 comp.unix.shell
14 comp.unix.programmer
14 comp.sys.sun.admin
14 comp.unix.admin
10 alt.perl
8 comp.sys.hp.hpux
7 comp.infosystems.www.servers.unix
Top 10 Crossposters
===================
Articles Address
-------- -------
11 Larry Rosler <lr@hpl.hp.com>
9 abigail@arena-i.com
8 Dave Vandervies <dj3vande@student.math.uwaterloo.ca>
8 Tom Phoenix <rootbeer@redcat.com>
6 "debra karr" <dkarr1@san.rr.com>
5 Simon Cozens <simon@brecon.co.uk>
4 Denis Haskin <dwhaskin@earthlink.net>
4 bmetcalf@baynetworks.com
4 Michael Sternberg <sternberg@phys.uni-paderborn.de>
4 Dan Sugalski <dan@tuatha.sidhe.org>
------------------------------
Date: Tue, 30 May 2000 14:47:59 GMT
From: replytousenet@usenet.com (Beetlejuice)
Subject: which perl module for http perl script
Message-Id: <3933d3ed.2795635@news.state.mn.us>
I am writing a perl script to access a given web page, then
check any link found on the page to insure that it still is up --
i.e. that if you access that link that it returns the given page
with no errors.
I have downloaded a ton of www related modules, but would
appreciate if someone could narrow down which ones
would be useful.
thanks!
------------------------------
Date: 16 Sep 99 21:33:47 GMT (Last modified)
From: Perl-Users-Request@ruby.oce.orst.edu (Perl-Users-Digest Admin)
Subject: Digest Administrivia (Last modified: 16 Sep 99)
Message-Id: <null>
Administrivia:
The Perl-Users Digest is a retransmission of the USENET newsgroup
comp.lang.perl.misc. For subscription or unsubscription requests, send
the single line:
subscribe perl-users
or:
unsubscribe perl-users
to almanac@ruby.oce.orst.edu.
| NOTE: The mail to news gateway, and thus the ability to submit articles
| through this service to the newsgroup, has been removed. I do not have
| time to individually vet each article to make sure that someone isn't
| abusing the service, and I no longer have any desire to waste my time
| dealing with the campus admins when some fool complains to them about an
| article that has come through the gateway instead of complaining
| to the source.
To submit articles to comp.lang.perl.announce, send your article to
clpa@perl.com.
To request back copies (available for a week or so), send your request
to almanac@ruby.oce.orst.edu with the command "send perl-users x.y",
where x is the volume number and y is the issue number.
For other requests pertaining to the digest, send mail to
perl-users-request@ruby.oce.orst.edu. Do not waste your time or mine
sending perl questions to the -request address, I don't have time to
answer them even if I did know the answer.
------------------------------
End of Perl-Users Digest V9 Issue 3206
**************************************