[Teammetrics-discuss] We have -- lists.debian.org!

Sukhbir Singh sukhbir.in at gmail.com
Fri Jul 8 18:13:28 UTC 2011


Hi,

We are now able to parse lists.debian.org successfully :-)

So we generate mbox archives after fetching the list archives from the
NNTP server and then parse those mbox archives, as we did in liststat.

Here is the result from debian-jr at lists.debian.org:

------------------------------------------------------------------------------+-------
 Ben Armstrong
       |   208
 Andreas Tille
       |   120
 Sam Hart
       |    30
 Andrew Sackville-West
       |    29
 Bill Kendrick
       |    23
 Rudy Godoy
       |    20
 Ron Johnson
       |    16
 Michelle Konzack
       |    15

Note that there are some lists that cannot be accessed via NNTP (very
few). In such a case, there will be an error in the log file. The log
file is the same used by liststat. There is no separation on the user
side -- the lower level details are hidden from the user completely,
the way we wanted it.

And Andreas had his name as "Tille, Andreas" at many places, which we
easily fixed by adding it in updatenames.py and all was well :-)
There were some weird errors in older messages, like wrong formatting
of dates which I have handled. Our spam filter is also working
perfectly. I intend to test other mailing lists tonight just to be
completely sure.

I am happy with the code and that is a good sign! I will post more
results when I am done testing.

-- 
Sukhbir



More information about the Teammetrics-discuss mailing list