[Teammetrics-discuss] Comparison between the old code and the new code.
Andreas Tille
andreas at an3as.eu
Thu Sep 8 19:18:05 UTC 2011
On Thu, Sep 08, 2011 at 11:59:43PM +0530, Sukhbir Singh wrote:
>
> Well, there should not be a bug because we are just fetching the
> messages form Gmane in a range() loop. The only possible explanation
> is that Gmane has deleted some messages due its own spam filter/
> implementation perhaps?
Yes, that's what I wanted to say: Gmane is just lacking some data (for
whatever reason) and your code has no chance to fetch it.
> But whether this results in significant change
> in numbers is to be investigated (I doubt it though).
>
> However, the database does seem to be populated with all the messages
> and `SELECT COUNT(*)` returns a number that matches the number of
> articles in Gmane. So why this is happening, I am not sure because the
> end result from the database *matches* up to the article count from
> Gmane. And I have verified this IIRC and I will do it again.
IMHO the way to verify is the following: Find a mailing list which
shows in a specific month a different number for a given author. Look
up the web archive on lists.debian.org and gmane to see where the mail
is missing. It seems like a good idea to investigate one of those lists
which do not have that much postings but show a certain difference.
> Heh, when will that happen is only a guess!
Sure. But I pinged yesterday - lets see how the next ping-cycle will
work.
> How do you recommend I test NNTPstat to find out where the problem is?
As I said: Just find a specific case by digging the archive manually.
Kind regards
Andreas.
--
http://fam-tille.de
More information about the Teammetrics-discuss
mailing list