[Teammetrics-discuss] Findings from NNTPStat and Web Archive Parser

Andreas Tille andreas at an3as.eu
Mon Dec 12 22:55:36 UTC 2011


On Tue, Dec 13, 2011 at 12:05:21AM +0530, Sukhbir Singh wrote:
> Because we have only a few fields on the web archives, [From, Date,
> Subject, Message-ID], we will be missing many of the fields that are
> found in the original mbox archives. As such, an exact replication
> won't be possible. I hope that is acceptable?

Sure.  I mean we want to replicate the data we have.  That seems to be
quite similar to what we get after our stripping routine.
 
> A typical mbox for lists.d.o contains lots of fields; however, we are
> restricted to the ones mentioned above.

... which are perfectly sufficient for the moment.
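
As a rough illustration of what restricting an mbox to those four fields
could look like, here is a minimal sketch using Python's standard mailbox
module.  The function name strip_to_web_fields and the constant
WEB_ARCHIVE_FIELDS are hypothetical and not taken from the teammetrics
code; the actual stripping routine may well work differently.

```python
import mailbox

# The four headers available from the web archives, as listed in this thread.
# The constant name is an assumption made for this sketch.
WEB_ARCHIVE_FIELDS = ("From", "Date", "Subject", "Message-ID")


def strip_to_web_fields(mbox_path):
    """Return one dict per message, keeping only the web-archive fields.

    Headers missing from a message are stored as None.
    """
    rows = []
    box = mailbox.mbox(mbox_path)
    try:
        for message in box:
            rows.append({field: message.get(field) for field in WEB_ARCHIVE_FIELDS})
    finally:
        box.close()
    return rows
```

Running this over an original mbox should give records comparable to what
the web archive parser can produce, since every other header is dropped.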

Kind regards

        Andreas. 

-- 
http://fam-tille.de


