[Teammetrics-discuss] Findings from NNTPStat and Web Archive Parser
Andreas Tille
andreas at an3as.eu
Mon Dec 12 22:55:36 UTC 2011
On Tue, Dec 13, 2011 at 12:05:21AM +0530, Sukhbir Singh wrote:
> Because we have only a few fields on the web archives, [From, Date,
> Subject, Message-ID], we will be missing a lot many fields that are
> found in the original mbox archives. As such, an exact replication
> won't be possible. I hope that is acceptable?
Sure. I mean we want to replicate those data wie have. Seems to be
quite similar to what we get after our stripping routine.
> A typical mbox for lists.d.o contains lots of fields, however we are
> restricted to the ones mentioned above.
... which are perfectly sufficient for the moment.
Kind regards
Andreas.
--
http://fam-tille.de
More information about the Teammetrics-discuss
mailing list