Bug#941569: RFS: sentencepiece/0.1.83+dfsg-1 [ITP] -- Unsupervised text tokenizer for Neural Network-based text generation

NOKUBI Takatsugu knok at daionet.gr.jp
Thu Oct 3 07:41:57 BST 2019


On Thu, 03 Oct 2019 13:16:53 +0900,
Mo Zhou wrote:
> Your copyright file is not complete
> https://bitbucket.org/tsuchm/pkg-sentencepiece/src/master/debian/copyright

Thank you for your pointing out.

Fourtunately, the data directory is not essential. The software is
forcused to build your own tokenizer model.

> Besides, the packaging of tensorflow is stalled, as it's difficult
> to tame the 4.5 million lines of code without a usable build system.
> For a long time the users (including myself) have to (somewhat)
> depend on third party ecosystems until the day Google started to
> rethink about distribution integration (basically hopeless).

I agree. It seems too complex and quite fast to develop.

> Apart from the science team, you are welcome to join the deep learning
> team as well: https://salsa.debian.org/deeplearning-team
> (it's an informal team)

Ok, I sent a request.



More information about the debian-science-maintainers mailing list