[Python-modules-commits] [pyuca] 03/07: Import pyuca_1.1.2.orig.tar.xz
Daniel Stender
stender at moszumanska.debian.org
Sat Jun 25 07:01:34 UTC 2016
This is an automated email from the git hooks/post-receive script.
stender pushed a commit to branch master
in repository pyuca.
commit d0340aae505ca0a8c81783c02522167dbe6b3615
Author: Daniel Stender <stender at debian.org>
Date: Sat Jun 25 08:52:20 2016 +0200
Import pyuca_1.1.2.orig.tar.xz
---
.gitignore | 1 +
.travis.yml | 1 +
CONTRIBUTING.md | 7 +++++++
README.md | 16 +++++++++++++---
paper.md | 26 ++++++++++++++++++++++++++
setup.cfg | 2 ++
setup.py | 3 ++-
7 files changed, 52 insertions(+), 4 deletions(-)
diff --git a/.gitignore b/.gitignore
index e123436..a0ddb06 100644
--- a/.gitignore
+++ b/.gitignore
@@ -5,3 +5,4 @@ dist/
.coverage
htmlcov/
.tox/
+build/
diff --git a/.travis.yml b/.travis.yml
index d337912..76de759 100644
--- a/.travis.yml
+++ b/.travis.yml
@@ -3,6 +3,7 @@ python:
- "2.7"
- "3.3"
- "3.4"
+ - "3.5"
install:
- pip install flake8
- pip install coveralls
diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md
new file mode 100644
index 0000000..d66b77a
--- /dev/null
+++ b/CONTRIBUTING.md
@@ -0,0 +1,7 @@
+Pull requests are very welcome although for sizable tasks it may be worth
+discussing via a GitHub issue before undertaking too much work.
+
+All tests, including flake8 should pass and code coverage should stay at 100%.
+
+If you have any problems, questions or suggestions, please file an issue on
+GitHub.
diff --git a/README.md b/README.md
index 6db7458..f7359d2 100644
--- a/README.md
+++ b/README.md
@@ -3,11 +3,12 @@
[](https://travis-ci.org/jtauber/pyuca)
[](https://coveralls.io/r/jtauber/pyuca?branch=master)

+[](https://zenodo.org/badge/latestdoi/3769/jtauber/pyuca)
This is a Python implementation of the
[Unicode Collation Algorithm (UCA)](http://unicode.org/reports/tr10/). It
-passes 100% of the UCA conformances tests for Unicode 6.3.0 with a
-variable-weighting setting of Non-ignorable.
+passes 100% of the UCA conformance tests for Unicode 5.2.0 (Python 2.7) and
+Unicode 6.3.0 (Python 3.3+) with a variable-weighting setting of Non-ignorable.
## What do you use it for?
@@ -38,7 +39,8 @@ Usage example:
from pyuca import Collator
c = Collator()
- sorted_words = sorted(words, key=c.sort_key)
+ assert sorted(["cafe", "caff", "café"]) == ["cafe", "caff", "café"]
+ assert sorted(["cafe", "caff", "café"], key=c.sort_key) == ["cafe", "café", "caff"]
``Collator`` can also take an optional filename for specifying a custom
collation element table.
@@ -48,3 +50,11 @@ collation element table.
Python code is made available under an MIT license (see `LICENSE`).
`allkeys.txt` is made available under the similar license defined in
`LICENSE-allkeys`.
+
+## Contacting the Developer
+
+If you have any problems, questions or suggestions, it's best to file an issue
+on GitHub although you can also contact me at jtauber at jtauber.com.
+
+For more of my work on linguistics and Ancient Greek, see
+<http://jktauber.com/>.
diff --git a/paper.md b/paper.md
new file mode 100644
index 0000000..3d9536b
--- /dev/null
+++ b/paper.md
@@ -0,0 +1,26 @@
+---
+title: 'pyuca: a Python implementation of the Unicode Collation Algorithm'
+authors:
+ - name: J. K. Tauber
+ orcid: 0000-0001-6534-8866
+date: 18 May 2016
+---
+
+# Summary
+
+Collation, the sorting of strings, is an important part of computational work
+in corpus linguistics and digital humanities. Lexicographical sorting, however,
+is rarely appropriate for languages other than English. The Unicode Consortium
+has developed the Unicode Collation Algorithm (UCA) to solve this problem.
+
+pyuca is a Python implementation of the Unicode Collation Algorithm suitable
+for researchers doing text processing in Python. It passes 100% of the UCA
+conformance tests for Unicode 5.2.0 (Python 2.7) and 6.3.0 (Python 3.3+) with a variable-weighting setting of Non-ignorable.
+
+pyuca includes the Default Unicode Collation Element Table (DUCET) which
+provides a default collation suitable for many of the world's scripts.
+
+# References
+
+The Unicode Consortium. Unicode Collation Algorithm (Unicode Technical Standard
+ #10) http://unicode.org/reports/tr10/
diff --git a/setup.cfg b/setup.cfg
new file mode 100644
index 0000000..2a9acf1
--- /dev/null
+++ b/setup.cfg
@@ -0,0 +1,2 @@
+[bdist_wheel]
+universal = 1
diff --git a/setup.py b/setup.py
index 37a0540..58a36c1 100644
--- a/setup.py
+++ b/setup.py
@@ -2,7 +2,7 @@ from setuptools import setup
setup(
name="pyuca",
- version="1.1",
+ version="1.1.2",
description="a Python implementation of the Unicode Collation Algorithm",
license="MIT",
url="http://github.com/jtauber/pyuca",
@@ -16,6 +16,7 @@ setup(
"Programming Language :: Python :: 2.7",
"Programming Language :: Python :: 3.3",
"Programming Language :: Python :: 3.4",
+ "Programming Language :: Python :: 3.5",
"Topic :: Text Processing",
],
)
--
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/python-modules/packages/pyuca.git
More information about the Python-modules-commits
mailing list