AUR (en) - ucto-git

Package Details: ucto-git 1-3

Git Clone URL: https://aur-dev.archlinux.org/ucto-git.git (read-only)
Package Base: ucto-git
Description: An advanced rule-based (regular-expression) and Unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline.
Upstream URL: https://languagemachines.github.io/ucto
Keywords: nlp tokenizer
Licenses: GPL
Conflicts: ucto
Provides: ucto
Submitter: proycon
Maintainer: proycon
Last Packager: proycon
Votes: 2
Popularity: 0.013932
First Submitted: 2015-05-22 17:48
Last Updated: 2016-07-11 17:16

Required by (7)

Sources (1)