Index | index by Group | index by Distribution | index by Vendor | index by creation date | index by Name | Mirrors | Help | Search |
Name: textcat | Distribution: Fedora Project |
Version: 1.10 | Vendor: Fedora Project |
Release: 20.fc41 | Build date: Sun Jul 21 11:57:09 2024 |
Group: Unspecified | Build host: buildvm-s390x-05.s390.fedoraproject.org |
Size: 359561 | Source RPM: textcat-1.10-20.fc41.src.rpm |
Packager: Fedora Project | |
Url: http://www.let.rug.nl/~vannoord/TextCat/ | |
Summary: Written language identification |
TextCat is an implementation of the text categorization algorithm presented in Cavnar, W. B. and J. M. Trenkle, "N-Gram-Based Text Categorization". TextCat uses this the technique to implement a written language identification. At the moment, it knows about 69 natural languages (counting Esperanto as a natural language).
LGPLv2+
* Sat Jul 20 2024 Fedora Release Engineering <releng@fedoraproject.org> - 1.10-20 - Rebuilt for https://fedoraproject.org/wiki/Fedora_41_Mass_Rebuild * Sat Jan 27 2024 Fedora Release Engineering <releng@fedoraproject.org> - 1.10-19 - Rebuilt for https://fedoraproject.org/wiki/Fedora_40_Mass_Rebuild * Sat Jul 22 2023 Fedora Release Engineering <releng@fedoraproject.org> - 1.10-18 - Rebuilt for https://fedoraproject.org/wiki/Fedora_39_Mass_Rebuild * Sat Jan 21 2023 Fedora Release Engineering <releng@fedoraproject.org> - 1.10-17 - Rebuilt for https://fedoraproject.org/wiki/Fedora_38_Mass_Rebuild * Sat Jul 23 2022 Fedora Release Engineering <releng@fedoraproject.org> - 1.10-16 - Rebuilt for https://fedoraproject.org/wiki/Fedora_37_Mass_Rebuild
/usr/bin/textcat /usr/share/doc/textcat /usr/share/doc/textcat/CHANGES /usr/share/doc/textcat/COPYING /usr/share/doc/textcat/Copyright /usr/share/doc/textcat/README /usr/share/doc/textcat/textcat.pdf /usr/share/textcat /usr/share/textcat/lm /usr/share/textcat/lm/afrikaans.lm /usr/share/textcat/lm/albanian.lm /usr/share/textcat/lm/amharic-utf.lm /usr/share/textcat/lm/arabic-iso8859_6.lm /usr/share/textcat/lm/arabic-windows1256.lm /usr/share/textcat/lm/armenian.lm /usr/share/textcat/lm/basque.lm /usr/share/textcat/lm/belarus-windows1251.lm /usr/share/textcat/lm/bosnian.lm /usr/share/textcat/lm/breton.lm /usr/share/textcat/lm/bulgarian-iso8859_5.lm /usr/share/textcat/lm/catalan.lm /usr/share/textcat/lm/chinese-big5.lm /usr/share/textcat/lm/chinese-gb2312.lm /usr/share/textcat/lm/croatian-ascii.lm /usr/share/textcat/lm/czech-iso8859_2.lm /usr/share/textcat/lm/danish.lm /usr/share/textcat/lm/dutch.lm /usr/share/textcat/lm/english.lm /usr/share/textcat/lm/esperanto.lm /usr/share/textcat/lm/estonian.lm /usr/share/textcat/lm/finnish.lm /usr/share/textcat/lm/french.lm /usr/share/textcat/lm/frisian.lm /usr/share/textcat/lm/georgian.lm /usr/share/textcat/lm/german.lm /usr/share/textcat/lm/greek-iso8859-7.lm /usr/share/textcat/lm/hebrew-iso8859_8.lm /usr/share/textcat/lm/hindi.lm /usr/share/textcat/lm/hungarian.lm /usr/share/textcat/lm/icelandic.lm /usr/share/textcat/lm/indonesian.lm /usr/share/textcat/lm/irish.lm /usr/share/textcat/lm/italian.lm /usr/share/textcat/lm/japanese-euc_jp.lm /usr/share/textcat/lm/japanese-shift_jis.lm /usr/share/textcat/lm/korean.lm /usr/share/textcat/lm/latin.lm /usr/share/textcat/lm/latvian.lm /usr/share/textcat/lm/lithuanian.lm /usr/share/textcat/lm/malay.lm /usr/share/textcat/lm/manx.lm /usr/share/textcat/lm/marathi.lm /usr/share/textcat/lm/mingo.lm /usr/share/textcat/lm/nepali.lm /usr/share/textcat/lm/norwegian.lm /usr/share/textcat/lm/persian.lm /usr/share/textcat/lm/polish.lm /usr/share/textcat/lm/portuguese.lm /usr/share/textcat/lm/quechua.lm /usr/share/textcat/lm/romanian.lm /usr/share/textcat/lm/rumantsch.lm /usr/share/textcat/lm/russian-iso8859_5.lm /usr/share/textcat/lm/russian-koi8_r.lm /usr/share/textcat/lm/russian-windows1251.lm /usr/share/textcat/lm/sanskrit.lm /usr/share/textcat/lm/scots.lm /usr/share/textcat/lm/scots_gaelic.lm /usr/share/textcat/lm/serbian-ascii.lm /usr/share/textcat/lm/slovak-ascii.lm /usr/share/textcat/lm/slovak-windows1250.lm /usr/share/textcat/lm/slovenian-ascii.lm /usr/share/textcat/lm/slovenian-iso8859_2.lm /usr/share/textcat/lm/spanish.lm /usr/share/textcat/lm/swahili.lm /usr/share/textcat/lm/swedish.lm /usr/share/textcat/lm/tagalog.lm /usr/share/textcat/lm/tamil.lm /usr/share/textcat/lm/thai.lm /usr/share/textcat/lm/turkish.lm /usr/share/textcat/lm/ukrainian-koi8_u.lm /usr/share/textcat/lm/vietnamese.lm /usr/share/textcat/lm/welsh.lm /usr/share/textcat/lm/yiddish-utf.lm
Generated by rpm2html 1.8.1
Fabrice Bellet, Mon Nov 18 00:48:25 2024