-
Notifications
You must be signed in to change notification settings - Fork 0
/
README.txt
49 lines (30 loc) · 2.88 KB
/
README.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
Aelius Brazilian Portuguese POS-Tagger and Corpus Annotation Tool http://sourceforge.net/projects/aelius/
Author: Leonel F. de Alencar <[email protected]>
Copyright (C) 2010-2013 Leonel F. de Alencar
For license information, see LICENSE.txt
Aelius is an ongoing free software, open source project aiming at developing a suite of Python, NLTK-based modules and interfaces to external freely available tools for shallow parsing of Brazilian Portuguese. It also includes language resources such as language models, sample texts, and gold standards. Presently, Aelius already offers facilities for POS-tagging and chunking corpora and outputting annotations in different formats, such as in XML in the TEI P5 encoding scheme.
Documentation: http://aelius.sourceforge.net/manual.html
Mailing Lists: see Aelius homepage
Contributing: If you would like to contribute to Aelius,
please see Aelius homepage
Donating: Have you found the toolkit helpful? Please support Aelius development
by donating to the project via PayPal, using the link on the Aelius homepage.
Redistributing: Aelius source code is distributed under the Apache 2.0 License.
Aelius documentation is distributed under the Creative Commons
Attribution-Noncommercial-No Derivative Works 3.0 United States license.
Aelius corpora and language models are provided under the terms
given in the README and/or LICENSE file
for these corpora and language models; all are redistributable, and available for non-commercial use.
Aelius may be freely redistributed, subject to the provisions of these licenses.
Citing: If you publish work that uses Aelius, please cite the following:
APA style
Alencar, L. F. de. (2013). Aelius Brazilian Portuguese POS-Tagger and Corpus Annotation Tool (Version 0.9.7) [Software]. Available from http://aelius.sourceforge.net.
ABNT style
Alencar, Leonel Figueiredo de. Aelius Brazilian Portuguese POS-Tagger and Corpus Annotation Tool, versão 0.9.7. Fortaleza: [s.n.], 2013. Disponível em: <http://aelius.sourceforge.net/>. Acesso em: 25 fev. 2013.
The first version of Aelius was presented at the following conference, whose proceedings will hopefully be published soon; a draft of the presentation is available on-line:
Alencar, Leonel Figueiredo de. Aelius: uma ferramenta para anotação automática de corpora usando o NLTK. ELC 2010, The 9th Brazilian Corpus Linguistics Meeting, Porto Alegre, Brazil, Rio Grande do Sul Catholic University (PUCRS), October 8 and 9, 2010.
http://corpuslg.org/gelc/elc2010.php?paged=2
http://corpuslg.org/gelc/media/blogs/elc2010/slides/Figueiredo_de_Alencar.pdf
Acknowledgments: my Computational Linguistics students at the Universidade Federal do Ceará; Marcel Caraciolo for some code improvement suggestions and feedback on the Aelius documentation.
This file is based on NTLK's README.txt
http://code.google.com/p/nltk/source/browse/trunk/nltk/README.txt