Module senseval
Read from the Senseval 2 Corpus.
SENSEVAL [http://www.senseval.org/]: evaluation exercises for Word
Sense Disambiguation, organized by ACL-SIGLEX
[http://www.siglex.org/].
Prepared by Ted Pedersen <tpederse@umn.edu>, University of
Minnesota (http://www.d.umn.edu/~tpederse/data.html). Distributed with
permission.
The NLTK version of the Senseval 2 files uses well-formed XML. Each
instance of the ambiguous words "hard", "interest",
"line", and "serve" is tagged with a sense
identifier, and supplied with context.
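As a rough illustration of what "tagged with a sense identifier, and supplied with context" can look like in well-formed XML, the following sketch parses a tiny in-memory sample. The element names, attribute names, sense labels, and the helper `parse_instances` are all assumptions made for this example; the actual corpus files may use a different schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified sample. The element and attribute names and the
# sense labels are assumptions, not the documented corpus schema.
SAMPLE = """\
<corpus>
  <lexelt item="hard">
    <instance id="hard.001">
      <answer senseid="HARD1"/>
      <context>It was a <head>hard</head> decision to make .</context>
    </instance>
    <instance id="hard.002">
      <answer senseid="HARD2"/>
      <context>The bench was too <head>hard</head> to sit on .</context>
    </instance>
  </lexelt>
</corpus>
"""


def parse_instances(xml_text):
    """Return a list of (sense_id, head_word, context_text) tuples."""
    root = ET.fromstring(xml_text)
    results = []
    for instance in root.iter("instance"):
        sense_id = instance.find("answer").get("senseid")
        context = instance.find("context")
        head = context.find("head").text
        # itertext() walks nested text too, so the head word stays in place;
        # split/join normalizes any stray whitespace.
        text = " ".join("".join(context.itertext()).split())
        results.append((sense_id, head, text))
    return results


instances = parse_instances(SAMPLE)
for sense_id, head, text in instances:
    print(sense_id, head, text, sep=" | ")
```

Each tuple pairs the sense label with the ambiguous word and the sentence it occurred in, which is the information a word sense disambiguation experiment typically needs.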
items = ['hard', 'interest', 'line', 'serve']

raw(files=['hard', 'interest', 'line', 'serve'])
- Parameters:
    files (string or tuple(string)) - One or more Senseval files to be processed
- Returns: iterator over tuple
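The signature above accepts either a single file name or several, and returns an iterator over tuples. A minimal sketch of that calling convention follows. It is not the module's implementation: `raw_sketch`, `FAKE_CORPUS`, the sense labels, and the `(sense_id, context)` tuple shape are hypothetical stand-ins, since the documentation only states that tuples are yielded.

```python
ITEMS = ("hard", "interest", "line", "serve")

# Hypothetical in-memory stand-in for the corpus files. The sense labels and
# the (sense_id, context) tuple shape are assumptions for illustration only.
FAKE_CORPUS = {
    "hard": [("HARD1", "a hard decision")],
    "interest": [("interest_6", "interest rates rose")],
    "line": [("product", "a new line of cars")],
    "serve": [("SERVE10", "serve dinner at eight")],
}


def raw_sketch(files=ITEMS):
    """Yield (sense_id, context) tuples for one or more named files."""
    # Accept a bare string as shorthand for a one-element tuple.
    if isinstance(files, str):
        files = (files,)
    for name in files:
        if name not in FAKE_CORPUS:
            raise ValueError(f"unknown Senseval file: {name!r}")
        # Records are produced lazily, one at a time.
        for record in FAKE_CORPUS[name]:
            yield record


print(list(raw_sketch("hard")))
print(len(list(raw_sketch())))
```

Because the result is an iterator, a caller can loop over it directly or wrap it in `list(...)` when all records are needed at once.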