This class defines an interlinearized text, which consists of a
collection of Paragraph objects.

__init__(self, file=None, fm_line='ref', fm_paragraph='id', fm_morpheme='m', fm_morpheme_gloss='g', fm_word='w')
Constructor for Text object.

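The constructor defaults name the field markers the class looks for. As a rough illustration of what those markers refer to, the sketch below splits a small backslash-coded record into (marker, value) pairs. It is a standalone stand-in and does not use this class or its StandardFormat base; the sample record, `SAMPLE`, and `split_fields` are hypothetical names and placeholder data introduced here.

```python
# Placeholder record using the default markers from the constructor
# signature (id, ref, w, m, g). The content is invented for illustration.
SAMPLE = r"""
\id P1
\ref P1.1
\w walked home
\m walk -ed home
\g walk -PST home
\ref P1.2
\w slept
\m slept
\g sleep.PST
"""


def split_fields(text):
    """Return a list of (marker, value) pairs, one per backslash-coded line."""
    fields = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line.startswith("\\"):
            continue  # skip blank lines and anything that is not a field
        parts = line[1:].split(None, 1)
        if not parts:
            continue  # a lone backslash carries no marker
        marker = parts[0]
        value = parts[1].strip() if len(parts) > 1 else ""
        fields.append((marker, value))
    return fields


fields = split_fields(SAMPLE)
```

In this sketch, `\id` opens a paragraph, `\ref` opens a line, and `\w`, `\m`, `\g` carry the word, morpheme, and morpheme gloss tiers of that line.
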
get_lines(self)
Obtain a list of line objects (ignoring paragraph structure).

get_paragraphs(self)
Obtain a list of paragraph objects.

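The difference between the two accessors can be pictured with a small standalone sketch: fields are grouped into paragraphs at each paragraph marker, and a flat line list simply ignores that grouping. This is an assumption about the general idea, not the class's actual implementation; `FIELDS`, `group_paragraphs`, and `flatten_lines` are hypothetical names, and plain dicts stand in for the Paragraph and line objects.

```python
# Placeholder (marker, value) pairs, as a field reader might produce them.
FIELDS = [
    ("id", "P1"),
    ("ref", "P1.1"), ("w", "walked home"),
    ("ref", "P1.2"), ("w", "slept"),
    ("id", "P2"),
    ("ref", "P2.1"), ("w", "woke"),
]


def group_paragraphs(fields, fm_paragraph="id", fm_line="ref"):
    """Group fields into paragraphs, each holding a list of lines."""
    paragraphs = []
    for marker, value in fields:
        if marker == fm_paragraph:
            paragraphs.append({"id": value, "lines": []})
        elif marker == fm_line:
            if not paragraphs:
                # A line before any paragraph marker gets an unnamed paragraph.
                paragraphs.append({"id": None, "lines": []})
            paragraphs[-1]["lines"].append({"ref": value, "tiers": {}})
        elif paragraphs and paragraphs[-1]["lines"]:
            # Any other field is stored as a tier of the current line.
            paragraphs[-1]["lines"][-1]["tiers"][marker] = value
    return paragraphs


def flatten_lines(paragraphs):
    """Return all lines in order, ignoring paragraph boundaries."""
    return [line for paragraph in paragraphs for line in paragraph["lines"]]


paragraphs = group_paragraphs(FIELDS)
lines = flatten_lines(paragraphs)
```
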
getLineFM(self)
Get field marker that identifies a new line.

setLineFM(self, lineHeadFieldMarker)
Change default field marker that identifies a new line.

getParagraphFM(self)
Get field marker that identifies a new paragraph.

setParagraphFM(self, paragraphHeadFieldMarker)
Change default field marker that identifies a new paragraph.

getWordFM(self)
Get field marker that identifies word tier.

setWordFM(self, wordFieldMarker)
Change default field marker that identifies word tier.

getMorphemeFM(self)
Get field marker that identifies morpheme tier.

setMorphemeFM(self, morphemeFieldMarker)
Change default field marker that identifies morpheme tier.

getMorphemeGlossFM(self)
Get field marker that identifies morpheme gloss tier.

setMorphemeGlossFM(self, morphemeGlossFieldMarker)
Change default field marker that identifies morpheme gloss tier.

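The setters matter when a file uses markers other than the defaults. The sketch below shows, under assumptions, how a marker configuration decides which fields are read as the word, morpheme, and gloss tiers. `Markers` and `pick_tiers` are hypothetical stand-ins for the getter/setter pairs above, and the `tx`, `mb`, `ge` markers are only an example of a non-default scheme.

```python
from dataclasses import dataclass


@dataclass
class Markers:
    """Hypothetical holder for the five field markers, with the documented defaults."""
    line: str = "ref"
    paragraph: str = "id"
    word: str = "w"
    morpheme: str = "m"
    morpheme_gloss: str = "g"


def pick_tiers(fields, markers):
    """Map fields whose marker is configured as a tier to a neutral tier name."""
    names = {
        markers.word: "word",
        markers.morpheme: "morpheme",
        markers.morpheme_gloss: "gloss",
    }
    return {names[marker]: value for marker, value in fields if marker in names}


# Placeholder line that uses non-default tier markers.
line_fields = [
    ("ref", "1"),
    ("tx", "walked"),
    ("mb", "walk -ed"),
    ("ge", "walk -PST"),
]

with_defaults = pick_tiers(line_fields, Markers())
with_custom = pick_tiers(
    line_fields, Markers(word="tx", morpheme="mb", morpheme_gloss="ge")
)
```

With the default markers none of the tiers in this sample are recognized; once the markers are changed to match the file, all three are picked up.
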
set_file(self, file)
Change file path set upon initialization.

parse(self)
Parse specified Shoebox file into Text object.

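One thing an interlinear parse eventually has to do is relate the morpheme tier to the morpheme gloss tier. The sketch below pairs the two by whitespace-separated position. This is a simplifying assumption made for illustration, not a description of how parse() aligns tiers; `align_tiers` is a hypothetical helper and the tier strings are placeholder data.

```python
def align_tiers(morphemes, glosses):
    """Pair morphemes with glosses by token position.

    Raises ValueError when the two tiers have different token counts,
    since positional pairing would then be unreliable.
    """
    m_tokens = morphemes.split()
    g_tokens = glosses.split()
    if len(m_tokens) != len(g_tokens):
        raise ValueError(
            "tier length mismatch: %d morphemes, %d glosses"
            % (len(m_tokens), len(g_tokens))
        )
    return list(zip(m_tokens, g_tokens))


pairs = align_tiers("walk -ed home", "walk -PST home")
```

Positional pairing breaks down when a gloss contains spaces or a tier is incomplete, which is why the sketch raises an error rather than guessing.
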
Inherited from corpora.toolbox.StandardFormat: close, fields, open, open_string, raw_fields
Inherited from object: __delattr__, __getattribute__, __hash__, __new__, __reduce__, __reduce_ex__, __repr__, __setattr__, __str__