|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectedu.stanford.nlp.trees.AbstractTreebankLanguagePack
edu.stanford.nlp.trees.international.tuebadz.TueBaDZLanguagePack
public class TueBaDZLanguagePack
Language pack for the Tuebingen Treebank of Written German (TueBa-D/Z). http://www.sfs.nphil.uni-tuebingen.de/en_tuebadz.shtml This treebank is in utf-8.
Field Summary |
---|
Fields inherited from class edu.stanford.nlp.trees.AbstractTreebankLanguagePack |
---|
DEFAULT_ENCODING, DEFAULT_GF_CHAR, gfCharacter |
Constructor Summary | |
---|---|
TueBaDZLanguagePack()
Gives a handle to the TreebankLanguagePack |
|
TueBaDZLanguagePack(boolean leaveGF)
Make a new language pack with grammatical functions used based on the value of leaveGF |
|
TueBaDZLanguagePack(boolean useLimitedGF,
boolean leaveGF,
char gfChar)
Make a new language pack with grammatical functions used based on the value of leaveGF and marked with the character gfChar. |
|
TueBaDZLanguagePack(boolean leaveGF,
char gfChar)
Make a new language pack with grammatical functions used based on the value of leaveGF and marked with the character gfChar. |
Method Summary | |
---|---|
String |
basicCategory(String category)
Returns the basic syntactic category of a String. |
String |
getEncoding()
Return the input Charset encoding for the Treebank. |
HeadFinder |
headFinder()
The HeadFinder to use for your treebank. |
boolean |
isLeaveGF()
|
boolean |
isLimitedGF()
|
char[] |
labelAnnotationIntroducingCharacters()
Return an array of characters at which a String should be truncated to give the basic syntactic category of a label. |
static void |
main(String[] args)
Prints a few aspects of the TreebankLanguagePack, just for debugging. |
String[] |
punctuationTags()
Returns a String array of punctuation tags for this treebank/language. |
String[] |
punctuationWords()
Returns a String array of punctuation words for this treebank/language. |
String[] |
sentenceFinalPunctuationTags()
Returns a String array of sentence final punctuation tags for this treebank/language. |
String[] |
sentenceFinalPunctuationWords()
Returns a String array of sentence final punctuation words for this treebank/language. |
void |
setLeaveGF(boolean leaveGF)
|
void |
setLimitedGF(boolean limitedGF)
|
String[] |
startSymbols()
Returns a String array of treebank start symbols. |
String |
stripGF(String category)
Returns the category for a String with everything following the gf character (which may be language specific) stripped. |
String |
treebankFileExtension()
Returns the extension of treebank files for this treebank. |
TreeReaderFactory |
treeReaderFactory()
Returns a TreeReaderFactory suitable for general purpose use with this language/treebank. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public TueBaDZLanguagePack()
public TueBaDZLanguagePack(boolean leaveGF)
public TueBaDZLanguagePack(boolean leaveGF, char gfChar)
public TueBaDZLanguagePack(boolean useLimitedGF, boolean leaveGF, char gfChar)
Method Detail |
---|
public char[] labelAnnotationIntroducingCharacters()
labelAnnotationIntroducingCharacters
in interface TreebankLanguagePack
labelAnnotationIntroducingCharacters
in class AbstractTreebankLanguagePack
public String[] punctuationTags()
AbstractTreebankLanguagePack
punctuationTags
in interface TreebankLanguagePack
punctuationTags
in class AbstractTreebankLanguagePack
public String[] punctuationWords()
AbstractTreebankLanguagePack
punctuationWords
in interface TreebankLanguagePack
punctuationWords
in class AbstractTreebankLanguagePack
public String[] sentenceFinalPunctuationTags()
AbstractTreebankLanguagePack
sentenceFinalPunctuationTags
in interface TreebankLanguagePack
sentenceFinalPunctuationTags
in class AbstractTreebankLanguagePack
public String[] startSymbols()
AbstractTreebankLanguagePack
startSymbols
in interface TreebankLanguagePack
startSymbols
in class AbstractTreebankLanguagePack
public String[] sentenceFinalPunctuationWords()
TreebankLanguagePack
public String treebankFileExtension()
TreebankLanguagePack
public String basicCategory(String category)
AbstractTreebankLanguagePack
labelAnnotationIntroducingCharacters()
.
However, there is also special case stuff to deal with
labelAnnotationIntroducingCharacters in category labels:
(i) if the first char is in this set, it's never truncated
(e.g., '-' or '=' as a token), and (ii) if it starts with
one of this set, a second instance of the same item from this set is
also excluded (to deal with '-LLB-', '-RCB-', etc.).
basicCategory
in interface TreebankLanguagePack
basicCategory
in class AbstractTreebankLanguagePack
category
- The whole String name of the label
public String stripGF(String category)
TreebankLanguagePack
stripGF
in interface TreebankLanguagePack
stripGF
in class AbstractTreebankLanguagePack
category
- The String name of the label (may previously have had basic category called on it)
public boolean isLeaveGF()
public void setLeaveGF(boolean leaveGF)
public String getEncoding()
Charset
class.
getEncoding
in interface TreebankLanguagePack
getEncoding
in class AbstractTreebankLanguagePack
public static void main(String[] args)
public boolean isLimitedGF()
public void setLimitedGF(boolean limitedGF)
public TreeReaderFactory treeReaderFactory()
AbstractTreebankLanguagePack
treeReaderFactory
in interface TreebankLanguagePack
treeReaderFactory
in class AbstractTreebankLanguagePack
public HeadFinder headFinder()
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |