DEFAULT_REQUIREMENTS, STANFORD_CDC_TOKENIZE, STANFORD_CLEAN_XML, STANFORD_COLUMN_DATA_CLASSIFIER, STANFORD_COREF, STANFORD_COREF_MENTION, STANFORD_DEPENDENCIES, STANFORD_DETERMINISTIC_COREF, STANFORD_DOCDATE, STANFORD_ENTITY_MENTIONS, STANFORD_GENDER, STANFORD_KBP, STANFORD_LEMMA, STANFORD_LINK, STANFORD_MWT, STANFORD_NATLOG, STANFORD_NER, STANFORD_OPENIE, STANFORD_PARSE, STANFORD_POS, STANFORD_QUOTE, STANFORD_QUOTE_ATTRIBUTION, STANFORD_REGEXNER, STANFORD_RELATION, STANFORD_SENTIMENT, STANFORD_SSPLIT, STANFORD_TOKENIZE, STANFORD_TOKENSREGEX, STANFORD_TRUECASE, STANFORD_UD_FEATURES
Constructor and Description |
---|
MWTAnnotator(java.lang.String name, java.util.Properties props) |
Modifier and Type | Method and Description |
---|---|
void | annotate(Annotation annotation) The annotation process runs in two steps. |
void | loadMultiWordTokenMappings(java.util.HashMap<java.lang.String,java.util.List<java.lang.String>> dictionary, java.lang.String mapFilePath) |
java.util.Set<java.lang.Class<? extends CoreAnnotation>> | requirementsSatisfied() Returns the set of requirements that this annotator can provide. |
java.util.Set<java.lang.Class<? extends CoreAnnotation>> | requires() Returns the set of tasks that this annotator requires in order to run. |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
exactRequirements, unmount
public MWTAnnotator(java.lang.String name, java.util.Properties props)
public void loadMultiWordTokenMappings(java.util.HashMap<java.lang.String,java.util.List<java.lang.String>> dictionary, java.lang.String mapFilePath)
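The signature above suggests that the method fills the supplied dictionary from the file at mapFilePath, keyed by multi-word token with the component words as the value. The file format is not documented here, so the following is only a minimal sketch under a loudly stated assumption: one entry per line, with the token and a comma-separated word list separated by a tab. The class `MwtMappingLoaderSketch` and the method `loadMappings` are hypothetical names, and the sketch reads from a `BufferedReader` rather than a path so that it can be run without a file on disk.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;

/** Hypothetical stand-alone sketch; not the CoreNLP implementation. */
final class MwtMappingLoaderSketch {

  /**
   * Fills the dictionary from lines of the ASSUMED form "token<TAB>word1,word2".
   * Blank lines are skipped; any other malformed line is rejected.
   */
  static void loadMappings(HashMap<String, List<String>> dictionary, BufferedReader reader) {
    try {
      String line;
      while ((line = reader.readLine()) != null) {
        if (line.trim().isEmpty()) {
          continue; // ignore blank lines
        }
        String[] fields = line.split("\t");
        if (fields.length != 2) {
          throw new IllegalArgumentException("Expected two tab-separated fields: " + line);
        }
        // Key: the multi-word token. Value: its component words, in order.
        dictionary.put(fields[0], Arrays.asList(fields[1].split(",")));
      }
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  public static void main(String[] args) {
    HashMap<String, List<String>> dictionary = new HashMap<>();
    String sample = "du\tde,le\ndes\tde,les\n"; // illustrative entries only
    loadMappings(dictionary, new BufferedReader(new StringReader(sample)));
    System.out.println(dictionary.get("du")); // prints [de, le]
  }
}
```

Rejecting malformed lines instead of skipping them is a design choice of this sketch; the behavior of the real method on bad input is not stated in this page.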
public void annotate(Annotation annotation)
1. Split all tokens that are in the multiWordTokenMapping dictionary.
2. Run the part-of-speech model and split all words according to statisticalMappingFile.
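The first, dictionary-based step can be illustrated without the library. The sketch below is an assumption-laden stand-in, not the CoreNLP code: `MwtDictionarySplitSketch` and `splitTokens` are hypothetical names, tokens are plain strings rather than CoreNLP token objects, and lookup is by exact match. The second, statistical step depends on a part-of-speech model and is not sketched here.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-alone sketch of dictionary-based multi-word token splitting. */
final class MwtDictionarySplitSketch {

  /** Replaces each token found in the dictionary with its component words. */
  static List<String> splitTokens(List<String> tokens, Map<String, List<String>> dictionary) {
    List<String> words = new ArrayList<>();
    for (String token : tokens) {
      // Assumption: exact-match lookup; the real annotator may normalize case.
      List<String> parts = dictionary.get(token);
      if (parts == null) {
        words.add(token);     // not a multi-word token: keep as is
      } else {
        words.addAll(parts);  // multi-word token: emit its words in order
      }
    }
    return words;
  }

  public static void main(String[] args) {
    Map<String, List<String>> dictionary = new HashMap<>();
    dictionary.put("du", List.of("de", "le")); // illustrative entry only
    System.out.println(splitTokens(List.of("Il", "parle", "du", "chat"), dictionary));
    // prints [Il, parle, de, le, chat]
  }
}
```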
public java.util.Set<java.lang.Class<? extends CoreAnnotation>> requires()
Annotator
public java.util.Set<java.lang.Class<? extends CoreAnnotation>> requirementsSatisfied()
Annotator
Specified by: requirementsSatisfied in interface Annotator
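One way to read requires() and requirementsSatisfied() together is as an ordering check: each annotator in a pipeline should find everything it requires among what earlier annotators have provided. The following sketch shows that idea under stated assumptions. The `Stage` interface, the marker classes, and `firstUnsatisfied` are hypothetical stand-ins, and the requirement sets used for the stage named `mwt` are invented for illustration; the sets actually returned by MWTAnnotator are not listed on this page.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical stand-alone sketch of a requirements check; not CoreNLP code. */
final class RequirementCheckSketch {

  // Hypothetical marker classes standing in for CoreAnnotation key classes.
  static final class TokensKey {}
  static final class SentencesKey {}
  static final class MwtKey {}

  /** Minimal stand-in exposing only the two requirement methods. */
  interface Stage {
    Set<Class<?>> requires();
    Set<Class<?>> requirementsSatisfied();
  }

  static Stage stage(Set<Class<?>> needed, Set<Class<?>> provided) {
    return new Stage() {
      @Override public Set<Class<?>> requires() { return needed; }
      @Override public Set<Class<?>> requirementsSatisfied() { return provided; }
    };
  }

  /** Returns the index of the first stage with unmet requirements, or -1 if none. */
  static int firstUnsatisfied(List<Stage> pipeline) {
    Set<Class<?>> available = new HashSet<>();
    for (int i = 0; i < pipeline.size(); i++) {
      Stage current = pipeline.get(i);
      if (!available.containsAll(current.requires())) {
        return i;
      }
      available.addAll(current.requirementsSatisfied());
    }
    return -1;
  }

  public static void main(String[] args) {
    Stage tokenize = stage(Set.of(), Set.of(TokensKey.class));
    Stage ssplit = stage(Set.of(TokensKey.class), Set.of(SentencesKey.class));
    // Assumed requirement set, for illustration only.
    Stage mwt = stage(Set.of(TokensKey.class, SentencesKey.class), Set.of(MwtKey.class));

    System.out.println(firstUnsatisfied(List.of(tokenize, ssplit, mwt))); // prints -1
    System.out.println(firstUnsatisfied(List.of(tokenize, mwt, ssplit))); // prints 1
  }
}
```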