Tanl Linguistic Pipeline |
Public Member Functions | |
PosTagger (const char *modelFiles) | |
Argument. | |
Tanl::Enumerator< std::vector < Tanl::Token * > * > * | pipe (Tanl::Enumerator< Tanl::Token * > &ts) |
Create an Enumerator that pulls tokens from an Enumerator ts . |
Tanl::POS::PosTagger::PosTagger | ( | const char * | modelFiles | ) |
Argument.
modelFiles | is the name of the parameter file to be used by the tagger, and produced by training the tagger on an annotated corpus. |
Tanl::Enumerator< std::vector< Tanl::Token * > * > * Tanl::POS::PosTagger::pipe | ( | Tanl::Enumerator< Tanl::Token * > & | ts | ) | [virtual] |
Create an Enumerator that pulls tokens from an Enumerator ts
.
ts | an Enumerator<Token*> from which tokens are extracted in groups up to an EOS token, in order to form a sentence to tag. |
Enumerator<vector<Token*>>
of the tagged sentences produced by the tagger. The Token's
in the result Enumerator
are extensions of the corresponding input Token's
with the addition of two attributes: POSTAG LEMMA whose values represent respectively: the POS tag and the lemma of the Token
. Implements Tanl::IPipe< Tanl::Token *, std::vector< Tanl::Token * > * >.