Tanl Linguistic Pipeline |
Interface for tagging each token in a sentence with supplementary information, such as its part of speech. More...
#include <Tagger.h>
Public Member Functions | |
virtual std::vector< Token * > | tag (std::vector< Token * > *)=0 |
Enumerator< std::vector< Token * > * > * | pipe (Enumerator< std::vector< Token * > * > &ts)=0 |
Create an Enumerator that pulls tokens from an Enumerator ts . |
Interface for tagging each token in a sentence with supplementary information, such as its part of speech.
Enumerator<std::vector<Token*>*>* Tanl::Tagger::pipe | ( | Enumerator< std::vector< Token * > * > & | ts | ) | [pure virtual] |
Create an Enumerator that pulls tokens from an Enumerator ts
.
ts | an Enumerator<vector<Token*>*> from which tokens are extracted in groups up to an EOS token, in order to form a sentence to tag. |
Enumerator<vector<Token*>*>
of the tagged sentences produced by the tagger. The Token's
in the result Enumerator
are extensions of the corresponding input Token's
with the addition of two attributes: POSTAG LEMMA whose values represent respectively: the POS tag and the lemma of the Token
. Implements Tanl::IPipe< std::vector< Token * > *, std::vector< Token * > * >.