Tanl Linguistic Pipeline |
Read a plain text file, split into tokens and return one sentence at a time. More...
#include <SplitSentenceReader.h>
Classes | |
struct | BoundarySet |
Parameters defininig where to split a sentence. More... | |
Public Member Functions | |
SplitSentenceReader (std::istream *is, Corpus *corpus=0, BoundarySet const *bs=&defaultBS) | |
Creates a new SplitSentenceReader . | |
bool | MoveNext () |
Return true if there is another sentence available. | |
void | reset () |
Restart. | |
Static Public Attributes | |
static BoundarySet const | defaultBS |
Read a plain text file, split into tokens and return one sentence at a time.
Tanl::SplitSentenceReader::SplitSentenceReader | ( | std::istream * | is, | |
Corpus * | corpus = 0 , |
|||
BoundarySet const * | bs = &defaultBS | |||
) |
Creates a new SplitSentenceReader
.
is | the stream containing the sentence to read. | |
lang | the language of the document. |
bool Tanl::SplitSentenceReader::MoveNext | ( | ) | [virtual] |
Return true if there is another sentence available.
Reimplemented from Tanl::SentenceReader.
References Parser::Tokenizer::Current(), Parser::Tokenizer::MoveNext(), and Tanl::SentenceReader::sentence.