Tanl Linguistic Pipeline

Class List

Here are the classes, structs, unions and interfaces with brief descriptions:
_generator
IXE::io::AbstractSocketAbstract base class for Sockets
IXE::AnyObjectGeneric object, used for reading/writing dynamically defined tables
APAveraged Perceptron Classifier
APAAveraged Passive-Aggressive Classifier
APASSparse Averaged Passive-Aggressive
APASVSparse Vector Averaged Passive-Aggressive
APDDense Average Perceptron
Parser::ApParserParser class using an Averaged Perceptron classifier
APSSparse Average Perceptron
APSVSparse Vector Average Perceptron
Tanl::ArabicLanguage
Tanl::ArabicMorphExtractor
IXE::ArrayArray of persistent objects
IXE::ArrayField< ArrayType >Arrays for fields
IXE::ArrayField< ArrayOf< T > >
IXE::ArrayField< std::vector< char const * > >
IXE::ArrayField< std::vector< std::string > >
IXE::ArrayField< std::vector< T > >
IXE::ArrayOf< T >
IXE::ArrayType< T >
IXE::ArrayType< ArrayOf< T > >
IXE::ArrayType< std::vector< T > >
IXE::AssertionErrorThrown if an internal consistency check fails
Tanl::AttributeIndexAssociates to an attribute name the index of its value in a attribute vector
Tanl::AttributesRepresents the attributes of a token
Tanl::BanglaLanguage
IXE::io::basic_ipstream< CharT, Traits >
IXE::io::basic_opstream< CharT, Traits >
IXE::io::basic_pstream< CharT, Traits >
IXE::io::basic_pstreambuf< CharT, Traits >Class template for stream buffer
IXE::io::basic_rpstream< CharT, Traits >
Tanl::BasqueLanguage
Tanl::BasqueMorphExtractor
IXE::BigramTableA BigramTable contains the index (TermID) of the first word in the Lexicon starting with that bigram
Tanl::Text::Unicode::BlockInfo
Tanl::Text::Unicode::BlockLess
Tanl::SplitSentenceReader::BoundarySetParameters defininig where to split a sentence
Tanl::BulgarianLanguage
Tanl::BulgarianMorphExtractor
Cache
Tanl::CatalanLanguage
Tanl::Text::CCharThis is just a type specifier for use in CharBuffer
Tanl::Text::CharRepresentation of Unicode characters
std::char_traits< Tanl::Text::Char >
Tanl::Text::CharBuffer< CharTraits >A text buffer that provides a random access iterator through it
Tanl::ChineseLanguage
Tanl::Classifier::ClassifierAbstract class for ML classifiers
Tanl::Classifier::clearedContext
Tanl::Classifier::clearedNumber
IXE::CollectionErrorThrown for miscellaneous collection errors
Tanl::CombCorpusUsed to convert a Conll08 corpus replacing dependency links with the first non relexive arg link
Tanl::Classifier::compare1st
IXE::CompositeBuilder< T >Interface for creating fields
IXE::CompositeFieldComposite fields
IXE::Condition
IXE::conf< T >
IXE::conf< bool >A conf<bool> is a Var for containing the value of a Boolean configuration variable
IXE::conf< ColorMap >Conf<ColorMap> is a Var containing a set of pair<TermColor, TermWeight> associated to an HTML tag/meta-attribute
IXE::conf< Dictionary >
IXE::conf< float >A conf<float> is a Var for containing the value of a float configuration variable
IXE::conf< int >A conf<int> is a Var for containing the value of an integer configuration variable
IXE::conf< PatternSet >A conf_PatternSet contains a list of shell wildcard patterns
IXE::conf< Replacements >
IXE::conf< std::string >A conf<string> is a configuration variable containing a string value
IXE::conf< std::vector< std::string > >A conf_vector contains a set of configuration values
Tanl::NER::conf_featureA conf_feature contains a set of feature specifications
Parser::conf_featureA conf_feature contains a set of feature specifications
IXE::conf_set< T >A conf_set contains a set of configuration values
IXE::ConfigFileErrorThrown when reading a configuration file fails
IXE::ConfigurationA Configuration object that holds all the configuration variables
Tanl::Conll08Corpus
Tanl::ConllXCorpus
Tanl::ConllXSentenceReaderReader of sentences in CoNLL format: one token per line, attributes tab separated
IXE::IndexTable< Entry >::const_iterator
Tanl::Text::CharBuffer< CharTraits >::const_iterator
IXE::StringTable::const_iterator
Tanl::Attributes::const_iterator
IXE::PostingList::const_iterator
Tanl::ContextRepresents contexts of the tokens
Tanl::Classifier::ContextRepresentation of a set of features by means of their internal IDs
Tanl::POS::Context< CT >This is a language model to calculate P(C | A, B) with linear interpolation i.e
IXE::ConversionA Conversion maps a filename pattern to a conversion command
Tanl::CorpusRepresents common aspects of a Corpus
Tanl::CorpusFormatRepresentatation of Corpus data format
Tanl::CorpusFormatError
Tanl::CorpusMapAssociates a Corpus constructor to a corpus name
Parser::Correspondence
Tanl::POS::CountsRepresents the information of a trie node
Tanl::POS::TrieNode::counts_iteratorIterator of a TrieNode
IXE::Lexicon::Cursor
IXE::Table< T, IndexerClass >::CursorA Cursor on a Table<T, IndexerClass>
IXE::DynamicTable::Cursor
Tanl::CzechLanguage
Tanl::CzechMorphExtractor
Tanl::DanishLanguage
Tanl::DanishMorphExtractor
IXE::DBTRepresents items inserted/extracted from DB
decision_function
IXE::deref< P >
IXE::deref< T * >
Tanl::DgaCorpus
Tanl::DgaSentenceReaderRead a corpus and iterate over it returning one sentence at time
Tanl::Disposable
IXE::DocInfoAbstract class for document info
IXE::DocNotFoundErrorThrown when an attempt is made to access a document which is not in the collection
Tanl::DutchLanguage
Tanl::DutchMorphExtractor
IXE::DynamicTableDynamicTable
Tanl::Text::Encoding
Tanl::EnglishLanguage
Tanl::NER::EntityType
Tanl::SST::EntityType
IXE::EntryCompare
Tanl::Enumerator< T >Enumerator interface
IXE::Enumerator< T >Enumerator interface
IXE::eptacode
Tanl::Text::eqstr
Tanl::Text::eqstrcase
std::equal_to< char const * >
IXE::ErrorBase class for all errors reported
Tanl::Classifier::EventStreamInterface for objects that deliver a stream of training events for computing a Maxent model, through GIS, LMBFG or other procedures
Parser::EventStreamCreates a stream of Classifier::Event's from an annotated dependency corpus
ME::EventStreamA stream of training events
Tanl::Classifier::EventStreamError
IXE::ExcludeFileAn ExcludeFile contains the set of filename patterns to exclude during either indexing or extraction
Tanl::Classifier::FeatureEncoderUsed to fill Context from unencoded features
Tanl::Classifier::FeatureExtractor< Features, Item >Interface for feature extractors
Tanl::Classifier::FeaturesRepresentation of features during training
Tanl::MorphExtractor::Features
Tanl::NER::FeatureSpecRepresentation of feature model:

  • the token attribute to analyze
  • the pattern to check for the feature
  • the tokens to analyze
Tanl::SST::FeatureSpec
Tanl::SST::FeatureSpecs
IXE::FieldAbstract class for representing fields in tables
IXE::FieldBuilder< T >
IXE::io::File
Tanl::POS::file_error
IXE::FileActionSet the limit for the given resource to its maximum value
IXE::FileEnum
Tanl::Classifier::FileError
IXE::FileErrorThrown when opening a file fails
IXE::FileFormatThis variable contains the version number of the file format for the index
IXE::io::FileHandleFileHandle abstract from OS
IXE::FileHeaderHeader for fulltext index files
Tanl::Classifier::FileTokenizer
IXE::FileTypeA FileType maps a filename pattern to a file type
IXE::FixedField< T >Fixed size fields
FlexLexer
IXE::io::FormatA class to perform formatted output
IXE::FormatErrorWrong index format file
Tanl::FrenchLanguage
Tanl::FrenchMorphExtractor
Fs
Tanl::Classifier::GenericEvent< FeatureType >Generic class for events
Tanl::GermanLanguage
Tanl::Classifier::GISGeneralized Iterative Scaling algorithm to find the parameters {lambda1,
Parser::GlobalInfoStores global information collected while analyzing a corpus
Tanl::GreekLanguage
Tanl::GreekMorphExtractor
hash< std::pair< unsigned, std::vector< unsigned > > >
hash< std::pair< unsigned, unsigned > >Use compact encoding for representing sparse matrix lambda[f, c]
hash< std::string >
std::tr1::hash< Tanl::POS::State const & >
IXE::HasMetaClass< T >Determine if a class has a MetaClass
Tanl::HindiLanguage
IXE::PostingList::HitsCursorCursor over hit lists
Tanl::POS::HMMAbstract class for Hidden Markov Model
Tanl::POS::HmmTagger
Tanl::Text::HtmlTokenizerSimilar to StringTokenizer, except that it skips HTML tags
Tanl::HungarianLanguage
Tanl::HungarianMorphExtractor
IXE::If< Cond, Positive, Negative >
IXE::If< NullType, Positive, Negative >
IXE::IncludeFileAn IncludeFile contains the set of filename patterns to include during either indexing or extraction
IXE::Indexable< T >Interface for classes providing indexer access
IXE::IndexingErrorThrown during indexing
IXE::IndexTable< Entry >An instance of this class is used to access either the word, stop-word, file, or meta-name index portions of an index file
Tanl::IndianLanguage
Tanl::IndianMorphExtractor
IXE::InternalErrorThrown when an internal inconsistency occurs
IXE::InvalidArgumentErrorThrown when an invalid argument is supplied to the API
IXE::InvalidResultErrorThrown when trying to access invalid data
IXE::InvalidThreadStateError
IXE::IOError
Tanl::IPipe< Tin, Tout >IPipe interface
IXE::isArray< T >
IXE::isArray< ArrayOf< T > >
IXE::isArray< std::vector< T > >
IXE::isClass< T >
IXE::isClass< bool >
IXE::isClass< char >
IXE::isClass< const bool >
IXE::isClass< const char >
IXE::isClass< const double >
IXE::isClass< const float >
IXE::isClass< const int >
IXE::isClass< const long double >
IXE::isClass< const long int >
IXE::isClass< const short int >
IXE::isClass< const signed long long int >
IXE::isClass< const unsigned char >
IXE::isClass< const unsigned int >
IXE::isClass< const unsigned long int >
IXE::isClass< const unsigned long long int >
IXE::isClass< const unsigned short int >
IXE::isClass< double >
IXE::isClass< float >
IXE::isClass< int >
IXE::isClass< long double >
IXE::isClass< long int >
IXE::isClass< short int >
IXE::isClass< signed char >
IXE::isClass< signed long long int >
IXE::isClass< std::string >
IXE::isClass< std::vector< T > >
IXE::isClass< T * >
IXE::isClass< unsigned char >
IXE::isClass< unsigned int >
IXE::isClass< unsigned long int >
IXE::isClass< unsigned long long int >
IXE::isClass< unsigned short int >
IXE::isPointer< T >
IXE::isPointer< T * >
IXE::isReference< T >
IXE::isReference< Reference< T > >
Tanl::ItalianLanguage
Tanl::ItalianMorphExtractor
Tanl::ItalianTutLanguage
Tanl::ItalianTutMorphExtractor
IXE::ItemItems represent values for fields
IXE::Configuration::Item
IXE::ItemOf< T >
Tanl::Text::CharBuffer< CharTraits >::iterator
Tanl::Iterator< T >Iterator interface
Parser::Iterator< T >Iterator interface
IXE::IVisitorDefine Visitable classes as:
Tanl::JapaneseLanguage
Kernel
IXE::KeyValuePairsRepresents a sorted collection of associated string keys and string values that can be accessed with the key
Tanl::LanguageFunctional interface for extractors of morphological features
Tanl::Languages
Tanl::Text::Latin1NormalizerString normalizer interface
Tanl::Classifier::LBFGS
std::less< char const * >Specialize the binary_function "less" so that C-style strings (char const*) will work properly with STL containers
IXE::LexEntry
IXE::LexiconManage and use an inverted index dictionary
Tanl::POS::LexiconA Lexicon stores the possible tags of each word in the training corpus, with their frequency
IXE::LockLock object, used for synchronization
IXE::Locked< T >
IXE::LockUpMutex interface: locks mutex at creation, unlocks at destruction
IXE::LogicErrorBase class for errors due to programming errors
map
IXE::io::mappedFileMap a whole file
IXE::io::MappedFileViewA mappedFile is an object that maps a file into memory (via the Unix system call mmap(2)) allowing it to be accessed via iterators
IXE::io::MappedGzFileGzip-compressed file uncompressed and mapped into memory
IXE::io::MappedGzFileViewPortion of gzip-compressed file mapped into memory
IXE::MappedSubFieldRepresents fields in table containing references to other objects This is the version for value type objects, which are stored in a separate mapped file
Tanl::Text::RegExp::MatchGroupsHolds the offsets of matching regular expressions
Tanl::Classifier::MaxEntA Maximum Entropy classifier
IXE::MemoryErrorThrown when there is a communications problem with a remote collection
Parser::MeParserA Parser using a Maximum Entropy classifier
IXE::MetaClassClass MetaClass Describes the structure of an object
IXE::MetaClassOf< T, int >
IXE::MetaClassOf< T, 1 >
Parser::MlpModel
Parser::MlpParserA Parser using a Multi Layer Perceptron classifier
IXE::MmapErrorThrown when mmap fails mapping a file to memory
Tanl::POS::Model
Tanl::MorphExtractorFunctional interface for extractors of morphological features
Tanl::Token::MorphoCompact morphology representation
Tanl::MorphPipeReads tokens from Enumerator
Tanl::MorphPyPipeA pipe that tags sentences from a Python iterator
Tanl::MorphSplitter
Tanl::MorphStreamReads tokens from input stream
Tanl::POS::MorphtableThis class loads the morphtable (words with possible tags) from a file to a map
Parser::MovingAverageYoshua Bengio:
Parser::MultiSvmParserParser using two SVM classifiers, provided by libsvm, one to predict the action, one to predict the label
Tanl::NER::NERA Named Entity Recognizer
Tanl::NER::NerEventStreamAn Enumerator<Event*> extracting features from a sentence
Tanl::NER::NerFeatureExtractorExtract features for NER
Tanl::NER::NerPipePipe connected to a sentence enumerator
Tanl::NER::NerPyPipeA pipe that tags sentences from a Python iterator
IXE::NetworkErrorThrown when there is a communications problem with a remote collection
Tanl::POS::Viterbi::Node
Tanl::XML::XmlReader::NodeAttr
IXE::NoDocPredicate false for any document
Tanl::Text::NormalizerString normalizer interface
Tanl::Text::NormEqualCompare strings by normalizing to lowercase and discarding dots
Tanl::Text::NormHash
Tanl::Text::NormWordSet
IXE::NullType
Tanl::POS::HMM::Observation
ONE_CLASS_Q
IXE::OpeningErrorThrown when opening a collection fails
IXE::OptionStream::Option
IXE::OptionsOptions describes a set of command-line options
Tanl::Classifier::Options
IXE::OptionStreamGiven the traditional argc and argv for command-line arguments, extract options from them following the stream model
PAPassive-Aggressive Classifier
PADDense Passive-Aggressive
Parser::ParserBase class for parsers
IXE::ParserError
Parser::ParserMapAssociates a Parser constructor to an algorithm name
Parser::ParserPipeA pipe that parses sentences from a stream
Parser::ParserPipePythonA pipe that parses sentences from a Python iterator
Parser::ParserSentPipeA pipe that parses sentences from a stream of sentences
Parser::ParseStateParse state
PASSparse Passive-Aggressive
PASVSparse Vector Passive-Aggressive
Tanl::Text::RegExp::PatternRegular Expression matching
IXE::Pattern2Extension of RegExp::Pattern that stores original pattern for serialization
IXE::PatternListAn PatternList contains a list of shell wildcard patterns
IXE::PatternMap< T >A PatternMap maps a shell wildcard pattern to an object of type T
IXE::PatternSetA PatternSet contains a set of shell wildcard patterns
IXE::PatternVarA PatternVar is a configuration variable containing a set of filename patterns
Parser::PennPosTagger
Tanl::PortugueseLanguage
Tanl::PortugueseMorphExtractor
Parser::PosTaggerBase class for POS taggers
Tanl::POS::PosTagger
Parser::PosTaggerError
Tanl::POS::PosTaggerError
Tanl::POS::PosTaggerPipe
IXE::PostingListThis class, given a IndexTable::const_iterator, accesses the list of postings for a word
IXE::PostingOffset
Tanl::POS::ProbError
Tanl::POS::ProbLM< CT >
IXE::io::pstream_common< CharT, Traits >Class template for common base class
IXE::io::pstreamsCommon base class providing constants and typenames
Tanl::PtbScanner
Tanl::PtbTokenizerRead a plain text file, split into tokens
Tanl::PySwigObject
Parser::PySwigObject
Tanl::NER::PySwigObject
QMatrix
IXE::QueryErrorThrown when an SQL query fails
IXE::RangeErrorThrown when an element is out of range
IXE::ReaderErrorThrown when reader fails interpreting document format
Tanl::RefCountable
IXE::Reference< T >Reference to another persistent object
IXE::ReferenceField< T >Reference fields
Tanl::Text::RegExp::RegExpException
IXE::PostingList::remap_iteratorIterates through postings, but skipping those for documents which have been remapped
Tanl::SST::Resources
Tanl::NER::Resources
Parser::ReviseContextStream
Parser::ReviseEventStream
Parser::ReviseStream
IXE::Runnable
IXE::RuntimeErrorBase class for errors due to run time problems
Parser::Scanner
Tanl::Sentence
Parser::SentenceInfoCommon info to all states while parsing a sentence
Tanl::SentenceQueueReaderReads sentences from a queue in memory
Tanl::SentenceReader
Parser::SerializableSerializable interface
IXE::io::ServerSocketServer socket
IXE::Set< T >A Set is a set but with the addition of a contains() member function, one that returns a simpler bool result indicating whether a given element is in the set
IXE::Set< char const * >Specialize Set for C-stle strings so as not to have a reference (implemented as a pointer) to a char const*
IXE::SimpleConditionCondition without the additional lock required by pthread_cond_wait()
Tanl::SloveneLanguage
Tanl::SloveneMorphExtractor
IXE::io::SocketThis class implements client sockets (also called just "sockets")
Solver::SolutionInfo
Solver
Solver_NU
Tanl::SpanishLanguage
Tanl::SpanishMorphExtractor
IXE::Options::spec
Tanl::POS::SpecialTokens
Tanl::SplitSentenceReaderRead a plain text file, split into tokens and return one sentence at a time
Tanl::SST::SSTData uses old IOB conventions
Tanl::SST::SstEventStreamAn Enumerator<Event*> extracting features from a sentence
Tanl::SST::SstFeatureExtractorExtract features for SST
Tanl::SST::SSTPipePipe connected to a sentence enumerator
Tanl::POS::StateA State in the n-order HMM corresponds to an n-uple of tags preceding the current word, last tag first
Parser::StateParser base state
Tanl::Classifier::StreamTokenizer
Tanl::Text::StreamTokenizer
Tanl::Text::StringString class This class stores and manipulates strings of characters defined according to ISO10646
IXE::StringTable
Tanl::Text::StringTokenizer
IXE::SubCursorRepresents a cursor on a Reference subfield
IXE::SubFieldRepresents fields in table containing references to other objects
Tanl::Text::SuffixesList of string suffix
Tanl::POS::SuffixGuesserThe task of the suffix guesser is to predict a tag-distribution based on the suffix of the word
SVC_Q
svm_model
svm_node
svm_parameter
svm_problem
Parser::SvmParserParser using an SVM classifier, provided by libsvm
SVR_Q
Tanl::SwedishLanguage
IXE::SystemErrorThrown when a system call fails
IXE::Table< T, IndexerClass >
IXE::TableErrorThrown when accessing a database Table fails
tableField
Tanl::TagRepresents SGML tags
Parser::Tagged
Tanl::TaggedSentenceReader
Tanl::TaggerInterface for tagging each token in a sentence with supplementary information, such as its part of speech
IXE::ThreadPool::Thread::TaskDescr
Tanl::TeluguLanguage
IXE::TermHitTermHit is used to represent a word occurrence in a document, a sentence delimiter or a tag
IXE::TermInfo
Tanl::TextCorpusPlain text corpus
IXE::ThreadPool::ThreadA thread is an abstract base class that wraps an OS thread, within a thread pool context, i.e., it does all of the grunt work of waiting for tasks and coordinating with the thread pool object to which it belongs
IXE::ThreadA class to start and manage a thread of execution
IXE::ThreadGroupJava-like ThreadGroup
IXE::ThreadPoolA ThreadPool pre-creates and manages a pool of persistent threads to do tasks taken from a queue
IXE::Timer
Tanl::TimerClass for timing durations
Tanl::PtbScanner::TokenRepresents a token recognized by the scanner
Parser::Scanner::TokenRepresents a token recognized by the scanner
Tanl::TokenRepresents a sentence token (a word in a sentence)
Tanl::SST::TokenCategorizerAssigns a type to a token, according to language locale
Tanl::NER::TokenCategorizerAssigns a type to a token, according to language locale
Tanl::TokenFieldDescribes the fields in each token in a corpus
Tanl::TokenizedTextCorpusCorpus made of tokens
Tanl::Tokenizer
Parser::TokenizerRead a plain text file, split into tokens
Tanl::TokenizerPipeTokenizer pipe connected to a stream
Tanl::TokenizerPipeEnumPipe connected to a Enumerator<string*>
Tanl::TokenizerPipePythonPipe connected to a Python iterator
Tanl::TokenLinkRepresents links to other tokens
Parser::TokenPathSpecification of a path from a token, through the sentence
Tanl::TokenSentenceReaderRead a text split into space-separated tokens, one sentence per line, and iterate over it returning one sentence at time
Parser::TrainStateParser training state
Tanl::TreeTokenToken with descendants
Tanl::POS::TrieNodeTrie to represent the suffices plus the additional tag counting information
IXE::TrueType
Tanl::TurkishLanguage
Tanl::TurkishMorphExtractor
Tanl::Text::Unicode::UnicodeInfo
IXE::UnimplementedErrorThrown when an attempt to use an unimplemented feature is made
IXE::unordered_map< _Key, _Tp, _Hash, _Pred, _Alloc >
Tanl::unordered_map< _Key, _Tp, _Hash, _Pred, _Alloc >
Tanl::unordered_set< _Value, _Hash, _Pred, _Alloc, __cache_hash_code >
Tanl::Text::Utf8CharThis is just a type specifier for use in CharBuffer
Tanl::Text::Unicode::Utf8NormalizerConverts
IXE::Var< T >Configuration variable
IXE::VarDefault< T >Configuration variable with default value
IXE::VarField< T >Variable size fields
IXE::Visitable< R >
IXE::Visitor< T, R >
Tanl::POS::ViterbiThis is the good old Viterbi algorithm for HMM decoding
Tanl::POS::VocabTwo way word index: associates an index to a word and viceversa gets the word with the given index
Parser::WordCounts
Parser::WordFreq
Tanl::Text::WordIndexAssociates an ID to each word in a set
Tanl::Text::WordSetSet of words
Tanl::Text::WordSetBase< _Hash, _Pred >
Tanl::XML::XmlException
Tanl::XML::XmlReader
Tanl::XML::XmlScanner
yy_buffer_state
yyFlexLexer
IXE::io::ZlibErrorThrown when a zlib call fails
 All Classes Namespaces Files Functions Variables Typedefs Enumerations Enumerator Friends Defines
 
Copyright © 2005-2011 G. Attardi. Generated on 4 Mar 2011 by doxygen 1.6.1.