Difference between revisions of "Corpora"

From Medialab

m
Line 1: Line 1:
* [[Sentence_Splitter|Sentence Splitter]]
 
  +
* SentenceSplitter
 
** [[media:IT-TrainingCorpus.txt|Italian Training Corpus]] (13,376 sentences)
 
** [[media:IT-TrainingCorpus.txt|Italian Training Corpus]] (13,376 sentences)
   
* [[POS_Tagger|POS Tagger]]
+
* POS Tagger
** TreeTagger Lexicon (1,268,369 entities)
+
** [[media:fullexMorph.tanl|TreeTagger Lexicon]] (1,268,369 entities)

Revision as of 16:50, 6 March 2009