Difference between revisions of "Corpora"

From Medialab

m
Line 1: Line 1:
 
* SentenceSplitter
 
* SentenceSplitter
** [[media:IT-TrainingCorpus.txt|Italian Training Corpus]] (13,376 sentences)
+
** [[media:IT-TrainingCorpus.txt|Italian Training Corpus]] (13,376 sentences)
   
 
* POS Tagger
 
* POS Tagger
** [[media:fullexMorph.tanl|TreeTagger Lexicon]] (1,268,369 entities)
+
** [[media:fullexMorph.tanl|TreeTagger Lexicon]] (1,268,369 entities)

Revision as of 16:50, 6 March 2009