Difference between revisions of "Corpora"

From Medialab

Line 1: Line 1:
- * SentenceSplitter
+ * SemaWiki
- ** [[media:IT-TrainingCorpus.txt|Italian Training Corpus]] (13,376 sentences)
+ ** SentenceSplitter
+ *** [http://medialab.di.unipi.it/Project/SemaWiki/Corpus/IT-TrainingCorpus.txt Italian Training Corpus] (13,376 sentences)
- * POS Tagger
+ ** POS Tagger
- ** [[media:fullexMorph.tanl|Italian Lexicon]] (1,268,369 entities)
+ *** [http://medialab.di.unipi.it/Project/SemaWiki/Corpus/fullexMorph.tanl Italian Lexicon] (1,268,369 entities)

Revision as of 17:11, 10 March 2009