Difference between revisions of "Corpora"

From Medialab

m
Line 1: Line 1:
 
* SemaWiki
 
* SemaWiki
 
** SentenceSplitter
 
** SentenceSplitter
*** [http://medialab.di.unipi.it/Project/SemaWiki/Corpus/IT-TrainingCorpus.txt Italian Training Corpus] (13,376 sentences)
+
*** [http://medialab.di.unipi.it/Project/SemaWiki/Corpus/IT-TrainingCorpus.txt,bz2 Italian Training Corpus] (13,376 sentences)
 
** POS Tagger
 
** POS Tagger
*** [http://medialab.di.unipi.it/Project/SemaWiki/Corpus/fullexMorph.tanl Italian Lexicon] (1,268,369 entities)
+
*** [http://medialab.di.unipi.it/Project/SemaWiki/Corpus/fullexMorph.tanl.bz2 Italian Lexicon] (1,268,369 entities)

Revision as of 17:57, 10 March 2009