Difference between revisions of "Corpora"

From Medialab

* SemaWiki
** SentenceSplitter
*** [http://medialab.di.unipi.it/Project/SemaWiki/Corpus/IT-TrainingCorpus.txt.bz2 Italian Training Corpus] (13,376 sentences)
** POS Tagger
*** [http://medialab.di.unipi.it/Project/SemaWiki/Corpus/fullexMorph.tanl.bz2 Italian Lexicon] (1,268,369 entities)

Latest revision as of 00:08, 19 April 2009