Token Format

From Medialab

Revision as of 13:42, 11 June 2008 by Giuseppe.Attardi

Processes exchange Token streams as discussed in Software Architecture.

The representation of a Token is flexible, so each process can extend it by adding its own features.

A Token consists of:

  • a mapping: Features → Values
  • a mapping: Relations → 〈TokenId, Label〉

where Features express specific features of a token and Relations express relations by means of labeled links to other tokens.
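
The two mappings above can be sketched as a small data structure. This is only an illustrative sketch, not the format's actual implementation: the class layout, the integer TokenId, the feature names FORM and LENGTH, the relation name DEP, the label SUBJ, and the add_length process are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, Iterator, Tuple

TokenId = int  # assumption: a token is identified by an integer


@dataclass
class Token:
    # Features -> Values: open-ended, so any process can add its own keys.
    features: Dict[str, str] = field(default_factory=dict)
    # Relations -> (TokenId, Label): labeled links to other tokens.
    relations: Dict[str, Tuple[TokenId, str]] = field(default_factory=dict)


def add_length(tokens: Iterable[Token]) -> Iterator[Token]:
    """Hypothetical process: extends each token with a LENGTH feature."""
    for token in tokens:
        token.features["LENGTH"] = str(len(token.features.get("FORM", "")))
        yield token


# A two-token stream; FORM is a hypothetical feature holding the word form.
stream = [Token({"FORM": "dogs"}), Token({"FORM": "bark"})]
# Link token 0 to token 1 through a hypothetical DEP relation labeled SUBJ.
stream[0].relations["DEP"] = (1, "SUBJ")

processed = list(add_length(stream))
```

Because both mappings are plain dictionaries, a process that adds a feature leaves the features and relations written by earlier processes untouched, which is the kind of extensibility described above.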

Tokens are the constituents of sentences (whose structure is specified in Sentence Format). Sentences are grouped into corpora whose format is described in Corpus Format.