<< Regresar

NUMBER OF TAGGED UTTERANCES PER TAGGING LEVEL PER DIALOGUE
DATE (month/dd/yyyy)
Jun/23/2006 DIALOGUE NUMBER
d01 d02 d03 d04 d05 d06 d07 d08 d09 d10 d11 d12 d13 d14 d15 d16 d17 d18 d19 d20 d21 d22 d23 d24 d25 d26
TOTAL NUMBER OF UTTERANCES 116 196 168 384 118 371 98 162 692 96 285 117 191 137 90 454 237 216 105 179 69 181 81 300 116 210
DIME-DAMSL 116 0 168 0 0 0 0 0 0 0 0 117 191 137 90 0 0 0 0 0 0 0 0 0 0 210
ALLOPHONES (T54) 116 0 168 0 0 0 40 0 0 0 0 117 191 137 90 0 237 0 105 0 0 0 81 0 0 0
PHONETIC SYLLABLES 116 0 168 0 0 0 40 0 0 0 0 117 191 137 90 0 237 0 105 0 0 0 81 0 0 0
WORDS 116 0 168 0 0 0 40 0 0 0 0 0 191 137 90 0 237 0 105 0 0 0 81 0 0 0
DEFAULT MOMEL  -  See note 1 116 196 168 384 118 371 98 162 692 96 285 117 191 137 90 454 237 216 105 179 69 181 81 300 116 210
VERIFIED MOMEL - See note 1 55 0 168 0 0 0 0 0 0 14 0 100 0 50 41 0 179 0 52 0 0 0 64 0 0 0
INTSINT - See note 2 55 0 134 0 0 0 0 0 0 14 0 100 0 50 41 0 179 0 39 0 0 0 64 0 0 0
BREAK INDICES (from  ToBI) 116 0 168 0 0 0 40 0 0 0 0 0 191 137 90 0 237 0 105 0 0 0 81 0 0 0
SENTENCE MOOD 0 0 0 0 0 0 39 0 0 0 0 0 191 0 90 0 237 0 105 0 0 0 81 0 0 0
PART OF SPEECH (P.O.S.) 115 0 161 0 0 0 32 0 0 0 0 115 153 133 76 0 171 0 85 0 0 0 81 0 0 0
DISCOURSE MARKER - See note 3 115 0 161 0 0 0 32 0 0 0 0 115 153 133 0 0 103 0 37 0 0 0 81 0 0 0
SPEECH REPAIR - See note 3 115 0 161 0 0 0 0 0 0 0 0 115 153 0 5 0 19 0 7 0 0 0 81 0 0 0
NOTES
1) In order to generate verified MOMEL files, MOMEL default files are needed. Creating default MOMELs can be impossible
in some cases because of certain inherent characteristics in the original audio files, such as too short duration, too wide f0
range, etc. In some other cases, default Momels can be created but containing errors that are produced by Momel algorithm itself.
In both cases, the generation of verified Momels could require a special processing to be specified by S. Coria.
In some few cases, producing the verified Momels could be perhaps impossible.
2) INTSINT tagging is produced by an automatic process from verified Momel taggings.
3) Discourse markers and speech repairs do not occur in every utterance; but the human tagger needs
to analize them all to see which do contain them. That is why we count all utterances.
4) At the present stage, DIME-DAMSL tagging  is being developed for the dialogs as follows:
1, 3, 12, 13, 14, 15, 17, 19, 23 y 26 (12 and 14 are rather re-annotations with the most recent version of the model)