|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
NUMBER OF TAGGED UTTERANCES PER TAGGING LEVEL
PER DIALOGUE |
|
|
|
|
|
|
|
|
|
|
DATE (month/dd/yyyy) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jun/23/2006 |
DIALOGUE NUMBER |
|
|
d01 |
d02 |
d03 |
d04 |
d05 |
d06 |
d07 |
d08 |
d09 |
d10 |
d11 |
d12 |
d13 |
d14 |
d15 |
d16 |
d17 |
d18 |
d19 |
d20 |
d21 |
d22 |
d23 |
d24 |
d25 |
d26 |
|
TOTAL NUMBER OF UTTERANCES |
116 |
196 |
168 |
384 |
118 |
371 |
98 |
162 |
692 |
96 |
285 |
117 |
191 |
137 |
90 |
454 |
237 |
216 |
105 |
179 |
69 |
181 |
81 |
300 |
116 |
210 |
|
DIME-DAMSL |
116 |
0 |
168 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
117 |
191 |
137 |
90 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
210 |
|
ALLOPHONES
(T54) |
116 |
0 |
168 |
0 |
0 |
0 |
40 |
0 |
0 |
0 |
0 |
117 |
191 |
137 |
90 |
0 |
237 |
0 |
105 |
0 |
0 |
0 |
81 |
0 |
0 |
0 |
|
PHONETIC
SYLLABLES |
116 |
0 |
168 |
0 |
0 |
0 |
40 |
0 |
0 |
0 |
0 |
117 |
191 |
137 |
90 |
0 |
237 |
0 |
105 |
0 |
0 |
0 |
81 |
0 |
0 |
0 |
|
WORDS |
116 |
0 |
168 |
0 |
0 |
0 |
40 |
0 |
0 |
0 |
0 |
0 |
191 |
137 |
90 |
0 |
237 |
0 |
105 |
0 |
0 |
0 |
81 |
0 |
0 |
0 |
|
DEFAULT
MOMEL - See note 1 |
116 |
196 |
168 |
384 |
118 |
371 |
98 |
162 |
692 |
96 |
285 |
117 |
191 |
137 |
90 |
454 |
237 |
216 |
105 |
179 |
69 |
181 |
81 |
300 |
116 |
210 |
|
VERIFIED
MOMEL - See note 1 |
55 |
0 |
168 |
0 |
0 |
0 |
0 |
0 |
0 |
14 |
0 |
100 |
0 |
50 |
41 |
0 |
179 |
0 |
52 |
0 |
0 |
0 |
64 |
0 |
0 |
0 |
|
INTSINT -
See note 2 |
55 |
0 |
134 |
0 |
0 |
0 |
0 |
0 |
0 |
14 |
0 |
100 |
0 |
50 |
41 |
0 |
179 |
0 |
39 |
0 |
0 |
0 |
64 |
0 |
0 |
0 |
|
BREAK
INDICES (from ToBI) |
116 |
0 |
168 |
0 |
0 |
0 |
40 |
0 |
0 |
0 |
0 |
0 |
191 |
137 |
90 |
0 |
237 |
0 |
105 |
0 |
0 |
0 |
81 |
0 |
0 |
0 |
|
SENTENCE
MOOD |
0 |
0 |
0 |
0 |
0 |
0 |
39 |
0 |
0 |
0 |
0 |
0 |
191 |
0 |
90 |
0 |
237 |
0 |
105 |
0 |
0 |
0 |
81 |
0 |
0 |
0 |
|
PART OF
SPEECH (P.O.S.) |
115 |
0 |
161 |
0 |
0 |
0 |
32 |
0 |
0 |
0 |
0 |
115 |
153 |
133 |
76 |
0 |
171 |
0 |
85 |
0 |
0 |
0 |
81 |
0 |
0 |
0 |
|
DISCOURSE
MARKER - See note 3 |
115 |
0 |
161 |
0 |
0 |
0 |
32 |
0 |
0 |
0 |
0 |
115 |
153 |
133 |
0 |
0 |
103 |
0 |
37 |
0 |
0 |
0 |
81 |
0 |
0 |
0 |
|
SPEECH
REPAIR - See note 3 |
115 |
0 |
161 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
115 |
153 |
0 |
5 |
0 |
19 |
0 |
7 |
0 |
0 |
0 |
81 |
0 |
0 |
0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
NOTES |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1)
In order to generate verified MOMEL files, MOMEL default files are needed.
Creating default MOMELs can be impossible |
|
|
|
|
|
in
some cases because of certain inherent characteristics in the original audio
files, such as too short duration, too wide f0 |
|
|
|
|
|
range,
etc. In some other cases, default Momels can be created but containing errors
that are produced by Momel algorithm itself. |
|
|
|
|
In
both cases, the generation of verified Momels could require a special
processing to be specified by S. Coria. |
|
|
|
|
|
|
|
|
In
some few cases, producing the verified Momels could be perhaps impossible. |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2) INTSINT tagging is produced by an automatic
process from verified Momel taggings. |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3) Discourse markers and speech repairs do not
occur in every utterance; but the human tagger needs |
|
|
|
|
|
|
|
|
|
|
to analize them all to see which do contain
them. That is why we count all utterances. |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
4) At the present stage, DIME-DAMSL
tagging is being developed for the
dialogs as follows: |
|
|
|
|
|
|
|
|
|
|
|
|
1, 3, 12, 13, 14, 15, 17, 19, 23 y 26 (12 and
14 are rather re-annotations with the most recent version of the model) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|