The other three sequences in this module show a metronome in front of the two cameras, which provides information about the synchronisation of the audio and video streams. The sequences show the metronome on a slow, medium-paced, and fast setting. It must be noted, that due to the stereo camera and recording setup, there is a delay of one NTSC video frame between audio and video streams (video lagging the audio).
Since the sequences in this module are speaker-independent, only one recording of each of them was needed per recording period. If corpus recordings are made over prolonged time spans (months or years), or in intervals (for example, extending the corpus at a later stage, as it was done for AVOZES), the sequences should be repeated once during each interval to record possible changes to the recording environment.
Example Sequence
Note: Any example sequence is provided for informative purposes, so
that you can judge whether AVOZES is the right data corpus for you. You
may use it for internal evaluation purposes only. For all other uses,
including academic research, a licence must be acquired
(non-commercial (academic) licence,
commercial licence).
Download an example sequence (34.4MB, AVI) containing a 10s sample of
a sequence showing the amount of acoustic background noise as well as
the visual background.
[Homepage] [AVOZES Homepage] [Research]