AVOZES

The Audio-Video Australian English Speech Data Corpus


Contents Module 1 - Recording setup without speaker

This module contains five sequences in the AVOZES data corpus. The first two are 30 second sequences of the recording scene viewed by the two cameras, but without any speaker present, one for each recording period. The sequences can be used to determine the background level of acoustic noise present in the recording studio, due to air-conditioning as well as computer and recording equipment. In addition, information about the visual background can be gained, if it is required for the segmentation of the speaker from the background in the video stream.

The other three sequences in this module show a metronome in front of the two cameras, which provides information about the synchronisation of the audio and video streams. The sequences show the metronome on a slow, medium-paced, and fast setting. It must be noted, that due to the stereo camera and recording setup, there is a delay of one NTSC video frame between audio and video streams (video lagging the audio).

Since the sequences in this module are speaker-independent, only one recording of each of them was needed per recording period. If corpus recordings are made over prolonged time spans (months or years), or in intervals (for example, extending the corpus at a later stage, as it was done for AVOZES), the sequences should be repeated once during each interval to record possible changes to the recording environment.

Example Sequence
Note: Any example sequence is provided for informative purposes, so that you can judge whether AVOZES is the right data corpus for you. You may use it for internal evaluation purposes only. For all other uses, including academic research, a licence must be acquired (non-commercial (academic) licence, commercial licence).


Download an example sequence (34.4MB, AVI) containing a 10s sample of a sequence showing the amount of acoustic background noise as well as the visual background.

[Homepage] [AVOZES Homepage] [Research]


© Roland Göcke
Last modified: Tue Nov 09 17:21:25 AUS Eastern Daylight Time 2004