The AVOZES data corpus includes one sequence per digit for each speaker, spoken in order from 0 to 9. Again, each digit is enclosed by the carrier phrase "You grab /DIGIT/ beer." to ensure lip closure before and after the digit for ease of segmentation of the video stream. These sequences are typically 2-3s long.
Example Sequence
Note: Any example sequence is provided for informative purposes, so
that you can judge whether AVOZES is the right data corpus for you. You
may use it for internal evaluation purposes only. For all other uses,
including academic research, a licence must be acquired
(non-commercial (academic) licence,
commercial licence).
Download an example sequence (6.9MB, AVI) of "You grab ONE beer."
[Homepage] [AVOZES Homepage] [Research]