Cost-effective solution to synchronized audio-visual capture using multiple sensors

Jeroen Lichtenauer, Michel Valstar, Jie Shen, Maja Pantic

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    18 Citations (Scopus)
    96 Downloads (Pure)

    Abstract

    Applications such as surveillance and human motion capture require high-bandwidth recording from multiple cameras. Furthermore, the recent increase in research on sensor fusion has raised the demand on synchronization accuracy between video, audio and other sensor modalities. Previously, capturing synchronized, high resolution video from multiple cameras required complex, inflexible and expensive solutions. Our experiments show that a single PC, built from contemporary low-cost computer hardware, could currently handle up to 470MB/s of input data. This allows capturing from 18 cameras of 780x580pixels at 60fps each, or 36 cameras at 30fps. Furthermore, we achieve accurate synchronization between audio, video and additional sensors, by recording audio together with sensor trigger- or timestamp signals, using a multi-channel audio input. In this way, each sensor modality can be captured with separate software and hardware, allowing maximal flexibility with minimal cost.
    Original languageUndefined
    Title of host publicationProceedings 6th International Conference on Advanced Video and Signal Based Surveillance (AVSS '09)
    Place of PublicationLos Alamitos
    PublisherIEEE Computer Society
    Pages324-329
    Number of pages6
    ISBN (Print)978-0-7695-3718-4
    DOIs
    Publication statusPublished - 2009

    Publication series

    Name
    PublisherIEEE Computer Society Press

    Keywords

    • METIS-264473
    • IR-69694
    • Synchronization
    • Video recording
    • Multisensor systems
    • Audio recording
    • EWI-17188
    • EC Grant Agreement nr.: FP7/211486
    • HMI-MI: MULTIMODAL INTERACTIONS
    • HMI-HF: Human Factors

    Cite this

    Lichtenauer, J., Valstar, M., Shen, J., & Pantic, M. (2009). Cost-effective solution to synchronized audio-visual capture using multiple sensors. In Proceedings 6th International Conference on Advanced Video and Signal Based Surveillance (AVSS '09) (pp. 324-329). [10.1109/AVSS.2009.92] Los Alamitos: IEEE Computer Society. https://doi.org/10.1109/AVSS.2009.92