TY - GEN
T1 - Audio-visual synchronization recovery in multimedia content
AU - Lee, Jong Seok
AU - Ebrahimi, Touradj
PY - 2011
Y1 - 2011
N2 - This paper proposes a method recovering audio-visual synchronization of multimedia content. It exploits the correlation between the acoustic and the visual signals in order to estimate the audio-visual drift existing in the content. By shifting the audio signal relative to the visual signal, the estimation of the drift is obtained by searching for the shift producing the maximal audio-visual correlation. We consider two correlation measures, namely, mutual information and canonical correlation, and compare their performance. Experimental results demonstrate that the method using the canonical correlation is effective in recovering the audio-visual synchronization for both speech and non-speech sequences.
AB - This paper proposes a method recovering audio-visual synchronization of multimedia content. It exploits the correlation between the acoustic and the visual signals in order to estimate the audio-visual drift existing in the content. By shifting the audio signal relative to the visual signal, the estimation of the drift is obtained by searching for the shift producing the maximal audio-visual correlation. We consider two correlation measures, namely, mutual information and canonical correlation, and compare their performance. Experimental results demonstrate that the method using the canonical correlation is effective in recovering the audio-visual synchronization for both speech and non-speech sequences.
KW - Audio-visual synchronization
KW - canonical correlation
KW - multimedia
KW - mutual information
UR - http://www.scopus.com/inward/record.url?scp=80051618714&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80051618714&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2011.5946937
DO - 10.1109/ICASSP.2011.5946937
M3 - Conference contribution
AN - SCOPUS:80051618714
SN - 9781457705397
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 2280
EP - 2283
BT - 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings
T2 - 36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
Y2 - 22 May 2011 through 27 May 2011
ER -