Abstract
With the recent advancements of technology, and in particular with graphics processing and artificial intelligence algorithms, fake media generation has become easier. Using deep learning techniques like Deepfakes and FaceSwap, anyone can generate fake videos by manipulating the face/voice of target subjects in videos. These AI synthesized videos are a big threat to the authenticity and trustworthiness of online information and can be used for malicious purposes. Detecting face tampering in videos is of utmost importance. We propose a spatio-temporal hybrid model of Capsule Networks integrated with Long Short-Term Memory (LSTM) networks. This model exploits the inconsistencies in videos to distinguish real and fake videos. We use three different frame selection techniques and show that frame selection has a significant impact on the performance of models. The combined Capsule and LSTM network have comparable performance to state-of-the-art models and about 1/5th the number of parameters, resulting in reduced computational cost.
Original language | English |
---|---|
Title of host publication | Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP |
Editors | Giovanni Maria Farinella, Petia Radeva, Jose Braz, Kadi Bouatouch |
Publisher | SCITEPRESS |
Pages | 407-414 |
Number of pages | 8 |
ISBN (Electronic) | 9789897584886 |
DOIs | |
Publication status | Published - 2021 |
Event | 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2021 - Virtual, Online Duration: 8 Feb 2021 → 10 Feb 2021 Conference number: 16 |
Conference
Conference | 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2021 |
---|---|
Abbreviated title | VISIGRAPP 2021 |
City | Virtual, Online |
Period | 8/02/21 → 10/02/21 |
Keywords
- Capsule Networks
- Deepfake Detection
- Face Video Manipulation
- Long Short-Term Memory Networks