ARUN M GIRISAN. Deep Learning Architectures for Multimodal Data Fusion in Natural Language Processing and Computer Vision. International Journal of Artificial Intelligence, [S. l.], v. 1, n. 2, p. 1–11, 2020. Available at: https://ijai.in/index.php/home/article/view/IJAI.01.02.001.. Accessed: 17 May 2025.