Arun M Girisan. “Deep Learning Architectures for Multimodal Data Fusion in Natural Language Processing and Computer Vision.” International Journal of Artificial Intelligence 1, no. 2 (July 10, 2020): 1–11. Accessed May 29, 2025. https://ijai.in/index.php/home/article/view/IJAI.01.02.001.