[1]
Himanshu Tyagi, et al. 2023. TAPER-WE: Transformer-Based Model Attention with Relative Position Encoding and Word Embedding for Video Captioning and Summarization in Dense Environment. International Journal on Recent and Innovation Trends in Computing and Communication. 11, 9 (Nov. 2023), 4851–4857. DOI:https://doi.org/10.17762/ijritcc.v11i9.10081.