Class-dependent and cross-modal memory network considering sentimental features for video-based captioning

xzcuywvmqts2rf
The video-based commonsense captioning task aims to add multiple commonsense descriptions to video captions so that video content can be understood better. This paper focuses on the importance of cross-modal mapping. We propose a combined framework called Class-dependent and Cross-modal Memory Network considering SENtimental features (CCMN-SEN) for Video-based Captioning to enhance commonse... https://www.unitedssports.shop/product-category/bike-parts-tires-27-5/