Video-text retrieval techniques endeavour to bridge the semantic gap between visual content and natural language descriptions. By learning joint representations for both video and text, these ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results