UNLOCKING THE STRENGTH OF VIDEO CLIP TO TEXT CONVERSION: A COMPREHENSIVE GUIDELINE