Efficient video comprehension with AI