Tool for video production professionals to automatically extract clip information using speech recognition and computer vision