video-audio-to-text video-audio-to-text that can use CahtGPT to summary Fork my project or give me start if you like it =).