## V2A- CoT Project

This project is a demo of V2A-CoT.

#### Algorithm

“ frame.py “ is an algorithm for extracting key frames.

” gptv-demo.py “ is a demo algorithm for running V2A-CoT on GPT-4o, and an API key needs to be entered when using it. The default prompt is a direct inquiry, and standard CoT and V2A-CoT can be used by adjusting the content.

#### Generated Results

Some examples of generating audio videos using V2A-CoT under the Generated_results folder.

For example, ” sample1_Original_version(Silent).mp4 ” is an original silent video, while “ sample1_V2A-CoT_generated.mp4 ” is an audio video generated using V2A-CoT and has a high degree of audiovisual consistency.