High-performance video feature extractor using CLIP
The video is first segmented into shots using TransNet; for each shot, a single frame from the middle of the shot is used for CLIP feature extraction.
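The middle-frame selection described above can be sketched as follows. This is a minimal illustration, not the repository's actual code: it assumes shot boundaries are already available as `(start, end)` frame-index pairs with an inclusive end, and the function name `middle_frame_indices` is hypothetical. Shot detection and CLIP encoding are deliberately left out.

```python
from typing import List, Tuple


def middle_frame_indices(shots: List[Tuple[int, int]]) -> List[int]:
    """Return the middle frame index for each shot.

    Assumption: each shot is a (start, end) pair of frame indices,
    with `end` inclusive. The boundary format a real shot detector
    produces may differ.
    """
    return [start + (end - start) // 2 for start, end in shots]


# Hypothetical shot boundaries for a 151-frame clip.
shots = [(0, 99), (100, 149), (150, 150)]
print(middle_frame_indices(shots))  # [49, 124, 150]
```

Only the selected frames would then need to be decoded and passed to the image encoder, so the feature-extraction cost scales with the number of shots rather than the number of frames.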
PatchyVideo/TouhouVideoFeatureExtractor