Skip to content

基于SG2300X的视频检索【使用自然语言搜索视频内容,定位到符合描述的具体时间段】

Notifications You must be signed in to change notification settings

ZillaRU/VideoSearch-tpu

Repository files navigation

CLIP Video Search

Please download bmodels and place them in ./inference/clip_model/bmodels.

python -m pip install dfn
# download CLIP VIT-b32 and put these files into ./clip_image_search/clip/bmodels/EN
python3 -m dfn --url https://disk.sophgo.vip/sharing/optDG3uDs
# download ChineseCLIP VIT-16 and put these files into ./clip_image_search/clip/bmodels/CH
python3 -m dfn --url https://disk.sophgo.vip/sharing/qw6hvmVWs

CLIP (Contrastive Language–Image Pre-training) is a technique which efficiently learns visual concepts from natural language supervision. CLIP has found applications in stable diffusion.

This repository aims act as a POC in exploring the ability to use CLIP for video search using natural language outlined in the article found here.

Usage

Dependencies

  • python >= 3.8

Running

streamlit run app.py EN

About

基于SG2300X的视频检索【使用自然语言搜索视频内容,定位到符合描述的具体时间段】

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages