Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

xinyiW915
/
CAMP-VQA

Visual Question Answering
deep-learning
vision
VQA
Transformer
CNN
Model card Files Files and versions
xet
Community
CAMP-VQA / src /extractor
46.1 kB
  • 1 contributor
History: 1 commit
Xinyi Wang
initial commit
b9b1b10 about 1 month ago
  • __init__.py
    48 Bytes
    initial commit about 1 month ago
  • extract_clip_embeds.py
    7.33 kB
    initial commit about 1 month ago
  • extract_clip_embeds_ablation.py
    11 kB
    initial commit about 1 month ago
  • extract_frag.py
    15 kB
    initial commit about 1 month ago
  • extract_frame_info.py
    8.59 kB
    initial commit about 1 month ago
  • extract_slowfast_clip.py
    2.72 kB
    initial commit about 1 month ago
  • extract_swint_clip.py
    1.45 kB
    initial commit about 1 month ago