xinyiW915/CAMP-VQA
Visual Question Answering
8 datasets
deep-learning
vision
VQA
Transformer
CNN
arxiv: 2511.07290
arxiv: 2407.11496
License: mit
Branch: main
Path: CAMP-VQA/src/extractor
Size: 46.1 kB
Contributors: 1
History: 1 commit
Latest commit: "initial commit" (b9b1b10) by Xinyi Wang, about 1 month ago
File                             Scan  Size      Last commit     Updated
__init__.py                      Safe  48 Bytes  initial commit  about 1 month ago
extract_clip_embeds.py           Safe  7.33 kB   initial commit  about 1 month ago
extract_clip_embeds_ablation.py  Safe  11 kB     initial commit  about 1 month ago
extract_frag.py                  Safe  15 kB     initial commit  about 1 month ago
extract_frame_info.py            Safe  8.59 kB   initial commit  about 1 month ago
extract_slowfast_clip.py         Safe  2.72 kB   initial commit  about 1 month ago
extract_swint_clip.py            Safe  1.45 kB   initial commit  about 1 month ago