In a Training Loop 🔄

Weikai Huang PRO

weikaih

https://weikaih04.github.io/

weikaih04

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 7 days ago • 229

upvoted a collection 8 days ago

WildDet3D

This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated 2 days ago • 17

upvoted a collection 12 days ago

Molmo2

Artifacts for the Molmo2 release • 5 items • Updated Mar 2 • 36

upvoted 2 papers 18 days ago

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models

Paper • 2603.24575 • Published 21 days ago • 18

Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos

Paper • 2602.23543 • Published Feb 26 • 9

upvoted 2 papers about 2 months ago

TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics

Paper • 2602.19313 • Published Feb 22 • 26

Experiential Reinforcement Learning

Paper • 2602.13949 • Published Feb 15 • 72

upvoted a collection 2 months ago

XGen-MM-1 models and datasets

A collection of all XGen-MM (Foundation LMM) models! • 15 items • Updated Mar 2 • 40

upvoted 2 collections 3 months ago

Open Coding Agents

13 items • Updated Mar 5 • 52

Molmo2 Data

Artifacts for the Molmo2 data release • 13 items • Updated Mar 2 • 39

upvoted a collection 5 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 168

upvoted 2 papers 5 months ago

BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing

Paper • 2506.17450 • Published Jun 20, 2025 • 64

Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index

Paper • 2506.12229 • Published Jun 13, 2025 • 3

upvoted a collection 8 months ago

DocRAG Datasets

Processed ("Unified") datasets used in DocRAG for training or inference purposes. • 12 items • Updated Jun 14, 2025 • 1

upvoted a paper 10 months ago

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

Paper • 2504.15280 • Published Apr 21, 2025 • 25

upvoted a collection 10 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 309

upvoted a collection 11 months ago

Synthetic Object Compositions for Det / Seg / Grounding

Dataset Collections for paper: https://github.com/weikaih04/Synthetic-Detection-Segmentation-Grounding-Data • 8 items • Updated Mar 2 • 2

upvoted a collection over 1 year ago

CoTA Datasets

This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets. • 4 items • Updated Mar 2 • 7

upvoted a paper over 1 year ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 101

upvoted a collection almost 2 years ago

TaskMeAnything

A collection of TaskMeAnything resources [https://github.com/JieyuZ2/TaskMeAnything] • 7 items • Updated Mar 2 • 3