DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models • arXiv:2512.02556 • Published Dec 2025
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights • arXiv:2512.01816 • Published Dec 2025
AInstein: Assessing the Feasibility of AI-Generated Approaches to Research Problems • arXiv:2510.05432 • Published Oct 6, 2025
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models • arXiv:2504.10479 • Published Apr 14, 2025
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning • arXiv:2503.07459 • Published Mar 10, 2025
ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks • arXiv:2503.06885 • Published Mar 10, 2025
MinorBench: A hand-built benchmark for content-based risks for children • arXiv:2503.10242 • Published Mar 13, 2025
FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation • arXiv:2503.06680 • Published Mar 9, 2025
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning • arXiv:2503.10291 • Published Mar 13, 2025
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models • arXiv:2503.09573 • Published Mar 12, 2025
VisualSimpleQA: A Benchmark for Decoupled Evaluation of Large Vision-Language Models in Fact-Seeking Question Answering • arXiv:2503.06492 • Published Mar 9, 2025