Kaicheng Yang's picture

Kaicheng Yang

Kaichengalex

·

https://kaichengyang0828.github.io/Kaicheng-Yang0828.github.io/

Kaicheng-Yang0828

AI & ML interests

Multimodal Representation Learning/ Vision-Language Pretraining/DeepResearch

Recent Activity

upvoted a paper 3 days ago

Qwen3-VL Technical Report

upvoted a paper 4 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 5 days ago

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

View all activity

Organizations

upvoted a paper 3 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 11 days ago • 106

upvoted a paper 4 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 5 days ago • 170

upvoted a paper 5 days ago

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

Paper • 2512.01342 • Published 6 days ago • 14

upvoted an article 5 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

7 days ago

•

224

upvoted a paper 11 days ago

HunyuanOCR Technical Report

Paper • 2511.19575 • Published 13 days ago • 19

updated a collection 16 days ago

Vision-Language Dataset

3 items • Updated 16 days ago

published a dataset 16 days ago

Kaichengalex/DanQing100M

Updated 16 days ago • 14

upvoted a paper 19 days ago

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Paper • 2511.12609 • Published 21 days ago • 102

upvoted a paper 27 days ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published 30 days ago • 42

upvoted a paper about 1 month ago

Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs

Paper • 2510.13795 • Published Oct 15 • 56

updated a collection about 1 month ago

SFT Dataset

5 items • Updated about 1 month ago

liked a dataset about 1 month ago

Open-Bee/Honey-Data-15M

Viewer • Updated Nov 5 • 14.8M • 99.6k • 100

upvoted 2 articles about 1 month ago

Article

ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases

Nov 5

•

52

Article

What makes good reasoning data

Oct 30

•

34

upvoted a paper about 1 month ago

Towards Universal Video Retrieval: Generalizing Video Embedding via Synthesized Multimodal Pyramid Curriculum

Paper • 2510.27571 • Published Oct 31 • 17

updated a collection about 1 month ago

MLLM4Embedding

7 items • Updated Nov 4

upvoted 2 papers about 1 month ago

UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings

Paper • 2511.00405 • Published Nov 1 • 5

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30 • 106

updated a collection about 1 month ago

SFT Dataset

5 items • Updated about 1 month ago

upvoted a paper about 1 month ago

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 96