Sergiu Han's picture

Sergiu Han

hgsg

·

https://sergiudm.github.io/

sergiudm

AI & ML interests

NLP, agent

Recent Activity

liked a model about 6 hours ago

ai-sage/GigaChat3.1-702B-A36B

upvoted a paper 7 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

liked a model 8 days ago

mistralai/Mistral-Small-4-119B-2603

View all activity

Organizations

None yet

upvoted a paper 7 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 8 days ago • 127

upvoted a collection 8 days ago

Mistral Small 4

A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 9 days ago • 61

upvoted a collection 14 days ago

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated about 21 hours ago • 100

upvoted a paper 15 days ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16, 2025 • 56

upvoted an article 30 days ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

Feb 20

•

490

upvoted a collection about 1 month ago

Qwen3.5

21 items • Updated 16 days ago • 1.3k

upvoted a paper about 2 months ago

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

Paper • 2512.22047 • Published Dec 26, 2025 • 30

upvoted a collection about 2 months ago

Open Coding Agents

13 items • Updated 20 days ago • 52

upvoted 2 papers 2 months ago

OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Paper • 2601.15369 • Published Jan 21 • 21

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 200

upvoted a collection 2 months ago

TranslateGemma

3 items • Updated 13 days ago • 223

upvoted an article 2 months ago

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Aug 9, 2025

•

54

upvoted 8 papers 3 months ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 156

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published Dec 15, 2025 • 106

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 39

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 74

Step-GUI Technical Report

Paper • 2512.15431 • Published Dec 17, 2025 • 133

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 161

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 244

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 264