Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2502.12524

YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection

Paper • 2406.11641 • Published Jun 17, 2024
YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published Feb 18, 2025 • 12
DGE-YOLO: Dual-Branch Gathering and Attention for Accurate UAV Object Detection

Paper • 2506.23252 • Published Jun 29, 2025
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection

Paper • 2512.23273 • Published Dec 29, 2025 • 14

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Paper • 2502.13347 • Published Feb 19, 2025 • 30
YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published Feb 18, 2025 • 12

Object detection 🔍

YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems

Paper • 2408.09332 • Published Aug 18, 2024 • 2
YOLOv10: Real-Time End-to-End Object Detection

Paper • 2405.14458 • Published May 23, 2024 • 6
End-to-End Object Detection with Transformers

Paper • 2005.12872 • Published May 26, 2020 • 7
YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published Feb 18, 2025 • 12

interesting architecture

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3, 2024 • 29
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 90
Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31, 2025 • 24
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Paper • 2502.09509 • Published Feb 13, 2025 • 8

YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published Feb 18, 2025 • 12
YOLOv10: Real-Time End-to-End Object Detection

Paper • 2405.14458 • Published May 23, 2024 • 6

YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published Feb 18, 2025 • 12

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21, 2024 • 44
AutoTrain: No-code training for state-of-the-art models

Paper • 2410.15735 • Published Oct 21, 2024 • 59
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 30
LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2, 2024 • 27

YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection

Paper • 2406.11641 • Published Jun 17, 2024
YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published Feb 18, 2025 • 12
DGE-YOLO: Dual-Branch Gathering and Attention for Accurate UAV Object Detection

Paper • 2506.23252 • Published Jun 29, 2025
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection

Paper • 2512.23273 • Published Dec 29, 2025 • 14

YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published Feb 18, 2025 • 12
YOLOv10: Real-Time End-to-End Object Detection

Paper • 2405.14458 • Published May 23, 2024 • 6

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Paper • 2502.13347 • Published Feb 19, 2025 • 30
YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published Feb 18, 2025 • 12

YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published Feb 18, 2025 • 12

Object detection 🔍

YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems

Paper • 2408.09332 • Published Aug 18, 2024 • 2
YOLOv10: Real-Time End-to-End Object Detection

Paper • 2405.14458 • Published May 23, 2024 • 6
End-to-End Object Detection with Transformers

Paper • 2005.12872 • Published May 26, 2020 • 7
YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published Feb 18, 2025 • 12

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21, 2024 • 44
AutoTrain: No-code training for state-of-the-art models

Paper • 2410.15735 • Published Oct 21, 2024 • 59
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 30
LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2, 2024 • 27

interesting architecture

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3, 2024 • 29
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 90
Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31, 2025 • 24
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Paper • 2502.09509 • Published Feb 13, 2025 • 8

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs