-
YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection
Paper • 2406.11641 • Published -
YOLOv12: Attention-Centric Real-Time Object Detectors
Paper • 2502.12524 • Published • 12 -
DGE-YOLO: Dual-Branch Gathering and Attention for Accurate UAV Object Detection
Paper • 2506.23252 • Published -
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection
Paper • 2512.23273 • Published • 14
Collections
Discover the best community collections!
Collections including paper arxiv:2502.12524
-
YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems
Paper • 2408.09332 • Published • 2 -
YOLOv10: Real-Time End-to-End Object Detection
Paper • 2405.14458 • Published • 6 -
End-to-End Object Detection with Transformers
Paper • 2005.12872 • Published • 7 -
YOLOv12: Attention-Centric Real-Time Object Detectors
Paper • 2502.12524 • Published • 12
-
FAN: Fourier Analysis Networks
Paper • 2410.02675 • Published • 29 -
Tensor Product Attention Is All You Need
Paper • 2501.06425 • Published • 90 -
Scalable-Softmax Is Superior for Attention
Paper • 2501.19399 • Published • 24 -
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
Paper • 2502.09509 • Published • 8
-
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
Paper • 2410.16153 • Published • 44 -
AutoTrain: No-code training for state-of-the-art models
Paper • 2410.15735 • Published • 59 -
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Paper • 2410.12787 • Published • 30 -
LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks
Paper • 2410.01744 • Published • 27
-
YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection
Paper • 2406.11641 • Published -
YOLOv12: Attention-Centric Real-Time Object Detectors
Paper • 2502.12524 • Published • 12 -
DGE-YOLO: Dual-Branch Gathering and Attention for Accurate UAV Object Detection
Paper • 2506.23252 • Published -
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection
Paper • 2512.23273 • Published • 14
-
YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems
Paper • 2408.09332 • Published • 2 -
YOLOv10: Real-Time End-to-End Object Detection
Paper • 2405.14458 • Published • 6 -
End-to-End Object Detection with Transformers
Paper • 2005.12872 • Published • 7 -
YOLOv12: Attention-Centric Real-Time Object Detectors
Paper • 2502.12524 • Published • 12
-
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
Paper • 2410.16153 • Published • 44 -
AutoTrain: No-code training for state-of-the-art models
Paper • 2410.15735 • Published • 59 -
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Paper • 2410.12787 • Published • 30 -
LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks
Paper • 2410.01744 • Published • 27
-
FAN: Fourier Analysis Networks
Paper • 2410.02675 • Published • 29 -
Tensor Product Attention Is All You Need
Paper • 2501.06425 • Published • 90 -
Scalable-Softmax Is Superior for Attention
Paper • 2501.19399 • Published • 24 -
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
Paper • 2502.09509 • Published • 8