nvidia
/

nemotron-graphic-elements-v1

@@ -14,24 +14,24 @@ tags:
 - ingestion
 - yolox
 ---
-# Nemoretriever Graphic Element v1
 ## **Model Overview**
 ![viz.png](viz.png)
 *Preview of the model output on the example image.*
-The input of this model is expected to be a chart image. You can use the [Nemoretriever Page Element v3](https://huggingface.co/nvidia/nemoretriever-page-elements-v3) to detect and crop such images.
 ### **Description**
-The **NeMo Retriever Graphic Elements v1** model is a specialized object detection system designed to identify and extract key elements from charts and graphs. Based on YOLOX, an anchor-free version of YOLO (You Only Look Once), this model combines a simpler architecture with enhanced performance. While the underlying technology builds upon work from [Megvii Technology](https://github.com/Megvii-BaseDetection/YOLOX), we developed our own base model through complete retraining rather than using pre-trained weights.
 The model excels at detecting and localizing various graphic elements within chart images, including titles, axis labels, legends, and data point annotations. This capability makes it particularly valuable for document understanding tasks and automated data extraction from visual content.
 This model is ready for commercial/non-commercial use.
-We are excited to announce the open sourcing of this commercial model. For users interested in deploying this model in production environments, it is also available via the model API in NVIDIA Inference Microservices (NIM) at [nemoretriever-graphic-elements-v1](https://build.nvidia.com/nvidia/nemoretriever-graphic-elements-v1).
 ### License/Terms of use
@@ -52,7 +52,7 @@ Global
 ### Use Case
-The **NeMo Retriever Graphic Elements v1** is designed for automating extraction of graphic elements of charts in enterprise documents. Key applications include:
 - Enterprise document extraction, embedding and indexing
 - Augmenting Retrieval Augmented Generation (RAG) workflows with multimodal retrieval
 - Data extraction from legacy documents and reports
@@ -60,7 +60,7 @@ The **NeMo Retriever Graphic Elements v1** is designed for automating extraction
 ### Release Date
-10/23/2025 via https://huggingface.co/nvidia/nemoretriever-graphic-elements-v1
 ### References
@@ -128,11 +128,11 @@ git lfs install
 ```
 - Using https
 ```
-git clone https://huggingface.co/nvidia/nemoretriever-graphic-elements-v1
 ```
 - Or using ssh
 ```
-git clone [email protected]:nvidia/nemoretriever-graphic-elements-v1
 ```
 2. Run the model using the following code:
@@ -184,7 +184,7 @@ We provide examples in the notebook `Demo.ipynb`.
 ### Software Integration
 **Runtime Engine(s):**
-- **NeMo Retriever Page Elements v3** NIM
 **Supported Hardware Microarchitecture Compatibility [List in Alphabetic Order]:**
@@ -201,7 +201,7 @@ This AI model can be embedded as an Application Programming Interface (API) call
 ## Model Version(s):
-* `nemoretriever-graphic-elements-v1`
 ## Training and Evaluation Datasets:

 - ingestion
 - yolox
 ---
+# Nemotron Graphic Element v1
 ## **Model Overview**
 ![viz.png](viz.png)
 *Preview of the model output on the example image.*
+The input of this model is expected to be a chart image. You can use the [Nemotron Page Element v3](https://huggingface.co/nvidia/nemotron-page-elements-v3) to detect and crop such images.
 ### **Description**
+The **Nemotron Graphic Elements v1** model is a specialized object detection system designed to identify and extract key elements from charts and graphs. Based on YOLOX, an anchor-free version of YOLO (You Only Look Once), this model combines a simpler architecture with enhanced performance. While the underlying technology builds upon work from [Megvii Technology](https://github.com/Megvii-BaseDetection/YOLOX), we developed our own base model through complete retraining rather than using pre-trained weights.
 The model excels at detecting and localizing various graphic elements within chart images, including titles, axis labels, legends, and data point annotations. This capability makes it particularly valuable for document understanding tasks and automated data extraction from visual content.
 This model is ready for commercial/non-commercial use.
+We are excited to announce the open sourcing of this commercial model. For users interested in deploying this model in production environments, it is also available via the model API in NVIDIA Inference Microservices (NIM) at [nemotron-graphic-elements-v1](https://build.nvidia.com/nvidia/nemotron-graphic-elements-v1).
 ### License/Terms of use
 ### Use Case
+The **Nemotron Graphic Elements v1** is designed for automating extraction of graphic elements of charts in enterprise documents. Key applications include:
 - Enterprise document extraction, embedding and indexing
 - Augmenting Retrieval Augmented Generation (RAG) workflows with multimodal retrieval
 - Data extraction from legacy documents and reports
 ### Release Date
+10/23/2025 via https://huggingface.co/nvidia/nemotron-graphic-elements-v1
 ### References
 ```
 - Using https
 ```
+git clone https://huggingface.co/nvidia/nemotron-graphic-elements-v1
 ```
 - Or using ssh
 ```
+git clone [email protected]:nvidia/nemotron-graphic-elements-v1
 ```
 2. Run the model using the following code:
 ### Software Integration
 **Runtime Engine(s):**
+- **Nemotron Page Elements v3** NIM
 **Supported Hardware Microarchitecture Compatibility [List in Alphabetic Order]:**
 ## Model Version(s):
+* `nemotron-graphic-elements-v1`
 ## Training and Evaluation Datasets: