Manish Tiwari

PredictiveManish

AI & ML interests

Natural Language Processing

Recent Activity

liked a Space 3 days ago

SandLogicTechnologies/Shakti-250M

An efficient small multi-language model for edge AI

replied to RakshitAralimatti's post about 1 month ago

But can we depend on OCR in healthcare?

reacted to RakshitAralimatti's post with 🚀 about 1 month ago

Post

1374

OCR has absolutely blown up in 2025, and honestly, my perspective on document processing has completely changed.

This year has been wild. Vision Language Models like Nanonets OCR2-3B hit the scene, and suddenly we're getting higher accuracy on complex forms than with traditional OCR. We're talking handwritten checkboxes, watermarked documents, multi-column layouts, even LaTeX equations, all handled in a single pass.

The market numbers say it all: OCR accuracy passed 98% for printed text, AI integration is everywhere, and real-time processing is now standard. The entire OCR market is hitting $25.13 billion in 2025 because this tech actually works now.

I wrote a detailed Medium article walking through:

1. Why vision LMs changed the game
2. NVIDIA NeMo Retriever architecture
3. Complete code breakdown
4. Real government/healthcare use cases
5. Production deployment guide

Article: https://medium.com/@rakshitaralimatti2001/nvidia-nemo-retriever-ocr-building-document-intelligence-systems-for-enterprise-and-government-42a6684c37a1

Try It Yourself

3 replies

liked a model about 1 month ago

suparnojit/gemma-3-270m-finetune-gguf

0.3B • Updated Nov 13 • 4 • 1

liked a model about 2 months ago

maya-research/maya1

Text-to-Speech • 3B • Updated Nov 12 • 81.9k • 833

updated a model 3 months ago

PredictiveManish/wall-crack-detection

Object Detection • Updated Oct 7

published a model 3 months ago

PredictiveManish/wall-crack-detection

Object Detection • Updated Oct 7

updated a dataset 3 months ago

PredictiveManish/bhasha-sangrah

Viewer • Updated Oct 7 • 499 • 24

published a dataset 3 months ago

PredictiveManish/bhasha-sangrah

Viewer • Updated Oct 7 • 499 • 24

New activity in sarvamai/sarvam-translate 4 months ago

Update README.md

#17 opened 4 months ago by

liked 3 models 7 months ago

bharatgenai/patram-7b-instruct

Image-Text-to-Text • 8B • Updated Jun 7 • 160 • 30

ai4bharat/IndicNER

Token Classification • Updated Dec 21, 2022 • 1.65k • 27

ai4bharat/IndicF5

Text-to-Speech • 0.4B • Updated Mar 12 • 6.59k • 89

New activity in sarvamai/sarvam-m 7 months ago

Many people who haven't used Hugging Face are stuck downloading models

#5 opened 7 months ago by

PredictiveManish

liked a model 7 months ago

sarvamai/sarvam-m

Text Generation • 24B • Updated May 28 • 639 • 315

liked a model 10 months ago

ai4bharat/hercule-hi-lora

Text Generation • 8B • Updated Oct 18, 2024 • 14 • 1