NVIDIA/NeMo-Retriever

NeMo-Retriever

NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.

Builder

NVIDIA

NVIDIA • big-tech

Stars

2,893

Using upstream star count

Forks

311

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Aug 22, 2024

Project creation date

README Summary

NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice developed by NVIDIA. It leverages specialized NVIDIA NIM microservices to extract and contextualize text, tables, charts, and images from documents. The extracted content is optimized for use in downstream generative AI applications.

AI Dev Skills

Unmapped

Retrieval-Augmented Generation, Document Processing, Multimodal Content Extraction, Microservices Architecture, Information Retrieval, Computer Vision for Document Analysis, Natural Language Processing, Table Understanding, Chart Analysis, Content Contextualization

Tags

Retrieval-Augmented Generation, Document Processing, Multimodal Content Extraction, Microservices Architecture, Information Retrieval, Computer Vision for Document Analysis, Natural Language Processing, Table Understanding, Chart Analysis, Content Contextualization, On-premise, Enterprise Content Management, Intelligent Document Processing, Microservices, Document Summarization, Legal Tech, Research Paper Processing, Insurance, Table and Chart Analysis, Multimodal, Healthcare, Regulatory Document Analysis, AI Microservices, Compound AI Systems, Self-hosted, Document Question Answering, Text, Cloud API, Knowledge Base Construction, Multimodal AI, Image, Content Extraction from PDFs, Enterprise AI, Publishing, Tabular, FinTech, Research and Academia, Python

Taxonomy

Recent Activity

Updated 27 days ago

7 Days

0

30 Days

0

90 Days

0

Quality

Quality: high
Maturity: beta

Categories

Foundation Models (Primary), Dev Tools & Automation, Learning Resources, Industry: FinTech, RAG & Retrieval, Evals & Benchmarking, NLP & Text, Healthcare & Biology, Finance & Legal, Multimodal AI, Edge & Mobile AI, Search & Knowledge, Other AI / ML, Robotics, Computer Vision

PM Skills

Developer Platform

Languages

Python 100.0%

Timeline

Project created
Aug 22, 2024
Forked
Mar 14, 2026
Your last push
27 days ago
Upstream last push
6 days ago
Tracked since
Mar 17, 2026
