Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/llmware
Library/llmwareForked

llmware-ai/llmware

llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

View on GitHub↗Upstream llmware-ai/llmware↗

Builder

llmware-ai

llmware-ai

llmware-ai • individual

Stars

14,849

Using upstream star count

Forks

2,930

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Sep 29, 2023

Project creation date

README Summary

llmware ![Static Badge](https://img.shields.io/badge/python-3.10_%7C_3.11%7C_3.12%7C_3.13%7C_3.14-blue?color=blue) ![PyPI - Version](https://img.shields.io/pypi/v/llmware?color=blue) [![members](https://discord-live-members-count-badge.vercel.app/api/discord-members?guildId=1179245642770559067&label=discord%20members&color=5865F2)](https://discord.gg/bphreFK4NJ) [![Documentation](https://github.com/llmware-ai/llmware/actions/workflows/pages.yml/badge.svg)](https://github.com/llmware-ai/llmware/a

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Document Processing and ParsingEnterprise AI Pipeline DevelopmentKnowledge Graph ConstructionModel Fine-tuning and OptimizationMulti-document Question AnsweringRetrieval-Augmented GenerationSmall Language ModelsStructured Data Extraction from TextText Embedding and Semantic SearchVector Database Integration

Tags

Document Processing and ParsingEnterprise AI Pipeline DevelopmentKnowledge Graph ConstructionModel Fine-tuning and OptimizationMulti-document Question AnsweringRetrieval-Augmented GenerationSmall Language ModelsStructured Data Extraction from TextText Embedding and Semantic SearchVector Database IntegrationAPIAnthropic / ClaudeAutomationAzure AIBenchmarkingC++CachingChromaChunkingClaudeContext EngineeringCourseCurated ListDatabaseDeepSeekDockerDocument ProcessingEmbeddingsEvalsFinTechForkedGPU / CUDAGoogle AIHuggingFaceImage GenerationInferenceLLM ServingLarge Language ModelsMilvusMistralModel OptimizationNumPyONNXOllamaOpen SourceOpenAIPhiPineconePyTorchPythonQdrantQuantizationQwenRAGReal-Time / StreamingSecuritySpeech to TextTool UseTransformersTutorialVector Databasellama.cpppgvector

Taxonomy

AI Trends

Small Language ModelsCompound AI SystemsEnterprise AIPrivate AIAgentic AIOn-premise AI

category

Foundation ModelsAI AgentsRAG & RetrievalModel TrainingEvals & BenchmarkingInference & ServingGenerative MediaMLOps & InfrastructureDev Tools & AutomationCloud & PlatformsLearning ResourcesIndustry: FinTechSecurity & SafetyData Science & Analytics

Deployment Context

Self-hostedOn-premiseCloud APIDocker Containers

Industries

Legal TechFinTechHealthcareInsuranceProfessional ServicesGovernment and Compliance

Modalities

TextTabular

Skill Areas

Retrieval-Augmented GenerationSmall Language ModelsDocument Processing and ParsingVector Database IntegrationKnowledge Graph ConstructionText Embedding and Semantic SearchMulti-document Question AnsweringEnterprise AI Pipeline DevelopmentModel Fine-tuning and OptimizationStructured Data Extraction from Text

tag

APIActiveAnthropic / ClaudeAutomationAzure AIBenchmarkingC++CachingChromaChunkingClaudeContext EngineeringCourseCurated ListDatabaseDeepSeekDockerDocument ProcessingEmbeddingsEvalsFinTechForkedGPU / CUDAGoogle AIHuggingFaceImage GenerationInferenceLLM ServingLarge Language ModelsMilvusMistralModel OptimizationNumPyONNXOllamaOpen SourceOpenAIPhiPineconePyTorchPythonQdrantQuantizationQwenRAGReal-Time / StreamingSecuritySpeech to TextTool UseTransformersTutorialVector Databasellama.cpppgvector

Use Cases

Document Question AnsweringContract Analysis and ReviewFinancial Document ProcessingCompliance and Regulatory AnalysisEnterprise Knowledge ManagementAutomated Research and SummarizationMulti-document IntelligencePrivate Document Chat Systems

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

12

Merge pull request #1282 from llmware-ai/update-032626-linux-x86-gguf

Darren Oberst • Mar 26, 2026

377f01a

update linux x86 gguf backend 8538 build

Darren Oberst • Mar 26, 2026

1bc7a0d

Merge pull request #1281 from llmware-ai/update-032526-mac-gguf

Darren Oberst • Mar 25, 2026

bdb61ad

Quality

beta
Quality
high
Maturity
beta

Categories

RAG & RetrievalPrimaryEvals & BenchmarkingInference & ServingMLOps & InfrastructureDev Tools & AutomationCloud & PlatformsLearning ResourcesIndustry: FinTechSecurity & SafetyData Science & AnalyticsFoundation ModelsAI AgentsModel TrainingGenerative MediaFinance & LegalEdge & Mobile AIOther AI / ML

PM Skills

Cost & EfficiencyUser ExperienceScale & ReliabilityData & EvaluationProduct DiscoveryDeveloper PlatformAI-Native Architecture

Languages

Python100.0%

Timeline

Project created
Sep 29, 2023
Forked
Mar 29, 2026
Your last push
2 months ago
Upstream last push
17 days ago
Tracked since
Mar 26, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…