Reporium

oxnr/awesome-bigdata

awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.


Builder

oxnr • individual

Stars

14,410

Using upstream star count

Forks

2,581

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jul 4, 2014

Project creation date

README Summary

[![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome)

AI Dev Skills

Unmapped

Batch Processing, Big Data Engineering, Data Pipeline Design, Data Storage Systems, Data Warehouse Design, Distributed Systems Architecture, ETL/ELT Processes, Stream Processing

Tags

Batch Processing, Big Data Engineering, Data Pipeline Design, Data Storage Systems, Data Warehouse Design, Distributed Systems Architecture, ETL/ELT Processes, Stream Processing, AI Agents, AWS, AWS Bedrock, Airflow, Benchmarking, C++, Caching, Curated List, Data Engineering, Data Science, Data Visualization, Database, Deep Learning, Docker, Document Processing, Embeddings, Evals, Feature Store, FinTech, Forked, Google Cloud, GraphQL, Haystack, JavaScript, Keras, Kubernetes, LLM Serving, Large Language Models, MLOps, Machine Learning, Mobile, Node.js, NumPy, Open Source, PyTorch, Python, React / Next.js, Real-Time / Streaming, Reinforcement Learning, Research / Papers, Rust, Scikit-learn, Security, Semantic Search, Simulation, Spark, Statistics, TensorFlow, Tutorial, Visualization, Weaviate

Taxonomy

AI Trends

MLOps, Data-Centric AI, Real-time ML Inference, Feature Stores, AutoML

Category

MLOps & Infrastructure, Foundation Models, AI Agents, RAG & Retrieval, Model Training, Evals & Benchmarking, Inference & Serving, Robotics, Dev Tools & Automation, Cloud & Platforms, Learning Resources, Industry: FinTech, Industry: Gaming, Security & Safety, Data Science & Analytics

Deployment Context

Cloud, On-premise, Hybrid Cloud, Distributed Clusters

Industries

FinTech, Healthcare, E-commerce, Telecommunications, Manufacturing, Media & Entertainment, Government, Research Institutions

Modalities

Tabular, Text, Time Series, Graph Data, Geospatial Data

Skill Areas

Big Data Engineering, Distributed Systems Architecture, Data Pipeline Design, Stream Processing, Batch Processing, Data Storage Systems, Data Warehouse Design, ETL/ELT Processes

Tag

AI Agents, AWS, AWS Bedrock, Airflow, Benchmarking, C++, Caching, Curated List, Data Engineering, Data Science, Data Visualization, Database, Deep Learning, Docker, Document Processing, Embeddings, Evals, Feature Store, FinTech, Forked, Google Cloud, GraphQL, Haystack, JavaScript, Keras, Kubernetes, LLM Serving, Large Language Models, MLOps, Machine Learning, Mobile, Node.js, NumPy, Open Source, PyTorch, Python, React / Next.js, Real-Time / Streaming, Reinforcement Learning, Research / Papers, Rust, Scikit-learn, Security, Semantic Search, Simulation, Spark, Statistics, TensorFlow, Tutorial, Visualization, Weaviate

Use Cases

Large-scale Data Processing, Real-time Analytics, Data Warehousing, Log Processing and Analysis, Machine Learning Data Pipelines, Business Intelligence, Data Lake Management, Stream Analytics

Recent Activity

Updated 3 months ago

7 Days

0

30 Days

0

90 Days

0

Merge pull request #364 from em3s/add-actionbase

Vincent Koc • Feb 5, 2026

3f577a6

Remove extra blank line

Minseok Kim • Feb 2, 2026

6359a85

Add Actionbase to Graph Data Model

Minseok Kim • Feb 2, 2026

25788b4

Quality

Quality: medium
Maturity: production

Categories

Foundation Models (Primary), AI Agents, RAG & Retrieval, Model Training, Evals & Benchmarking, Inference & Serving, Robotics, ML Platform & Infrastructure, Data Science & Analytics, Finance & Legal, Edge & Mobile AI, Search & Knowledge, Other AI / ML, MLOps & Infrastructure, Dev Tools & Automation, Cloud & Platforms, Learning Resources, Industry: FinTech, Industry: Gaming, Security & Safety

PM Skills

Cost & Efficiency, User Experience, Scale & Reliability, Data & Evaluation, Product Discovery, AI-Native Architecture

Languages

No language breakdown recorded.

Timeline

Project created: Jul 4, 2014
Forked: Mar 13, 2026
Your last push: 3 months ago
Upstream last push: 3 months ago
Tracked since: Feb 5, 2026

Similar Repos

pgvector cosine similarity · $0