Reporium

pliang279/awesome-multimodal-ml

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

Builder

pliang279 • individual

Stars

6,874

Using upstream star count

Forks

901

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 27, 2019

Project creation date

README Summary

By [Paul Liang](http://www.cs.cmu.edu/~pliang/) (pliang@cs.cmu.edu), [Machine Learning Department](http://www.ml.cmu.edu/) and [Language Technologies Institute](https://www.lti.cs.cmu.edu/), [CMU](https://www.cmu.edu/), with help from members of the [MultiComp Lab](http://multicomp.cs.cmu.edu/) at LTI, CMU. If there are any areas, papers, and datasets I missed, please let me know!

AI Dev Skills

Unmapped

computer-vision, deep-learning, machine-learning, multimodal, research

Tags

computer-vision, deep-learning, machine-learning, multimodal, research, AI Agents, AI Safety, Benchmarking, Computer Vision, Course, Database, Deep Learning, Distillation, Embeddings, Evals, Forked, Grasping, Healthcare AI, Image Generation, Knowledge Graph, Machine Learning, Multi-Agent, Multimodal AI, Music Tech, OpenAI, Planning / CoT, PyTorch, Real-Time / Streaming, Reinforcement Learning, Research / Papers, Robot Learning, Robotics, Segmentation, Speech to Text, TensorFlow, Text to Speech, Transformers, Tutorial, Video Generation

Taxonomy

category

Generative Media, Foundation Models, AI Agents, RAG & Retrieval, Model Training, Evals & Benchmarking, Inference & Serving, Computer Vision, Robotics, Dev Tools & Automation, Learning Resources, Industry: Healthcare, Industry: Audio & Music, Security & Safety

Skill Areas

multimodal, research, machine-learning, deep-learning, computer-vision

tag

AI Agents, AI Safety, Active, Benchmarking, Computer Vision, Course, Database, Deep Learning, Distillation, Embeddings, Evals, Forked, Grasping, Healthcare AI, Image Generation, Knowledge Graph, Machine Learning, Multi-Agent, Multimodal AI, Music Tech, OpenAI, Planning / CoT, PyTorch, Real-Time / Streaming, Reinforcement Learning, Research / Papers, Robot Learning, Robotics, Segmentation, Speech to Text, TensorFlow, Text to Speech, Transformers, Tutorial, Video Generation

Recent Activity

Updated 1 year ago

7 Days

0

30 Days

0

90 Days

0

Quality

Quality: high
Maturity: research

Categories

Foundation Models (Primary), AI Agents, RAG & Retrieval, Model Training, Evals & Benchmarking, Inference & Serving, Generative Media, Computer Vision, Robotics, Safety & Alignment, Coding & Dev Tools, Healthcare & Biology, Multimodal AI, Edge & Mobile AI, Search & Knowledge, Other AI / ML, Dev Tools & Automation, Learning Resources, Industry: Healthcare, Industry: Audio & Music, Security & Safety

PM Skills

Safety & Alignment, User Experience, Scale & Reliability, Data & Evaluation, Product Discovery, AI-Native Architecture

Languages

No language breakdown recorded.

Timeline

Project created: May 27, 2019
Forked: Mar 31, 2026
Your last push: 1 year ago
Upstream last push: 1 year ago
Tracked since: Aug 20, 2024

Similar Repos

pgvector cosine similarity · $0