Tag: Multimodal Learning
All the papers with the tag "Multimodal Learning".
ReGraP-LLaVA: Reasoning enabled Graph-based Personalized Large Language and Vision Assistant
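grok-3-latest | Score: 0.56 | Published: at 16:00
This paper proposes ReGraP-LLaVA, a model that strengthens the relational reasoning of personalized multimodal large language models using knowledge graphs and chain-of-thought question-answering data, significantly improving contextual understanding and performance on complex tasks.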
SEFE: Superficial and Essential Forgetting Eliminator for Multimodal Continual Instruction Tuning
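grok-3-latest | Score: 0.64 | Published: at 09:09
This paper proposes SEFE, which addresses superficial forgetting and essential forgetting in multimodal continual instruction tuning through the ASD paradigm and RegLoRA, respectively, significantly improving model performance and achieving state-of-the-art forgetting mitigation.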
SEFE: Superficial and Essential Forgetting Eliminator for Multimodal Continual Instruction Tuning
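grok-3-latest | Score: 0.61 | Published: at 09:09
This paper proposes SEFE, which mitigates superficial forgetting and essential forgetting in multimodal continual instruction tuning through the ASD paradigm and RegLoRA, respectively, significantly improving model performance in continual learning.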