multimodal - Search News


2 days ago

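Unveiling the Secrets of Multimodal Large Language Models: How Residual Structures Facilitate Cross-Modal Understanding

In recent years, Multimodal Large Language Models (MLLMs) have made notable progress on tasks that combine vision and language. By integrating image and text data, MLLMs can both generate text descriptions of visual content and interpret images according to text instructions. What is the secret behind these capabilities?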

China.org.cn · 7 days ago

YTL Power launches Malaysia's first homegrown AI model

To catalyze adoption and innovation, YTL AI Labs also announced the ILMU AI Accelerator Programme, which is open to Malaysian startups, small and medium enterprises, and global solution providers ...

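科技行者 on MSN · 16 days ago

Microsoft Releases Phi-4-Mini: A 3.8B-Parameter "Pocket Powerhouse" Whose Multimodal Performance Rivals Models Twice Its Size

This research on the latest AI models, developed by a Microsoft research team, was published in March 2025. The paper details the technical design and performance of two models, Phi-4-Mini and Phi-4-Multimodal. Readers who want to dig deeper can access the full paper at arXiv:2503.01743v2.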

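From MSN · 5 months ago

Microsoft Releases Phi-4 Multimodal and Phi-4 Mini Small Language Models - MSN

品玩, February 27: According to Cnbeta, Microsoft has announced two new models for the Phi-4 series of small language models that it launched last December, bringing further updates. The two models are Phi-4-multimodal ...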
