News

Hoantrbl changed the title How can I become the contributor? I have deployed the kimi-vl on this framework. I want to contribute it. Apr 22, 2025 Hoantrbl changed the title I have deployed the kimi-vl ...
Abstract: By harnessing the capabilities of large language models (LLMs), recent large multimodal models (LMMs) have shown remarkable versatility in open-world multimodal understanding. Nevertheless, ...
Large Multimodal Models (LMMs) have demonstrated remarkable capabilities when trained on extensive visual-text paired data, advancing multimodal understanding tasks significantly. However, these ...
Abstract: Existing Large Multimodal Models (LMMs) demonstrate excellent performance in handling visual tasks in everyday scenarios. However, they still face challenges in understanding structured ...