On August 25,Watch Sukeban Deka the Movie 2: Counter Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
Related Articles
2025-06-26 02:49
1438 views
Hurricane Laura's impact lingered with nightmarish mosquito swarms
After Hurricane Laura hit land in the southern United States in late August and devastated the Louis
Read More
2025-06-26 02:39
1225 views
Apple Maps' new bike
Apple Maps is getting a new bike-route feature in iOS 14, and, if Monday's WWDC presentation is any
Read More
2025-06-26 02:07
255 views
Stephen Colbert calls John Bolton 'naive' during no
If John Bolton was expecting an easy interview with Stephen Colbert, he was way off base.Appearing o
Read More