From 9525f70b103b6d84cb04a59889e79a7f680724b4 Mon Sep 17 00:00:00 2001
From: Junyang Lin
Date: Wed, 13 Sep 2023 16:55:42 +0800
Subject: [PATCH] Update README.md

---
 README.md | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/README.md b/README.md
index 34a9a98..1ac4c21 100644
--- a/README.md
+++ b/README.md
@@ -15,6 +15,10 @@



+__Will be back soon...___
+
+---
+
 **Qwen-VL** (Qwen Large Vision Language Model) is the multimodal version of the large model series, Qwen (abbr. Tongyi Qianwen), proposed by Alibaba Cloud. Qwen-VL accepts image, text, and bounding box as inputs, outputs text and bounding box. The features of Qwen-VL include:
 - **Strong performance**: It significantly surpasses existing open-sourced Large Vision Language Models (LVLM) under similar model scale on multiple English evaluation benchmarks (including Zero-shot Captioning, VQA, DocVQA, and Grounding).