You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+8-1Lines changed: 8 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -20,7 +20,7 @@ ome
20
20
21
21
## Key Features
22
22
-**Compatibility**: Designed for various multimodal models.
23
-
-**Integration**: Currently integrated with **GPT-4o, o1, Gemini Pro Vision, Claude 3 and LLaVa.**
23
+
-**Integration**: Currently integrated with **GPT-4o, o1, Gemini Pro Vision, Claude 3, Qwen-VL and LLaVa.**
24
24
-**Future Plans**: Support for additional models.
25
25
26
26
## Demo
@@ -76,6 +76,13 @@ Use Claude 3 with Vision to see how it stacks up to GPT-4-Vision at operating a
76
76
operate -m claude-3
77
77
```
78
78
79
+
#### Try qwen `-m qwen-vl`
80
+
Use Qwen-vl with Vision to see how it stacks up to GPT-4-Vision at operating a computer. Navigate to the [Qwen dashboard](https://bailian.console.aliyun.com/) to get an API key and run the command below to try it.
81
+
82
+
```
83
+
operate -m qwen-vl
84
+
```
85
+
79
86
#### Try LLaVa Hosted Through Ollama `-m llava`
80
87
If you wish to experiment with the Self-Operating Computer Framework using LLaVA on your own machine, you can with Ollama!
81
88
*Note: Ollama currently only supports MacOS and Linux. Windows now in Preview*
0 commit comments