Skip to content

xlang-ai/AgentNetTool

Repository files navigation

AgentNetTool: Computer-Use Task Annotation Platform

  🌐 Website   |   📑 Paper   |   🤗 Dataset   |   🤖 Model   |   🔧 Tool Document   |   🎮 Model Demo  

This is the code base of AgentNetTool in OpenCUA: Open Foundations for Computer-Use Agents. AgentNetTool is a cross-platform software to annotate computer-use agent tasks. It can record the full video with all the keyboard and mouse inputs as well as the other important system meta data (HTML, A11ytree elements, ...) during the annotator performing the task.

Document: Installation, Setup and User Guidelines

To use the AgentNetTool out of box, you can directly install AgentNetTool on your computer. Please follow the AgentNetTool Documentation. The guide covers downloading, installation, setup, and instructions on how to start annotating with the tool.

Build from source

If you’d like to build the tool from source or modify its code, please follow the instructions below.

Have Python >=3.11.

Clone this repo and cd into it:

$ git clone https://github.com/xlang-ai/AgentNetTool.git
$ cd AgentNetTool

Install the dependencies for this project:

$ pip install -r requirements.txt
$ cd agentnet-annotator
$ npm install

Build the application:

$ npm run build-flask  # build backend
$ npm run make

The built application should be located in the generated agentnet-annotator/out directory, where you may find the software icon to start the tool. After this, follow the remaining relevant setup instructions in the document to set up OBS, plugin and other required configurations.

Note: If you'd like to build the tool on Ubuntu, please use Ubuntu 22.04.

Acknowledge

We thank Yu Su, Caiming Xiong, and the anonymous reviewers for their insightful discussions and valuable feedback. We are grateful to Moonshot AI for providing training infrastructure and annotated data. We also sincerely appreciate Hao Yang, Zhengtao Wang, and Yanxu Chen from the Kimi Team for their strong infrastructure support and helpful guidance. The development of our tool is based on the open-source projects-DuckTrack and OpenAdapt. We are very grateful to their commitment to the open source community. Finally, we extend our deepest thanks to all annotators for their tremendous effort and contributions to this project.

Research Use and Disclaimer

OpenCUA is intended for research and educational purposes only.

Prohibited Uses

  • The model, dataset, tool, and code may not be used for any purpose or activity that violates applicable laws or regulations in any jurisdiction
  • Use for illegal, unethical, or harmful activities is strictly prohibited

Disclaimer

  • The authors, contributors, and copyright holders are not responsible for any illegal, unethical, or harmful use of the Software, nor for any direct or indirect damages resulting from such use
  • Use of the "OpenCUA" name, logo, or trademarks does not imply any endorsement or affiliation unless separate written permission is obtained
  • Users are solely responsible for ensuring their use complies with applicable laws and regulations

Citation

If you use OpenCUA in your research, please cite our work:

@misc{wang2025opencuaopenfoundationscomputeruse,
      title={OpenCUA: Open Foundations for Computer-Use Agents}, 
      author={Xinyuan Wang and Bowen Wang and Dunjie Lu and Junlin Yang and Tianbao Xie and Junli Wang and Jiaqi Deng and Xiaole Guo and Yiheng Xu and Chen Henry Wu and Zhennan Shen and Zhuokai Li and Ryan Li and Xiaochuan Li and Junda Chen and Boyuan Zheng and Peihang Li and Fangyu Lei and Ruisheng Cao and Yeqiao Fu and Dongchan Shin and Martin Shin and Jiarui Hu and Yuyan Wang and Jixuan Chen and Yuxiao Ye and Danyang Zhang and Dikang Du and Hao Hu and Huarong Chen and Zaida Zhou and Haotian Yao and Ziwei Chen and Qizheng Gu and Yipu Wang and Heng Wang and Diyi Yang and Victor Zhong and Flood Sung and Y. Charles and Zhilin Yang and Tao Yu},
      year={2025},
      eprint={2508.09123},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2508.09123}, 
}

About

This is the official code base of AgentNetTool in OpenCUA. Website: https://opencua.xlang.ai/

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published