Enhancing Digital Interactions with AI Agents for Greater Convenience and Efficiency

Authors

  • Mengtong Xiang, Department of Computer Science, George Mason University, Fairfax, VA
  • Ziyu Yao, Department of Computer Science, George Mason University, Fairfax, VA

Abstract

AI agents are software entities that perform digital tasks autonomously and adapt to new challenges and
environments without direct human control. Recently, AI agents powered by large language models such as GPT-4 have
been integrated into operating systems such as Ubuntu, Windows, and macOS. These agents can install apps, edit images,
remove trackers from websites, and automate routine digital tasks. OSWorld is a platform that supports research on
such agents. In this project, we reproduced the OSWorld environment and analyzed the ability of GPT-4o-based agents to
complete digital tasks. Our results suggest that AI agents can be applied across a wide range of digital
applications, offering practical assistance to users in various settings.

Published

2024-10-13

Section

College of Engineering and Computing: Department of Computer Science