Microsoft is ushering in a transformative new era of computing, fundamentally redefining the user experience in Windows 11 by endowing its Copilot AI assistant with powerful new “senses” and capabilities. The assistant is evolving beyond a simple chatbot to become a true digital collaborator, equipped with Voice, Vision, and the ability to take Action—effectively turning every Windows 11 machine into an “AI PC.”
This substantial upgrade shifts Copilot’s role from a passive question-answer tool to an agentic AI, meaning it can now process complex, multi-step requests and execute them across the operating system and applications on the user’s behalf.
The most immediate and accessible change is the introduction of Copilot Voice, which allows for completely natural, hands-free interaction with the PC. By simply saying the wake phrase, “Hey Copilot,” users can launch the assistant and engage in a fluid conversation, similar to interacting with a smart speaker.
This conversational mode is designed to bridge the gap between human intent and AI prompting. Instead of needing to craft the perfect search query, a user can simply express what they want in plain language: “Find the budget report I downloaded last week and create a summary,” or “Turn on Focus mode for 45 minutes with no notifications.” The feature also introduces a polite “Goodbye” command to end the session, adding a human-like touch to the interaction. Microsoft notes that early usage shows users engage with Copilot twice as much when using voice, underscoring the power of natural speech in making AI more accessible.
Adding a crucial layer of context, Copilot Vision grants the AI the ability to “see” and interpret what is currently displayed on the user’s screen. When given permission, Copilot can analyze open apps, documents, or even a full desktop view, offering guidance based on the visual information.
This capability is a game-changer for step-by-step assistance and education.
The Vision feature is explicitly opt-in: users select which application to share, up to two at a time, giving them clear control over their privacy.
The most significant step toward an “agentic OS” is Copilot Actions. This experimental feature, rolling out first to Windows Insiders through Copilot Labs, empowers the AI to move beyond giving advice and actually perform complex tasks across local files and desktop applications.
Acting as a digital assistant, Copilot Actions can “click, type, and scroll” to execute multi-step requests defined by the user in natural language.
Microsoft has been careful to address security and control concerns, particularly following previous discussions around AI features. Copilot Actions is off by default and operates within a contained workspace separate from the main user environment. Users maintain full control: they can monitor the AI’s progress in real time, pause the action, take over, or revoke access at any point.
By integrating Voice, Vision, and Actions, Microsoft is positioning Copilot as the central, intelligent layer of the Windows 11 experience. This transformation is not limited to new “Copilot+ PCs”; Microsoft has confirmed that these new capabilities will be rolling out to all supported Windows 11 devices, effectively bringing powerful agentic AI to a massive user base.
The update signals a bold future where the PC is no longer just a collection of apps and files, but an active, intelligent partner ready to take on the heavy lifting of daily tasks—simply by being asked.