Autonomous generative AI agents: Under development

Autonomous gen AI agents—agentic AI—could increase the productivity of knowledge workers and make workflows of all kinds more efficient. But the “autonomous” part may take time for wide adoption.

Jeff Loucks

United States

Gillian Crossan

United States

Baris Sarer

United States

China Widener

United States

Autonomous generative AI agents, referred to as “agentic AI,” are software solutions that can complete complex tasks and meet objectives with little or no human supervision. Agentic AI is different from today’s chatbots and co-pilots, which themselves are often called “agents.” Agentic AI has the potential to make knowledge workers more productive and to automate multi-step processes across business functions. Deloitte predicts that in 2025, 25% of companies that use gen AI will launch agentic AI pilots or proofs of concept, growing to 50% in 2027.1 Some agentic AI applications, in some industries, and for some use cases, could see actual adoption into existing workflows in 2025, especially by the back half of the year.

Their efforts are being aided by startups and established tech companies developing agentic AI, both of which see the technology’s potential to spur revenue growth. Investors have poured over $2 billion into agentic AI startups in the past two years, focusing their investment on companies that target the enterprise market.2 Meanwhile, many tech companies, cloud providers, and others are developing their own agentic AI offerings. They are also making strategic acquisitions, and increasingly licensing agentic AI technology from startups and hiring their employees, instead of buying the companies outright.3

Agentic AI puts the “agency” in agent

Gen AI chatbots and co-pilots are sophisticated; they can interact intuitively with humans, synthesize complex information, and generate content. But they lack the degree of agency and autonomy that agentic AI promises. While chatbots and agents share the same foundation—large language models (LLM)—additional technologies and techniques enable agents to act independently, break a job down into discrete steps, and complete their work with minimal human supervision or intervention. AI agents don’t just interact. They more effectively reason and act on behalf of the user.

As its name suggests, agentic AI has “agency”: the ability to act, and to choose which actions to take.4 Agency implies autonomy, which is the power to act and make decisions independently.5 When we extend these concepts to agentic AI, we can say it can act on its own to plan, execute, and achieve a goal—it becomes “agentic”. 6 The goals are set by humans, but the agents determine how to fulfill those goals.

An example can illustrate the difference between agentic AI and co-pilots and chatbots. Co-pilots that assist software developers by testing and suggesting code are one of the most successful gen AI use cases to date.7 They can make experienced software engineers more productive and increase the effectiveness of junior coders. They can convert natural language prompts (in multiple languages) into suggestions for code, and test code for consistency. But such co-pilots only respond to prompts from engineers and do not show agency. With agentic AI, the software “engineer” takes this a step further. A human coder can enter ideas for software through a prompt, and the agentic AI “software engineer” converts those ideas into executable code, a process that automates multiple steps in the software development process.

For example, Cognition Software launched “Devin” in March 2024 with the goal of creating an autonomous software engineer capable of reasoning, planning, and completing complex engineering tasks that require thousands of decisions.8 Devin was designed to perform programming jobs unassisted, based on natural language prompts from human programmers. These jobs include designing full applications, testing and fixing codebases, and training and tuning LLMs.9 Competitors like Codeium, which focuses on enterprise software development, and open-source versions of Devin, hit the market in summer 2024.10

Agentic AI software engineers share similar capabilities and vulnerabilities.11 One vulnerability is that they currently make too many errors to handle full, or even partial, jobs without human oversight. In a recent benchmarking test, Devin was able to resolve nearly 14% of GitHub issues from real-world code repositories—twice as good as LLM-based chatbots,12 but not fully autonomous. Big tech companies13 and startups are striving to make agentic AI software engineers more autonomous and reliable, so human coders—and their employers—can trust them to handle parts of their workload (figure 1). 

Augmenting and amplifying labor productivity

Agentic AI software engineers are just one example of how autonomous generative AI agents could transform how work is done (see Promising use cases for autonomous gen AI agents below). As agentic AI improves, its impact could be enormous. There are over 100 million knowledge workers in the US, and over 1.25 billion knowledge workers globally.14 Total factor productivity,15 a useful proxy for knowledge work, has stagnated in the United States, growing 0.8% from 1987 to 2023 and only 0.5% from 2019 to 2023.16 In most OECD countries, the story is the same.17 Attempts to increase the productivity of knowledge work by automating tasks have met with only partial success. Many companies also need more knowledge workers. Shortfalls of customer service representatives, semiconductor engineers, and seemingly everything in between persist. When new workers start, they need to be productive quickly.

Expert systems and robotic process automation (RPA) can falter when processes are ambiguous or require multiple steps. Systems based on traditional machine learning require extensive training, which is tailored for specific purposes. Built on LLMs, agentic AI can be more flexible, and it can address a broader range of use cases than machine learning or deep learning.

Agentic AI can significantly advance the capabilities of LLMs and could vindicate the investments companies are making in gen AI. The public release of gen AI tools has quickly captured the attention of executives. It was easy to imagine how their organizations could use the technology. Quantifiable business value from gen AI has often been elusive, however. Challenges with data foundations, risk and governance policies, and talent gaps make it hard for companies to scale gen AI initiatives.18 Only 30% of gen AI pilots make it to full production.19 Lack of trust in gen AI output, and potential “real world” consequences from gen AI mistakes, give executives pause.20

Companies that develop and implement agentic AI need to consider the challenges of gen AI, plus the complexity of building bots that can reason, act, collaborate, and create. Most importantly, gen AI agents of all kinds need to be reliable for enterprises to use them: Getting the job right most of the time isn’t enough. There are some use cases and applications in late 2024 that show encouraging signs of being reliable enough for adoption in early 2025.

The potential payoff is worth the effort, however, and early results seem promising. Companies are learning how to boost LLM performance by combining these models with other AI technologies and training techniques. While autonomous and reliable agents are the goal, incremental increases in accuracy and independence could help companies reach their early productivity and efficiency goals for gen AI overall.21 With their range of applications—both horizontal and vertical—and clear business goals, agentic AI looks more like the gen AI solutions executives may have expected in the first place.

Gen AI agents explained: A closer look

Generative AI agents can break down a complex task into a series of steps, execute them, and work through unexpected barriers. They can sense their environment, which depending on the use case can be virtual, physical, or a combination of the two. To complete a task, agentic AI can determine which actions to take, recruit assistance from tools, databases, and other agents, and deliver results based on its goals set by humans.

Agentic AI is an emerging technology—and it continues to evolve—but it has some common characteristics and capabilities:

  • Built on foundation models: Foundation models like LLMs enable agentic AI to reason, analyze, and adapt to complex and unpredictable workflows. This makes it more flexible than RPA and expert systems. LLMs are improving rapidly, with enhanced reasoning and ability to break tasks into smaller steps among the most recent breakthroughs.22 But foundation models alone can’t interact with their environment, make decisions, or execute tasks.23 They must be augmented by other technologies and capabilities.
  • Acts autonomously: While the degree of autonomy varies, agentic AI can be trained to plan and execute complex tasks largely on its own. By introducing reasoning tokens, chain-of-thought models can solve more complex challenges than previous LLMs. Chain-of-thought models are slower to respond but are more deliberative in how they reason through a problem; they can correct their own errors; and they can show the steps they have taken to reach a solution.24
  • Senses the environment: Agentic AI can perceive the environment, process information, and understand the context of the tasks it is given.25 Advanced agentic AI can process multimodal data, such as videos, images, audio, text, and numbers.
  • Uses tools: Agentic AI interacts with tools and systems to complete tasks, such as software, enterprise applications, and the internet.
  • Orchestrates: Agentic AI can direct the participation of other systems and bots to complete a task. With multiagent systems, this means collaborating with other autonomous generative AI agents.
  • Accesses memory: LLMs are stateless: Each interaction is processed independently, and information is not retained when an interaction is complete. With the addition of retrieval mechanisms and databases, agentic AI can access short-term memory to maintain context while performing a specific task, and long-term memory to learn and improve from experience.26

Startups and big tech are developing multiagent gen AI systems, including tools that can help organizations build their own custom agents.

Some of the latest models employ chain-of-thought functions that, while slower and more deliberative than prior large-scale models, enable higher-order reasoning on complex problems.27 Multimodal data analysis can make agentic AI more flexible by expanding the kinds of data that can be interpreted and produced. Multimodal AI also shows that agentic AI can be even more powerful when combined with other kinds of AI technologies such as computer vision (image recognition), and transcription and translation.28 Like agents themselves, multimodal AI is still developing.

True multiagent systems, in which work is orchestrated among a network of autonomous agents, are being developed now, with some pilots being launched in late 2024.29 Multiagent models often outperform single-model systems by distributing tasks, especially in complex environments.30 Startups and big tech are developing multiagent gen AI systems, including tools that can help organizations build their own custom agents.31

Promising use cases for autonomous gen AI agents

Big tech companies and startups are developing early-stage solutions that can partially automate functions like software development, sales, marketing, and regulatory compliance. What follows is a snapshot of today’s examples, not an exhaustive list of applications. Some are based on proofs of concept and demos that are promising but are not ready for enterprise deployment. While these examples are cross-industry, industry-specific agentic applications are also emerging.

Customer support: Customer service is an essential—and often stressful—job, with an annual turnover rate of 38%.32 Effective automation of parts of the customer support workflow could reduce stress and tedium for staff, and help companies serve more customers.33 Agentic AI can handle more complex customer inquiries than today’s customer support chatbots, and they can act autonomously to resolve issues. In one example, an audio company is using agentic AI to help customers set up new equipment, a multistep process that usually requires a human agent. If a human agent is required, the agentic AI compiles relevant information and summarizes the issue before transferring the customer.34 The next wave of customer support agents will likely integrate multimodal data such as voice and video in addition to text-based chat.

Cybersecurity: Cybersecurity experts epitomize the shortage of skilled knowledge workers: Globally, there’s a shortfall of four million today.35 Meanwhile, malicious actors are using gen AI to infiltrate cybersecurity systems. Emerging agentic cybersecurity systems can make human experts more efficient by automating aspects of their work. They can autonomously detect attacks and generate reports, improving system security and reducing the workload of human experts by up to 90%.36 Agentic AI can also help software development teams detect vulnerabilities in new code. It can run tests and communicate directly with developers to explain how to fix a problem—something human engineers must do manually today.37

Regulatory compliance: Companies across industries, including financial services and healthcare, are required to conduct periodic regulatory compliance reviews. The increased size and complexity of relevant regulations, and the dearth of compliance professionals, makes compliance a growing challenge. Startups are developing agentic AI that can analyze regulations and corporate documents, and quickly determine whether the company is compliant. The agent can cite specific regulations, and proactively provide analysis and advice to human regulatory professionals.38 Companies that use gen AI today cite regulatory compliance as their top barrier to developing and deploying gen AI, ahead of issues like a lack of AI technical talent, and implementation challenges.39 Regulatory uncertainty plays a role, but so does the reach and complexity of new regulations. By helping companies understand and comply with regulations as they’re enacted, a more agentic AI solution could help accelerate wider gen AI adoption across enterprises.

Agent builders and orchestrators: Agentic AI solutions are emerging to help automate other cross-industry and vertical-specific workflows. Companies may not need to wait for the market, however. They can build their own agents and multiagent systems. With Google’s Vertex, companies can use no-code tools to create agents for specific tasks, such as building marketing collateral based on previous marketing campaigns.40 LangChain uses open-source technology to help companies construct multiagentic systems. For example, startup Paradigm has launched a “smart spreadsheet” in which multiple agentic AIs partner to collect data from diverse sources, structure it, and complete tasks.41

Bottom line

Agentic AI has enormous potential to help increase the productivity of knowledge workers by automating entire workflows and discrete tasks. Its ability to take independent action, as single agents or in concert with other agents, sets it apart from today’s chatbots and co-pilots. Yet, agentic AI is in the early stages of development and adoption. As impressive as early agentic examples may be, these agents can make mistakes and get stuck in loops. In multiagent systems, “hallucinations” can spread from one agent to another; they can persuade other agents to take the wrong steps and give incorrect answers.42 Although agentic AI can be mainly autonomous, often having a human review decisions after they’ve been made (also known as “human on the loop” rather than the more restrictive “human in the loop”) can make agentic AI more suitable for deployment today. When gen AI agents get stuck, they can consult human experts who help them resolve the challenge and move forward. In this model, agentic AI is like a junior employee who can learn by experience while performing valuable work.43

Because the vision for agentic AI is compelling and the technology is evolving rapidly, companies should prepare themselves now.

While some companies are investing billions to create consistent and reliable agentic AI, it’s not clear when this will happen, or under what circumstances. Will agentic AI reach widespread adoption in 2025, or within the next five years? Will ubiquity require breakthrough innovation, or tweaking current AI technologies and training methods? If the big companies and startups developing agentic AI are successful, the game will change quickly. Imagine autonomous gen AI agents that can process multimodal data, use tools, orchestrate other agents, remember and learn, and execute tasks consistently and reliably. Imagine further that custom agents can be quickly and easily developed by enterprises in “no-code environments” using just conversational text prompts.

Because the vision for agentic AI is compelling and the technology is evolving rapidly, companies should prepare themselves now. As they prepare, they should consider the following approaches.

Prioritize and redesign workflows for agentic AI: Consider which tasks and workflows are well-suited for agentic AI to execute, based on the technology’s capabilities and where the highest value is for your company. Redesign them to remove unnecessary steps. Ensure that agentic AI solutions have a clear goal, and access to the data, tools, and systems they will require. Although these agents can help other agents navigate their environment, cluttered and sub-optimized processes could deliver disappointing results. 

Focus on data governance and cybersecurity: For agentic AI to deliver value, it must have access to valuable and potentially sensitive enterprise data, as well as internal systems and external resources. Companies should put strong data governance and cybersecurity in place before getting started with autonomous generative AI agents. For gen AI early adopters, the top areas where they’re increasing IT investment are data management (75%) and cybersecurity (73%).44 Despite these investments, 58% are highly concerned about using sensitive data in models and managing data security. And only 23% say they’re highly prepared for managing gen AI risk and governance. In short, many of today’s gen AI leaders seem unprepared for the advent of agentic AI. If these leaders are not ready, companies that are still on the gen AI sidelines surely have further to go.

Balance risk and reward: When starting with agentic AI, companies should consider the level of autonomy and data access agents are permitted. Low risk use cases with non-critical data and human oversight can help companies build the data management, cybersecurity, and governance for safe agentic AI applications. Once these are in place, companies should consider higher value use cases that use strategic data, access to more tools, and more autonomy.

Maintain healthy skepticism: Agentic AI is evolving and will likely be more capable in the next year, and will be applied to more horizontal and vertical-specific use cases. Expect impressive demos, simulations, and product announcements throughout 2025. But the challenges we’ve noted may take some time to resolve. Until these challenges are addressed, agentic AI performance in controlled settings is unlikely to deliver improved enterprise performance. Evaluate and question carefully.

By

Jeff Loucks

United States

Gillian Crossan

United States

Baris Sarer

United States

China Widener

United States

Endnotes

  1. According to Deloitte’s State of Generative AI in the Enterprise survey, 23% of enterprises that currently use gen AI are exploring “gen AI agents” to a “large” or “very large” extent, with another 42% exploring it “to some extent.” Given the high interest in agentic AI and the products and services that are being launched by startups and established tech companies, we expect this interest to turn to action, at least on an experimental scale.

    View in Article
  2. CB Insights. Gen AI Investment Database, Aug 21, 2024. This data excludes Open AI. It includes funding to companies that are developing agentic AI with “varying degrees of autonomy.”

    View in Article
  3. Kate Clark, “Investors undaunted by spate of AI acqui-hires,” The Information, Aug. 19, 2024.

    View in Article
  4. Cambridge English Dictionary, “Agency,” accessed Aug. 26, 2024.

    View in Article
  5. Cambridge English Dictionary, “Autonomous,” accessed Aug. 26, 2024.

    View in Article
  6. For humans, agency and autonomy are moral and political concepts. In the context of gen AI agents, we are speaking only of the extent to which software-based technology has scope to design and perform tasks without human direction.

    View in Article
  7. Faruk Muratovic, Duncan Stewart, and Prashant Raman, “Tech companies lead the way on generative AI: Does code deserve the credit?” Deloitte Insights, Aug. 2, 2024.

    View in Article
  8. Scott Wu, “Introducing Devin, the first AI software engineer,” Cognition Software, March 12, 2024.

    View in Article
  9. Rina Diane Caballar, “AI Coding is going from copilot to autopilot,” IEEE Spectrum, April 9, 2024.

    View in Article
  10. Jenna Barron, “Codeium’s new Cortex assistant utilizes complex reasoning engine for coding help,” SD Times, Aug. 14, 2024; Aswin Ak, “OpenDevin: An artificial intelligence platform for the development of powerful AI agents that interact in similar ways to those of a human developer,” Marktechpost, July 28, 2024.

    View in Article
  11. Carl Franzen, “Codium announces Codiumate, a new AI agent that seeks to be Devin for enterprise software development,” VentureBeat, April 3, 2024.

    View in Article
  12. Cognition Software, “SWE-bench technical report,” March 15, 2024.

    View in Article
  13. Big tech companies continue to improve their software co-pilots to make them more like gen AI agents. For example, see: Alex Woodie, “The semi-autonomous agents of amazon Q,” BigDATAWire, May 3, 2024.

    View in Article
  14. Molly Talbert, “Overcoming disruption in a distributed world: Insights from the Anatomy of Work Index 2021,” Asana, January 14, 2024.

    View in Article
  15. Total factor productivity, which measures how efficiently both capital and labour are used, can be a proxy for knowledge worker efficiency. Knowledge work requires access to capital-intensive technology, and effectively designed processes.

    View in Article
  16. US Bureau of Labor Statistics, “Table A. Productivity, output, and inputs in the private nonfarm business and private business sectors for selected periods, 1987-2023,” March 3, 2024.

    View in Article
  17. Organisation for Economic Co-operation and Development, “Multifactor productivity,” accessed Oct. 30, 2024.

    View in Article
  18. Jim Rowan, Beena Ammanath, Costi Perricos, Brenna Sniderman, and David Jarvis, State of gen AI in the Enterprise, Q3 report, Deloitte, August 2024.

    View in Article
  19. Ibid.

    View in Article
  20. Ibid. 

    View in Article
  21. Ibid.

    View in Article
  22. James O’Donnell, “Why OpenAI’s new model is such a big deal,” MIT Technology Review, Sept. 17, 2024.

    View in Article
  23. Janakiram MSV, “AI agents: Key concepts and how they overcome LLM limitations,” The New Stack, June 11, 2024.

    View in Article
  24. “OpenAI, “Learning to Reason with LLMs,” Sept. 12, 2024.

    View in Article
  25. Anna Gutowska, “What are AI Agents?” IBM, July 3, 2024.

    View in Article
  26. Janakiram MSV, “AI agents: Key concepts and how they overcome LLM limitations.”

    View in Article
  27. Simon Willison, “Notes on OpenAI’s new o1 chain-of-thought models,” Simon Willison’s Blog, Sept. 12, 2024.

    View in Article
  28. Hamidou Dia, “So much more than gen AI: Meet all the other AI making AI agents possible,” Google Cloud Blog, Aug. 20, 2024.

    View in Article
  29. Vivek Kulkarni, Scott Holcomb, Prakul Sharma, Edward Van Buren and Caroline Ritter, “How AI agents are reshaping the future of work,” Deloitte AI Institute, November 2024.

    View in Article
  30. The Economist, “Today’s AI models are impressive. Teams of them will be formidable,” May 13, 2024.

    View in Article
  31. CB Insights, “The multi-agent AI outlook: Here’s what you need to know about the next major development in genAI,” Aug. 30, 2024.

    View in Article
  32. Mike Desmarais, “The call center burnout problem,” SQM Group, Feb. 24, 2023.

    View in Article
  33. It’s important to balance the work of human agents. When they get only the most complicated and difficult cases, it can lead to burnout. See Sue Cantrell, et al., “Strengthening the bonds of human and machine collaboration,” Deloitte Insights, Nov. 22, 2022.

    View in Article
  34. Sierra, “Sonos elevates the listener experience,” Feb. 13, 2024.

    View in Article
  35. Michelle Meineke, “The cybersecurity industry has an urgent talent shortage. Here’s how to plug the gap,” World Economic Forum, April 28, 2024.

    View in Article
  36. Ken Yeung, “Dropzone AI gets $16.85M for autonomous cybersecurity AI agents that reduce manual work by 90 percent,” VentureBeat, April 25, 2024.

    View in Article
  37. Simon Thomsen, “Software development cybersec startup Nullify banks $1.1 million pre-seed round,” Startup Daily, June 26, 2023.

    View in Article
  38. Kyt, Dotson, “Norm Ai raises $27M to help businesses handle regulatory compliance with AI agents,” SiliconANGLE, June 26, 2024.

    View in Article
  39. Rowan, State of Generative AI in the Enterprise, Q3 report.

    View in Article
  40. Ron Miller, “With Vertex AI Agent Builder, Google Cloud aims to simplify agent creation,” TechCrunch, April 9, 2024.

    View in Article
  41. Iris Coleman, “Paradigm utilizes LangChain and LangSmith for advanced AI-driven spreadsheets,” Blockchain.News, Sept. 5, 2024.

    View in Article
  42. The Economist, “Today’s AI models are impressive.”

    View in Article
  43. Maria Korolov, “AI agents will transform business processes — and magnify risks,” CIO, Aug. 21, 2024.

    View in Article
  44. Rowan, State of Generative AI in the Enterprise, Q3 report. 

    View in Article

Acknowledgments

Authors would like thank Chris Arkenberg, Duncan Stewart, and Ankit Dhameja.

Cover image by: Jaime Austin; Getty Images, Adobe Stock.

Copyright and legal information

This article contains general information and predictions only and Deloitte is not, by means of this article, rendering accounting, business, financial, investment, legal, tax, or other professional advice or services. This article is not a substitute for such professional advice or services, nor should it be used as a basis for any decision or action that may affect your business. Before making any decision or taking any action that may affect your business, you should consult a qualified professional advisor.

Deloitte shall not be responsible for any loss sustained by any person who relies on this article.

About Deloitte

Deloitte refers to one or more of Deloitte Touche Tohmatsu Limited, a UK private company limited by guarantee (“DTTL”), its network of member firms, and their related entities. DTTL and each of its member firms are legally separate and independent entities. DTTL (also referred to as “Deloitte Global”) does not provide services to clients. In the United States, Deloitte refers to one or more of the US member firms of DTTL, their related entities that operate using the “Deloitte” name in the United States and their respective affiliates. Certain services may not be available to attest clients under the rules and regulations of public accounting. Please see www.deloitte.com/about to learn more about our global network of member firms.