What Is an AI Agent? A Plain-English Explanation (With No Hype)

Everyone's talking about AI agents. Most of those people can't tell you what makes an agent an agent. Here's a clear definition, the real difference from chatbots, and what agents can and cannot actually do right now.

If you’ve spent five minutes reading AI news lately, you’ve seen the word “agent” twenty times. Agentic workflows. Agent frameworks. Autonomous agents. Multi-agent systems. AGI is three agents away.

Most of the people using the word can’t actually tell you what an agent is, because there isn’t one clean definition. I’ll give you mine, and then I’ll tell you what agents actually do, what they don’t do, and what you should know if you want to build one.

The simplest definition

An AI chatbot takes a message and returns a response. One in, one out.

An AI agent takes a goal and works toward it over multiple steps, usually by taking actions in the world (calling tools, running code, searching the web) and deciding what to do next based on what happens.

The key difference is action and iteration. A chatbot is a request-response box. An agent is a loop.

A concrete example

Imagine you ask an AI: “Find me the cheapest direct flight from Berlin to Tokyo next Saturday.”

A chatbot’s response: “I can’t browse the web, but typical prices are around…” followed by generic information. Not useful.

An agent’s response: [searches Google Flights] [filters direct flights] [sorts by price] [reads results] [confirms the cheapest option exists] “The cheapest direct flight is Lufthansa LH714 at €680 departing 13:45. Here’s the booking link.”

The agent did multiple things. It picked tools (a search engine), used them, read the results, decided the task was complete, and reported back.

That’s the thing people mean by “agent.” Everything else is decoration.

The loop, in slightly more detail

Every AI agent is running some version of this loop:

  1. Plan. Given the goal and what’s happened so far, decide the next action.
  2. Act. Take that action. This usually means calling a tool: a search, an API, a script, a database query.
  3. Observe. Read the result of the action.
  4. Decide. Is the goal achieved? If yes, stop. If no, go back to step 1 with new information.

That loop is the agent. A language model handles the planning and deciding. Ordinary code outside the model handles the acting and observing.

You don’t need a fancy framework to build this. You need a language model, a list of tools the model can call, and a loop that keeps running until the model says “I’m done.”
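The loop above can be sketched in a few lines. This is a minimal illustration, not any particular SDK: the `model` callable is a stand-in for whatever LLM API you use, and the `("call", ...)` / `("done", ...)` tuple format is invented here for clarity.

```python
# Minimal agent loop: plan, act, observe, decide. `model` is any callable
# that, given the goal and history, returns ("call", tool_name, args) or
# ("done", final_answer). A real agent would call an LLM API at that point.

def run_agent(model, tools, goal, max_steps=10):
    history = []                               # everything observed so far
    for _ in range(max_steps):
        decision = model(goal, history)        # plan + decide
        if decision[0] == "done":
            return decision[1]                 # goal achieved; stop
        _, name, args = decision
        result = tools[name](**args)           # act
        history.append((name, args, result))   # observe
    raise RuntimeError("step limit reached before the task finished")


# Example run with a scripted stand-in model (no API key needed):
def scripted_model(goal, history):
    if not history:
        return ("call", "search", {"query": goal})
    return ("done", "cheapest: " + history[-1][2])

tools = {"search": lambda query: "LH714 at 680 EUR"}
answer = run_agent(scripted_model, tools, "Berlin to Tokyo, Saturday")
```

The scripted model is only there to show the control flow; swapping it for a real API call is the whole job of "building an agent."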

What makes agents hard

The loop above looks simple. In practice, agents fail in specific ways that beginners don’t anticipate.

They lose track of the goal. Fifteen steps in, the agent has forgotten what it was originally trying to do. It’s now running some side-quest the planning step introduced. This is the single biggest agent failure mode. Frontier cloud models have gotten better at staying on track. Local models still fail at this constantly.

They can’t recover from errors. A tool returns an unexpected format. The agent doesn’t know what to do with that, tries the same thing again, gets the same error, and loops.
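One cheap defense against this is to remember which tool calls have already been made and refuse exact repeats. A sketch, with an argument-hashing scheme invented for illustration:

```python
# Break error loops by detecting an identical repeated tool call.
# `seen` accumulates (tool_name, sorted_args) keys across one agent run.

def is_repeat(seen, name, args):
    key = (name, tuple(sorted(args.items())))
    if key in seen:
        return True     # the agent is retrying the exact same thing
    seen.add(key)
    return False
```

When it returns True, instead of executing the tool again, feed the model a message like "you already tried this call and it failed; try a different approach."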

They hallucinate tool outputs. The model generates what a tool’s response “probably” looked like rather than reading the actual response. Rare in well-built agents, but it happens.

They over-commit to bad plans. The first plan the agent produces often isn’t the best plan. But once it’s running, it rarely reconsiders. This is why “let me think step by step first” prompts help.

They waste resources. An agent given a hard task will often keep trying approaches until you run out of API budget. Putting a step limit or cost budget in the loop is essential.
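That guard can be as small as a counter the loop charges once per iteration. A sketch, with placeholder limits and a dollar cost you’d read from your API’s usage metadata:

```python
class BudgetExceeded(RuntimeError):
    """Raised when the agent loop exceeds its step or dollar budget."""

class Budget:
    # Caps both step count and spend, whichever trips first.
    def __init__(self, max_steps=20, max_dollars=1.00):
        self.max_steps, self.max_dollars = max_steps, max_dollars
        self.steps, self.spent = 0, 0.0

    def charge(self, dollars):
        """Call once per loop iteration with that step's API cost."""
        self.steps += 1
        self.spent += dollars
        if self.steps > self.max_steps:
            raise BudgetExceeded(f"step limit {self.max_steps} hit")
        if self.spent > self.max_dollars:
            raise BudgetExceeded(f"spent ${self.spent:.2f} of ${self.max_dollars:.2f}")
```

Raising an exception, rather than returning a flag, is deliberate: a stuck agent should stop loudly, not keep going quietly.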

What agents can actually do well right now

  • Research tasks with clear goals and accessible tools. “Summarize what three competitors are charging for X.” “Find the top 10 papers on topic Y and extract the key findings.”
  • Repetitive data work. “For each of these 200 documents, extract the author, date, and main conclusion.”
  • Coding tasks within a known codebase. “Implement feature X following the pattern used in existing feature Y.”
  • Operational glue. “When a new ticket arrives, triage it and assign to the right team.”
  • Interactive workflows where a human reviews each step. This is where agents shine right now. Not “autonomous,” but “assisted.”

What agents cannot do well right now

  • Long-horizon creative work. “Write my novel” is going to fail or produce nothing you’d want.
  • Tasks with unclear success criteria. If you can’t tell the agent what “done” looks like, it can’t tell either.
  • Anything requiring real physical awareness. Agents can’t reliably check whether something in the real world happened the way they expect.
  • Tasks where hallucinated intermediate steps cause irreversible damage. Sending emails, making purchases, deleting files. Keep a human in the loop for anything destructive.
  • Truly autonomous multi-day operation. Despite the marketing, no agent runs unsupervised for a full day on a complex task and delivers good results. Supervision intervals need to be shorter than the agent’s reliability horizon, which is still on the order of minutes to hours, not days.

Frameworks: do you need one?

There are dozens of agent frameworks. LangChain. LangGraph. CrewAI. Semantic Kernel. AutoGen. Anthropic’s own SDK. OpenAI’s Agents SDK.

My honest opinion after using several of them: for most projects, you don’t need a framework. You need a few hundred lines of your own code. The loop I described above fits in 200 lines of Python.

Frameworks become useful when:

  • You have multiple agents that need to communicate with each other.
  • You have a complex tool ecosystem that benefits from standardized interfaces.
  • You’re in a team where standardization beats flexibility.
  • You need features like streaming, observability, or retry logic that you don’t want to build yourself.

For a solo builder making their first agent, skip the framework. Build the loop. You’ll understand agents properly and you’ll own every line of your code. Adopt a framework later if you actually hit its use case.

How to start building one

Here’s the minimum viable agent, conceptually:

1. Define your goal and success criteria.
2. Define the tools the agent can use (functions the model can call).
3. Loop:
   a. Ask the model: “Given the goal and conversation history, what tool should I call, or is the task done?”
   b. If done, return the final answer.
   c. If a tool call, execute it and append the result to the conversation history.
   d. Apply a step limit or budget limit to prevent runaway loops.

Every agent framework is a variation on this. Build this first, in plain code, before reaching for a library.
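For step 2, one common pattern is a registry of plain functions whose names and descriptions get shown to the model. Everything below is made up for illustration: the decorator, the tool name, and the placeholder flight data.

```python
# Tools as plain functions in a registry. The loop dispatches the model's
# tool calls by name; the descriptions are what the model sees.
TOOLS = {}

def tool(name, description):
    """Decorator that registers a function as a callable agent tool."""
    def wrap(fn):
        TOOLS[name] = {"fn": fn, "description": description}
        return fn
    return wrap

@tool("search_flights", "Search direct flights between two cities on a date.")
def search_flights(origin, destination, date):
    # Placeholder data; a real tool would query a flights API here.
    return [{"airline": "LH", "flight": "LH714", "price_eur": 680}]

def call_tool(name, **args):
    """Dispatch a model-requested call. Unknown names return an error string
    the model can read and recover from, rather than crashing the loop."""
    if name not in TOOLS:
        return f"error: unknown tool '{name}'"
    return TOOLS[name]["fn"](**args)
```

Returning errors as strings instead of raising matters here: the model can only react to what lands in the conversation history.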

The hype-check

When someone sells you “AI agents” in 2026, ask them:

  1. What’s the agent’s reliability horizon? (How long can it run before needing supervision?)
  2. What’s the failure mode when it breaks? (Silently wrong? Crashes? Asks for help?)
  3. How do you keep it from wasting budget on stuck loops?
  4. What tasks specifically is it good at, and what tasks does it fail at?

If they can’t answer those, they’re selling you a demo, not a product.

Final thought

Agents are real, agents are useful, and agents are going to keep getting better. They’re also, right now, much less autonomous than the marketing implies.

The right framing for 2026: agents are a way to let AI take multiple steps instead of one. That’s a real upgrade over chatbots. It’s not yet “hire an AI employee and it runs your business.” Adjust your expectations accordingly and you’ll get value out of them without being disappointed.