Question 1

What is AI or LLM pentesting?

Accepted Answer

It means putting your artificial intelligence systems to the test by attacking them as a real adversary would: tricking the model with hidden instructions, getting data out of it that it should not give and, if it has tools connected, making it carry out actions for you. Unlike classic pentesting, here you do not break in by force, you convince the model.

Question 2

Is this the same as an AI security audit?

Accepted Answer

Yes, it is its offensive version. An AI security audit reviews and checks; we go further and attack: we do ethical hacking on your artificial intelligence to prove with evidence which flaws are really exploited. The name changes depending on who asks for it, AI pentesting, security audit or AI red teaming, but the work is the same: putting it to the test as a real adversary would.

Question 3

What is a prompt injection?

Accepted Answer

It is the flagship technique against LLM: slipping in instructions that the model obeys as if they came from its owner. It can be direct, in what the user writes, or indirect, hidden in a document, a website or an email that the model reads. With it, the model is made to ignore its rules, reveal data or misuse its tools.

Question 4

Do you test AI agents and MCP servers?

Accepted Answer

Yes, and it is one of the most important things right now. When AI does not just respond but acts, using tools, calling APIs or connecting through MCP, a flaw stops being an improper response and becomes a real action in your systems. We test tool abuse, agents with too many permissions and the security of MCP servers.

Question 5

What is RAG and why is it attacked?

Accepted Answer

RAG is when the model responds by reading your documents or your knowledge base. If an attacker manages to insert content into that source, they manipulate what the model retrieves and, with it, what it responds. We test source poisoning and data leakage through the retrieval path.

Question 6

Do you follow any reference framework?

Accepted Answer

Yes. We rely on the OWASP Top 10 for LLM and on MITRE ATLAS, which are the reference catalogs of attack techniques against AI. They give us a common base, but the interesting part is usually in how your specific system fits together.

Question 7

Is it useful for the AI Act or ISO 42001?

Accepted Answer

Yes, and it is one of its biggest advantages. The findings prove real risks in your AI systems, so they count as evidence for your AI Act compliance and for your ISO 42001. The same AI you govern with those standards, here you put to the test: governance and attack complement each other.

Question 8

How does it differ from a normal application pentest?

Accepted Answer

An application with AI is also an application, so application pentesting covers its classic part and this one handles what is specific to the model. And since what we find proves real risks, it becomes evidence for your AI Act and your ISO 42001: the same work governs and attacks your AI.

And what we uncover here, with Sondriva, our SOC, we monitor afterwards: we detect abuse attempts against your AI in real time, while your team closes the flaws.

Question 9

Is it safe to do this on an AI in production?

Accepted Answer

We agree on it beforehand and work carefully, just as in any test. When there is a risk of affecting real data or operations, we use an equivalent environment. The priority is to prove the flaw without causing harm.

AI and LLM pentesting: we put your artificial intelligence to the test like an attacker

The attacker no longer breaks in: they convince

What we put to the test

LLM applications

RAG and knowledge bases

Agents, tools and MCP

Few govern your AI. Even fewer attack it

When you need to put your AI to the test

Before putting it into production

Your AI touches data or systems

The AI Act or ISO 42001 applies to you

You use agents or MCP

How we work

Scope and rules

Attack

Findings with proof

Verification

It does not end with the report

Frequently asked questions