How long does an AI/ML penetration test take?

A focused LLM application assessment typically takes 2-3 weeks. Engagements that include training pipelines, multiple agents, or MCP integrations usually run 3-5 weeks depending on scope.

What is in scope for an AI/ML test?

Scope can include hosted LLM applications, fine-tuned or self-hosted models, RAG pipelines, agent frameworks, MCP servers, training and data pipelines, and the surrounding APIs and infrastructure.

Do you test for prompt injection and jailbreaks specifically?

Yes. We perform direct and indirect prompt injection, system-prompt extraction, tool-call hijacking, and policy-bypass jailbreaks aligned with OWASP LLM Top 10.

Does this help with EU AI Act or NIST AI RMF requirements?

Yes. Findings are mapped to NIST AI RMF functions and EU AI Act control expectations, and can feed directly into your AI governance and risk documentation.

Do you offer retests after we fix issues?

Yes. A free retest of remediated findings is included within 30 days of the final report to verify that fixes are effective and do not introduce new issues.

AI/ML Penetration Testing

Adversarial security testing of AI systems, large language models, and machine learning pipelines against prompt injection, model theft, data poisoning, and agentic abuse.

// Overview

Service Overview

AI and machine learning systems introduce a new class of vulnerabilities that traditional application testing does not address. Our AI/ML penetration testing aligns with the OWASP Top 10 for LLM Applications, OWASP Machine Learning Security Top 10, NIST AI RMF, and MITRE ATLAS to evaluate model behavior, training pipelines, inference endpoints, agent frameworks, and MCP integrations. We assess adversarial evasion, prompt injection, model extraction, training-data poisoning, and supply-chain risk across the full ML lifecycle.

ai-ml-pentest--scan

$ armour --module ai-ml-pentest

[*] Loading AI/ML Penetration Testing module...

[*] 14 tools available

[!] 6-phase methodology loaded

[+] Ready for engagement

[+] Deliverables: 8 items

$ _

// Methodology

Our Approach

Model & System Reconnaissance

Identify model architectures, hosting endpoints, training data sources, third-party model dependencies, and integrated agent or MCP tooling.

Input Attack Surface Mapping

Enumerate all prompt entry points, retrieval-augmented generation flows, tool-calling boundaries, and trust transitions between user, system, and tool contexts.

Evasion & Poisoning Testing

Craft adversarial inputs, jailbreaks, and indirect prompt injection payloads, and evaluate exposure of training and fine-tuning pipelines to data poisoning.

Model Extraction & Inversion

Test for model theft via query-based extraction, membership inference, and training-data reconstruction against confidentiality of proprietary models.

Supply-Chain Assessment

Review model registries, pretrained weights, datasets, Python dependencies, and inference container images for tampering and known-vulnerable components.

AI Governance Review

Map controls against NIST AI RMF, ISO/IEC 42001, and EU AI Act expectations including logging, human oversight, and abuse monitoring.

// Arsenal

Tools & Technologies

Garak

PyRIT

Adversarial Robustness Toolbox

Counterfit

Promptfoo

LLM Guard

Burp Suite Pro

Custom Prompt-Injection Harnesses

Hugging Face Transformers

PyTorch

LangChain

MCP Inspector

Semgrep

Trivy

// the process

Our Assessment Lifecycle

Six phases, end to end — from first recon to verified remediation.

PHASE 01 / 06

Discover

We inventory AI/ML assets — hosted models, APIs, agent frameworks, MCP integrations, training pipelines, and data sources — and build a threat model specific to your AI system architecture.

Deliverables

AI/ML threat model and attack surface map
OWASP LLM Top 10 coverage matrix
OWASP ML Top 10 findings report
Prompt injection and jailbreak evidence
Model extraction and inversion results
Training and supply-chain risk register
Guardrail and monitoring recommendations
Re-test verification after remediation

Industries Served

SaaS

FinTech

Healthcare

Government

Legal

Education

E-Commerce

Defense

Key Benefits

LLM-Specific Coverage

Testing aligned to OWASP LLM Top 10 and MITRE ATLAS rather than generic web testing repurposed for AI.

Protect Proprietary Models

Identify model extraction, inversion, and membership-inference exposure before competitors or attackers exploit them.

Agent & MCP Hardening

Validate tool-calling boundaries, MCP server trust, and agentic workflows against prompt-injection-driven abuse.

Data Pipeline Assurance

Surface poisoning and integrity risks across training, fine-tuning, and retrieval data sources.

Regulatory Alignment

Map findings to NIST AI RMF, ISO/IEC 42001, and EU AI Act control expectations for AI risk management.

Actionable Guardrails

Concrete recommendations for input filtering, output validation, rate limiting, and abuse monitoring instead of generic advice.

// FAQ

Frequently Asked Questions

Common questions about our services, methodology, and engagement process.

Ready to Get Started?

Contact our team to discuss your security requirements and receive a customized proposal.

Request Assessment View All Services