Break the Model.
Before Someone Else Does.
Noctis Research operates at the frontier of adversarial AI. We red team large language models, agentic systems, and AI pipelines — finding the failures your safety evals never will.
Systematic adversarial prompting, multi-turn manipulation, and roleplay-based bypass against safety-tuned models — including closed-weight frontier systems.
Indirect prompt injection via tool outputs, web content, and memory stores — escalating to full agent goal hijack, credential theft, and unauthorised action execution.
End-to-end red teaming of autonomous AI agents — tool misuse, context poisoning, reward hacking, and privilege escalation across multi-agent orchestration frameworks.
Membership inference attacks, training data reconstruction, and system prompt exfiltration — quantifying what your model leaks about proprietary data and instructions.
Backdoor injection via poisoned fine-tuning datasets, adversarial model weights, and compromised adapters — validating your AI supply chain from base model to deployment.
Image-embedded instruction bypass, audio adversarial inputs, and RAG knowledge-base poisoning — attacking the full input surface of production AI systems.
- Automated jailbreak scanning — 5 model endpoints
- Prompt injection surface enumeration
- Monthly adversarial prompt library — 840+ templates
- Safety classifier bypass report
- Risk-tiered findings export — PDF + JSON
- API access — 50,000 req/month
- Email & Signal support — 48h SLA
- Agentic pipeline red team
- Bespoke attack development
- Dedicated researcher
- Everything in Pro
- Unlimited model endpoint scanning — stealth mode
- Agentic pipeline red team — LangChain, AutoGen, CrewAI
- RAG & retrieval poisoning assessment
- Multimodal attack surface coverage
- Weekly findings briefings — researcher-narrated
- API access — 500,000 req/month + webhooks
- Red team scoping sessions — 2/quarter
- Priority support — 8h SLA — Signal & secure onion
- Bespoke exploit & bypass development
- Everything in Business
- Dedicated AI red team cell — embedded researchers
- Full fine-tune & supply chain poisoning assessment
- Bespoke jailbreak & bypass development
- Model inversion & data exfiltration quantification
- Pre-launch safety evaluation — regulatory alignment
- 24/7 incident response retainer — AI misuse events
- Quarterly adversarial simulation — board-level briefing
- EU AI Act, NIST AI RMF, ISO 42001 compliance mapping
- Unlimited API — on-prem or air-gapped deployment
AI red team engagements, pre-launch safety evaluations, and vulnerability disclosures are handled through encrypted communications only. All findings are governed by our responsible disclosure policy. Our PGP key is available on the keyserver.