Applied AI Engineer

I make AI agents reliable, secure and governed in production.

Most AI demos work once. The hard part is the system that still works on the thousandth call — under adversarial input, on real data, where a wrong answer has a cost. That layer is what I build, break, and harden.

See the work

Get in touch

The layer that decides if it ships

Reliability, security, governance

Reliability

A demo proves it can work once. I build the evals, guardrails, and retries that prove it keeps working — on real inputs, at volume, when it matters.

Security

AI agents are a fresh attack surface: prompt injection, tool abuse, data exfiltration. I red-team systems the way an attacker would, then close the gaps.

Governance

Regulated industries can't ship a black box. I make agents auditable and explainable — so a bank, insurer, or regulator can trust what they do.

Built to prove, not to promise

We ship working AI systems you can test yourself. No testimonials from imaginary clients — just real capability you can verify.

Eight open-source AI systems, shipped with 400+ automated tests — built solo with AI-native tooling. Read every line on GitHub.

Open-source portfolio

github.com/Jbermingham1

My focus is where AI agents fail in production — prompt injection, tool abuse, data leakage. I document how systems break and how to harden them, in the open.

AI-security practice

Reliability · Security · Governance

No client logos. No invented case studies. The proof is code you can run and failures I can show you.

How I earn trust

Verify everything

Building something that can't afford to get AI wrong?

If you're deploying AI agents where reliability, security, or compliance actually matter, I'd like to hear about it. A plain conversation — no pitch.

Get in touch

Australia-based. Focused on regulated industries.