Tag
#guardrails
6 posts tagged guardrails.
- guardrails
Choosing an LLM Guardrail: Llama Guard, NeMo Guardrails, Guardrails AI
A decision guide for picking an LLM guardrail in 2026 — Meta's Llama Guard 4, NVIDIA's NeMo Guardrails, and Guardrails AI. What each one actually is, and which shape fits your problem.
- guardrails
LLM Guard: Input and Output Scanning for Production LLM Apps
A practical breakdown of LLM Guard by Protect AI — its input and output scanners, how the sanitize/scan pipeline works, where it fits as a runtime guardrail, and its real limits.
- evaluation
False Positive Cost in Refusal Systems: Measure and Tune
Practical methods for quantifying the cost of refusal false positives in LLM products — eval design, baseline rates, threshold tuning, and the regression suite you need to keep them stable.
- tools
Best LLM Security Scanners: Open-Source and Enterprise Compared
A practitioner's comparison of the best LLM security scanners — Garak, PyRIT, LLM Guard, Promptfoo, Vigil, and enterprise options. Coverage, CI/CD fit, and runtime use cases.
- guardrails
Classifier-on-Output: Catching Misbehavior Post-Generation
How production teams use post-generation classifiers to catch what input filters and refusal training miss — architectures, tradeoffs, and where output classifiers earn their latency budget.
- guardrails
Llama Guard vs NeMo vs OpenAI Moderation: Production Tradeoffs
A practitioner comparison of Llama Guard, NeMo Guardrails, and the OpenAI Moderation API — coverage, latency, customization, and where each one breaks in production.