JavelinGuard: Low-Cost Transformer Architectures for LLM Security
Compact, stateless guardrail models (~110M to 450M params) for single-request analysis. Over 95% F1 on prompt injection at sub-100ms latency.
Read on arXiv →| Model | Avg latency (CPU) | Size (params) | Injection F1 | Edge |
|---|---|---|---|---|
| Pulse | ~47ms | ~450M | ~95.3% | Excellent |
| DeBERTa-v3-base (tuned) | ~80-150ms | ~184M | ~93-95% | Excellent |
| LlamaGuard (7B) | ~500-1000ms+ | ~7B | High (context-dependent) | Poor |
| Large commercial LLM API (e.g. GPT-4) | ~1000-3000ms+ | Billions+ | High (slow, costly) | Not applicable |








