We broke AI guardrails down to six categories.
We curated datasets and models that demonstrate the state of AI safety using LLMs and other open source models.
| Developer | Model | Latency | Metric |
|---|---|---|---|
| Guardrails AI | ProvenanceLLM | 4.2651 ms | 0.7662 |
| Bespoke Labs | Minicheck | 0.6898 ms | 0.7516 |
| Microsoft | Detect Groundedness | 0.5270 ms | 0.6478 |
| Vectara | Hallucination Evaluation Model | 0.5537 ms | 0.6318 |
| Grounded Generation | N/A | 0.4961 |
| Developer | Samples |
|---|---|
| Intrinsic Entity Error | 128 |
| Intrinsic Predicate Error | 116 |
| Extrinsic Entity Error | 115 |
| Coreference Error | 98 |
| Intrinsic Circumstance Error | 82 |
| Extrinsic Circumstance Error | 78 |
| Extrinsic Predicate Error | 76 |