We broke AI guardrails down to six categories.
We curated datasets and models that demonstrate the state of AI safety using LLMs and other open source models.
Developer | Model | Latency | Metric |
---|---|---|---|
Guardrails AI | ProvenanceLLM | 4.2651 ms | 0.7662 |
Bespoke Labs | Minicheck | 0.6898 ms | 0.7516 |
Microsoft | Detect Groundedness | 0.5270 ms | 0.6478 |
Vectara | Hallucination Evaluation Model | 0.5537 ms | 0.6318 |
Grounded Generation | N/A | 0.4961 |
Developer | Samples |
---|---|
Intrinsic Entity Error | 128 |
Intrinsic Predicate Error | 116 |
Extrinsic Entity Error | 115 |
Coreference Error | 98 |
Intrinsic Circumstance Error | 82 |
Extrinsic Circumstance Error | 78 |
Extrinsic Predicate Error | 76 |