Tag: Consultation Inputs
Like the “EU AI Act-related Consultations” tag, outputs under this tag are responses to an institution’s public call for input. However, these consultations are NOT related to the EU AI Act.
-
NIST AI 800-2: Our Recommendations on Benchmark Lifecycle and Deprecation
We submitted feedback to NIST on its draft AI 800-2 on Best Practices for Automated Benchmark Evaluations. We recommend that the draft treat benchmarks as active tools requiring lifecycle management, not static instruments. Our submission covers deprecation criteria, versioning, saturation, annotation quality, semantic drift, and the risks of relying on popular but flawed benchmarks. Our…
