1 article in this category
IBM and University of Notre Dame released 105 validated benchmark cards and a dataset of 4,000 cards to improve LLM evaluation transparency.