Valideval - Search News

Foundation Model Evaluations Library

fmeval is a library to evaluate Large Language Models (LLMs) in order to help select the best LLM for your use case. The library evaluates LLMs for the following tasks: Open-ended generation - The ...

GitHub

microsoft/a11y-llm-eval

LLMs currently generate code with accessibility bugs, resulting in blockers for people with disabilities and costly re-work and fixes downstream.
