Eval Factsheets

A web-based tool for generating standardized LaTeX/Markdown/Yaml/CSV evaluation cards for AI/ML model assessments. This tool helps researchers and practitioners create consistent, comprehensive documentation of their evaluation methodologies.

Quick Start

Try it now: https://facebookresearch.github.io/EvalFactsheets

Link to the paper: https://arxiv.org/abs/2512.04062

What are Eval Factsheets?

Eval Factsheets provide standardized documentation for AI model evaluations, similar to how Model Cards document model development. They help ensure transparency and reproducibility in evaluation practices by capturing:

Evaluation Context and Scope
Evaluation Structure and Method
Evaluation Alignment

Features

Interactive Form Interface: Easy-to-use web form for inputting evaluation details.
Interactive Database: Explore the csv database with a simple interface.
Multiple Export Options:
- Copy to clipboard with one click
- Download as .tex file
- Direct integration with evaluationcard LaTeX package
Form Validation: Ensures all required fields are completed
Responsive Design: Works seamlessly on desktop and mobile devices
No Installation Required: Browser-based tool, no dependencies needed

How to Use

Basic Usage

Navigate to the tool: Visit https://facebookresearch.github.io/EvalFactsheets
Fill in the form: Enter your evaluation details in the provided fields
Generate LaTeX: Click the "Generate LaTeX" button
Export your card:
- Use "Copy to Clipboard" for quick pasting
- Click "Download .tex" to save the file locally

Using with LaTeX

Once you have your generated .tex file:

\documentclass{article}
\usepackage{evaluationcard}

\begin{document}
\input{your-evaluation-card.tex}
\end{document}

Note: You'll need the evaluationcard.sty LaTeX package in your folder.

Use Cases

Research Papers: Document evaluation methodology for academic publications
Model Development: Track evaluation procedures during model iterations
Team Collaboration: Share standardized evaluation details across teams
Reproducibility: Provide clear documentation for others to replicate evaluations

Contributing

We welcome contributions! Here's how you can help:

Contributor License Agreement ("CLA")

In order to accept your pull request, we need you to submit a CLA. You only need to do this once to work on any of Meta's open source projects.

Complete your CLA here: https://code.facebook.com/cla

Contributing to the EvalFactSheets Database

Just fill the form, generate the code, then copy the csv line and click on the "Add to GitHub" button. It will open the evaluation_cards_database.csv file that you can edit to add your csv line at the end of the file. Then, we will review the pull request and let you know if there is any issues.

Reporting Issues

If you find a bug or have a feature request:

Check if it's already reported in Issues
If not, create a new issue with:
- Clear description of the problem/feature
- Steps to reproduce (for bugs)
- Expected vs actual behavior
- Screenshots if applicable

Pull Requests

Fork the repository
Create a feature branch: git checkout -b feature/your-feature-name
Make your changes
Test thoroughly
Commit with clear messages: git commit -m "Add: feature description"
Push to your fork: git push origin feature/your-feature-name
Open a Pull Request with:
- Description of changes
- Any related issue numbers
- Screenshots/examples if applicable

Development Guidelines

Keep code clean and well-commented
Maintain the existing code style
Test across different browsers
Update documentation for new features

DISCLAIMER

Some items in the csv database were generated using agentic tools. Even after human reviews, it's still possible that some mistakes might be there. We will be performing an ongoing monitoring to make the database truthful. If you see any error in one of the benchmark, please make a pull request.

License

This project is licensed under the cc-by-nc License - see the LICENSE.md file for details.

Citation: If you use this tool in your research, please cite:

@misc{bordes2025evalfactsheetsstructuredframework,
      title={Eval Factsheets: A Structured Framework for Documenting AI Evaluations}, 
      author={Florian Bordes and Candace Ross and Justine T Kao and Evangelia Spiliopoulou and Adina Williams},
      year={2025},
      eprint={2512.04062},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2512.04062}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
README.md		README.md
evaluation_cards_database.csv		evaluation_cards_database.csv
evaluationcard.sty		evaluationcard.sty
index.html		index.html
main.css		main.css
main.js		main.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Eval Factsheets

Quick Start

What are Eval Factsheets?

Features

How to Use

Basic Usage

Using with LaTeX

Use Cases

Contributing

Contributor License Agreement ("CLA")

Contributing to the EvalFactSheets Database

Reporting Issues

Pull Requests

Development Guidelines

DISCLAIMER

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

facebookresearch/EvalFactsheets

Folders and files

Latest commit

History

Repository files navigation

Eval Factsheets

Quick Start

What are Eval Factsheets?

Features

How to Use

Basic Usage

Using with LaTeX

Use Cases

Contributing

Contributor License Agreement ("CLA")

Contributing to the EvalFactSheets Database

Reporting Issues

Pull Requests

Development Guidelines

DISCLAIMER

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages