All official submissions to the NoveltyBench leaderboard are maintained at novelty-bench/novelty-bench

Submit to NoveltyBench Leaderboard

If you are interested in submitting your model to the NoveltyBench Leaderboard, please do the following:

  1. Fork the official repository;
  2. Clone your fork;
  3. Under evaluation/, create a new folder with the submission date and your model name (e.g., 2025-03-27_gemini-1.5-pro);
  4. Within the folder (evaluation/<date + name>/), please include the following required assets:
    • Follow the instruction in the README of the official repository to get the following files for each subset NB-Curated and NB-WildChat:
      • generations.jsonl
      • partitions.jsonl
      • scores.jsonl
      • summary.json
    • Put your scores.jsonl and summary.json under the folder. You final folder should look like:
    • - evaluation/
          - <date + name>/
              - nb-curated/
                  - scores.jsonl
                  - summary.json
              - nb-wildchat/
                  - scores.jsonl
                  - summary.json
  5. Create a pull request to this repository with the new folder.

The NoveltyBench team will:

  • Review and merge your submission
  • Update the leaderboard with your results