Skip to content
@logic-star-ai

logic-star-ai

Popular repositories Loading

  1. swt-bench swt-bench Public

    [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation

    Python 51 7

  2. baxbench baxbench Public

    Python 47 7

  3. insights insights Public

    We track and analyze the activity and performance of autonomous code agents in the wild

    TypeScript 25

  4. SWEBench SWEBench Public

    Forked from SWE-bench/SWE-bench

    SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

    Python

  5. tests tests Public

Repositories

Showing 5 of 5 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…