Lists (4)
Sort Name ascending (A-Z)
Stars
A benchmark for LLMs on complicated tasks in the terminal
RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of autonomous task-solving. An open alternative to Claude-Code.
Repo for the Claude Code Marketplace to use with the Claude for Life Sciences Launch. This will continue to host the marketplace.json long-term, but not the actual MCP servers.
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools lik…
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
A visual playground for agentic workflows: Iterate over your agents 10x faster
MinusX is an AI Data Analyst that you can add to your Metabase. It helps you ask business questions to your dashboards and dig deeper through followups.
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
Ultimate Context Engineering Infrastructure, starting from MCPs and Integrations
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
a generalist algorithm for cellular segmentation with human-in-the-loop capabilities
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
An AI agent system for solving International Mathematical Olympiad (IMO) problems using Google's Gemini, OpenAI, and XAI APIs.
SWE-bench: Can Language Models Resolve Real-world Github Issues?
An open-source AI agent that lives in your terminal.
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
Biomni: a general-purpose biomedical AI agent
(ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
nuclei.io: Human-in-the-loop active learning framework for pathology image analysis
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Copilot Chat extension for VS Code
A napari plugin to process and analyse images with chatGPT!