Stars
Repository of Benchmarking and Improving Large Vision-Language Models for Fundamental Visual Graph Understanding and Reasoning
A one-stop repository for generative AI research updates, interview resources, notebooks, and much more!
Fara-7B: An Efficient Agentic Model for Computer Use
[ACL 2024] Advancement in Graph Understanding: A Multimodal Benchmark and Fine-Tuning of Vision-Language Models
Fully Local Manus AI. No APIs, no $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…
Repository for the BioCLIP 2 model project. [NeurIPS'25 Spotlight]
A public repository studying how we can leverage LLMs as tools to improve our academic writing.
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
This repository serves as a comprehensive knowledge hub, curating cutting-edge research papers and developments across 25+ specialized domains
[NAACL 2021] QAGNN: Question Answering using Language Models and Knowledge Graphs 🤖
SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.
An Illusion of Progress? Assessing the Current State of Web Agents
An example starter repo for Python projects
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
A curated list of retrieval-augmented generation (RAG) in large language models
[Paper List] Papers integrating knowledge graphs (KGs) and large language models (LLMs)
[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
A simple screen parsing tool towards a pure vision-based GUI agent
Fully open reproduction of DeepSeek-R1
Building a comprehensive and handy list of papers for GUI agents



