Skip to content
View smashfan's full-sized avatar

Block or report smashfan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

【三年面试五年模拟】AIGC算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、AI Agent、机器学习、计算机视觉、自然语言处理、强化学习、大数据挖掘、具身智能、元宇宙、AGI等AI行业面试笔试干货经验与核心知识。

2,489 285 Updated Nov 9, 2025

Ongoing research training transformer models at scale

Python 14,143 3,257 Updated Nov 7, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 48,390 3,995 Updated Nov 6, 2025

AI-powered ESG report generator for detailed, accurate, and customized ESG reports.

Python 2 2 Updated Jun 10, 2024

Question-and-Answer app to explore ESG reports using RAG-LLM

Jupyter Notebook 4 1 Updated Nov 5, 2023

Comprehensive ESG dataset from Chinese listed companies' reports. Contains 8,467 annotated sentences with 36 topic labels and 2 quality labels. Ideal for evaluating ESG report completeness, automat…

28 4 Updated Jun 6, 2024

使用 Qwen2ForSequenceClassification 简单实现文本分类任务。

Python 84 7 Updated Jun 12, 2024

TianGong-AI-Unstructure

Python 69 36 Updated Oct 7, 2025

Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embeddings. Compatible with 🤗 transformers.

Python 62 8 Updated Dec 12, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,607 132 Updated Jan 24, 2025
Jsonnet 346 50 Updated May 2, 2024

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 67,361 7,187 Updated Nov 10, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,138 7,517 Updated Nov 6, 2025

Question and Answer based on Anything.

Python 13,726 1,326 Updated Mar 24, 2025

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Python 548 79 Updated Nov 5, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 59,297 7,209 Updated Oct 4, 2025

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,583 586 Updated Oct 24, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 500 45 Updated Oct 20, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,068 219 Updated Aug 17, 2024
Python 626 57 Updated Jul 31, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,344 809 Updated Nov 9, 2025

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

Python 1,923 240 Updated Jul 2, 2025

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

3,255 225 Updated Sep 22, 2025

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 17,594 2,457 Updated Nov 6, 2025

Integrating ONgDB database into langchain ecosystem

Python 76 7 Updated May 10, 2023

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python 7,176 568 Updated Jul 15, 2025

A series of large language models developed by Baichuan Intelligent Technology

Python 4,122 294 Updated Nov 8, 2024

High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.

Python 703 70 Updated Dec 30, 2024

Home of StarCoder: fine-tuning & inference!

Python 7,472 529 Updated Feb 27, 2024

SoTA LLM for converting natural language questions to SQL queries

Jupyter Notebook 3,936 266 Updated May 23, 2024
Next