
Open-source unified multimodal model

Python 5,508 481 Updated Oct 27, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,900 4,114 Updated Dec 23, 2025

🚀🚀 Train a 26M-parameter GPT from scratch in just 2h! 🌏

Python 36,135 4,266 Updated Dec 24, 2025

Building a MoE large model as an individual: a complete walkthrough from pre-training to DPO

Python 2,130 160 Updated Dec 16, 2025

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,089 141 Updated Dec 18, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 6,989 603 Updated Jul 4, 2025

Train transformer language models with reinforcement learning.

Python 16,781 2,374 Updated Dec 24, 2025

[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine"

Python 393 27 Updated Jul 11, 2025

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

1,027 57 Updated Aug 17, 2025

Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision understanding | LoRA & PEFT support.

Python 145 20 Updated Feb 7, 2025

A minimal PyTorch re-implementation of Qwen3 VL with a fancy CLI

Python 292 16 Updated Dec 2, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,392 1,457 Updated Nov 28, 2025

"A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe

Jupyter Notebook 4,205 417 Updated Dec 2, 2025

A comprehensive guide to understanding and building Agentic AI systems for beginners

11 5 Updated Mar 24, 2025

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 47,545 16,367 Updated Dec 25, 2025

happy-llm hands-on exercises: Colab version, pynb format, free GPU

Python 60 11 Updated Nov 12, 2025

Happy experimenting with MLLM and LLM models!

Jupyter Notebook 128 30 Updated Oct 16, 2024

📚 A from-scratch tutorial on the principles and practice of large language models

Jupyter Notebook 23,210 2,111 Updated Dec 25, 2025

✔ (Completed) The most comprehensive deep learning notes [Tudui: PyTorch] [Mu Li: Dive into Deep Learning] [Andrew Ng: Deep Learning] [Dafei: LLM Agents]

Jupyter Notebook 15,552 1,806 Updated Dec 18, 2025

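A ChatGLM web page built with FastAPI and Vue3 (front-end styling modeled on chatgpt-web; supports ChatGLM streaming output, front-end parameter adjustment, context selection, image saving, knowledge-base Q&A, and more)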

Vue 469 76 Updated Jul 16, 2023

A manually refined Chinese dialogue dataset and a ChatGLM fine-tuning script

Jupyter Notebook 1,197 97 Updated May 3, 2025

Repo for Chinese Medical ChatGLM: instruction fine-tuning of ChatGLM on Chinese medical knowledge

Python 1,027 160 Updated May 19, 2023

Automatic question answering over a local knowledge base, built on LangChain and LLMs such as the ChatGLM-6B series

Python 3,295 494 Updated Apr 15, 2024

Fine-tuning the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models for specific downstream tasks, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more

Python 2,779 316 Updated Dec 12, 2023

Efficient fine-tuning of ChatGLM-6B with PEFT

Python 3,731 475 Updated Oct 12, 2023

A Chinese financial trading framework based on multi-agent LLMs - TradingAgents Chinese enhanced edition

Python 14,149 3,110 Updated Nov 24, 2025

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 26,985 5,109 Updated Oct 9, 2025

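A decision-tree search tool that optimizes for lift. It can be used to search for optimal thresholds and optimal strategy combinations, quickly identify rejection and recovery regions, and produce intuitive result output. The tool is mainly intended for risk-control strategy scenarios.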

Python 13 3 Updated Oct 16, 2024

Pre-train a new LLM

Python 70 22 Updated Jan 16, 2024