
A tool for running Alpha

Python 134 31 Updated Jan 18, 2025

📚 Zhihu Quant Q&A collection

9 8 Updated Jul 13, 2023
Jupyter Notebook 7 3 Updated Aug 6, 2024

TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools

Python 483 72 Updated Apr 21, 2025

A simple tutorial about effectively using pdb

Python 19 3 Updated Aug 31, 2023
Python 570 23 Updated Jun 30, 2025

Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning

21 Updated Jun 14, 2025

Leverage WorldQuant API to generate alpha signals, and mine promising alpha expressions.

TypeScript 93 17 Updated May 4, 2025

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 14,915 2,418 Updated Jun 18, 2025

Statsmodels: statistical modeling and econometrics in Python

Python 10,768 3,275 Updated Jun 24, 2025

Python wrapper for TA-Lib (http://ta-lib.org/).

Cython 10,818 1,887 Updated Jun 8, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,225 361 Updated Jun 2, 2025

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 26,083 3,993 Updated Jul 1, 2025

📈 Get real-time stocks from TradingView

JavaScript 2,164 464 Updated Jun 9, 2025

Trading Framework and Bot based on Moomoo/Futu

Python 59 24 Updated Jun 1, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 469 33 Updated Jun 30, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,189 1,684 Updated Jul 1, 2025

An AI Hedge Fund Team

Python 37,516 6,544 Updated Jul 1, 2025

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

3,182 186 Updated May 7, 2025

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Python 116 7 Updated Apr 17, 2025

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

Python 29 1 Updated Jun 5, 2025

Awesome RL Reasoning Recipes ("Triple R")

717 43 Updated Jun 16, 2025
Python 84 6 Updated Jun 10, 2025

What Makes a Reward Model a Good Teacher? An Optimization Perspective

Python 32 3 Updated Jun 26, 2025
Python 23 Updated Apr 9, 2025
Python 46 2 Updated Apr 9, 2025

Model merging is a highly efficient approach for long-to-short reasoning.

Python 67 3 Updated Jun 4, 2025

This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…

Python 625 14 Updated Jun 26, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,002 48 Updated Jun 24, 2025

MMR1: Advancing the Frontiers of Multimodal Reasoning

161 5 Updated Mar 17, 2025