Skip to content
View Zhiyuan-Zeng's full-sized avatar

Highlights

  • Pro

Block or report Zhiyuan-Zeng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time compute

Python 85 7 Updated Dec 8, 2025

[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Python 162 16 Updated Nov 14, 2025

[NeurIPS 2025] Precise Information Control in Long-Form Text Generation

Python 9 Updated Oct 1, 2025

[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Python 389 38 Updated Nov 21, 2025

[COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees

Python 30 4 Updated Jul 11, 2025
Python 142 15 Updated Jul 21, 2024

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Python 635 56 Updated Mar 4, 2024

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

Python 134 9 Updated Jul 8, 2024

[ACL 2023] Plug-and-Play Knowledge Injection for Pre-trained Language Models

Python 62 3 Updated Apr 1, 2024

[ACL 2023 Findings] Emergent Modularity in Pre-trained Transformers

Python 26 2 Updated Jun 7, 2023