NLP Group @ HKUST

26 repositories

COMP4901B-LLMs
Public
"Large Language Models" Course (COMP4901B) offered in HKUST
Python
•9•9•0•1•Updated Nov 23, 2025
Toolathlon
Public
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
Python
•9•143•2•0•Updated Nov 22, 2025
deepsearch-tts
Public
Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
Python
•1•20•1•0•Updated Oct 8, 2025
RL-Verifier-Robustness
Public
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
Python • MIT License
•1•23•0•0•Updated Oct 7, 2025
WebExplorer
Public
The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
Python
•1•87•0•0•Updated Sep 29, 2025
model-task-align-rl
Public
The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
Python • MIT License
•0•15•0•0•Updated Sep 3, 2025
simpleRL-reason
Public
Simple RL training for reasoning
Python • MIT License
•281•3.8k•30•1•Updated Aug 3, 2025
ceval
Public
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
Python • MIT License
•83•1.8k•6•0•Updated Jul 27, 2025
mstar
Public
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
MIT License
•3•69•1•0•Updated Jul 13, 2025
Laser
Public
Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
Python
•4•60•3•0•Updated May 22, 2025
B-STaR
Public
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Python
•11•86•0•0•Updated May 21, 2025
CodeIO
Public
[ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
Python
•32•561•0•1•Updated May 6, 2025
GUIMid
Public
•0•21•1•0•Updated May 3, 2025
Vision4Chart
Public
The official repo of "On the Perception Bottleneck of VLMs for Chart Understanding"
Jupyter Notebook
•0•8•0•0•Updated Apr 12, 2025
PreSelect
Public
[ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches
Python
•8•57•0•0•Updated Mar 4, 2025
dart-math
Public
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
nlp deep-learning mathematics llm llm-training llm-inference llm-evaluation
Jupyter Notebook • MIT License
•7•116•3•0•Updated Dec 10, 2024
deita
Public
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
alignment data-centric large-language-models instruction-tuning
Python • Apache License 2.0
•32•576•7•0•Updated Dec 9, 2024
Universal_Truthfulness_Hyperplane
Public
On the Universal Truthfulness Hyperplane Inside LLMs (EMNLP 2024)
Python
•2•6•0•0•Updated Oct 3, 2024
llm-compression-intelligence
Public
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
Python • MIT License
•6•143•0•0•Updated Sep 20, 2024
AgentBoard
Public
An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]
SAS
•37•366•11•5•Updated May 20, 2024
Activation_Decoding
Public
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
Python
•9•62•3•1•Updated Mar 30, 2024
hkust-nlp.github.io
Public
JavaScript
•0•0•0•0•Updated Jan 25, 2024
felm
Public
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
Python
•1•61•3•0•Updated Dec 25, 2023
PEM_composition
Public
[NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"
Python • Apache License 2.0
•10•61•4•1•Updated Nov 26, 2023
llmeval_sum_factual
Public
Python
•1•7•0•0•Updated Oct 3, 2023
SynCSE
Public
This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"
Python • MIT License
•7•39•1•0•Updated Jun 9, 2023