Conversation

@ttompa ttompa commented Dec 1, 2025

This PR improves inference performance during validation and final model saving by merging the LoRA weights back into the base model whenever gradients are not needed. This eliminates the inference overhead of the additional torch calls (the extra low-rank matmuls) in the LoRA implementation.
This is based on, and requires, the E0 estimation branch.
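The merge described above can be sketched as follows. This is a minimal illustration, not the PR's actual implementation: the function and variable names are hypothetical, and it assumes the standard LoRA parametrization W' = W + (alpha/r) · B·A.

```python
import torch

def merge_lora(base_weight, lora_A, lora_B, alpha, r):
    """Fold the LoRA update into the base weight: W' = W + (alpha/r) * B @ A."""
    return base_weight + (alpha / r) * (lora_B @ lora_A)

# Check that the merged layer reproduces base + LoRA applied separately.
torch.manual_seed(0)
d_out, d_in, r, alpha = 8, 8, 2, 16
W = torch.randn(d_out, d_in)   # frozen base weight
A = torch.randn(r, d_in)       # LoRA down-projection
B = torch.randn(d_out, r)      # LoRA up-projection
x = torch.randn(4, d_in)

# Unmerged forward: two extra matmuls per layer on every call.
y_lora = x @ W.T + (alpha / r) * (x @ A.T @ B.T)

# Merged forward: a single matmul, paid for once at merge time.
W_merged = merge_lora(W, A, B, alpha, r)
y_merged = x @ W_merged.T

print(torch.allclose(y_lora, y_merged, atol=1e-5))
```

Since the merge is exact, validation and the saved final model produce the same outputs as the unmerged adapter while running at base-model speed; the merge only needs to be undone (or the adapters re-attached) before training resumes.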

@ttompa ttompa marked this pull request as ready for review December 1, 2025 21:34
@ttompa ttompa marked this pull request as draft December 1, 2025 21:35
