Skip to content

Conversation

@tgaddair
Copy link
Contributor

@tgaddair tgaddair commented May 21, 2024

This PR enables support for per-request inference using an adapter repo that consists of both a LoRA component and a Medusa component.

Example:

https://huggingface.co/arnavgrg/magicoder_medusa_lora_25K

@tgaddair tgaddair changed the title Medusa lora Support Medusa + LoRA adapters (jointly trained) May 21, 2024
@tgaddair tgaddair changed the title Support Medusa + LoRA adapters (jointly trained) Support jointly trained Medusa + LoRA adapters May 21, 2024
@tgaddair tgaddair marked this pull request as ready for review May 21, 2024 22:47
@tgaddair tgaddair requested a review from arnavgarg1 May 21, 2024 22:47
@tgaddair tgaddair merged commit a1ff52d into main May 22, 2024
@tgaddair tgaddair deleted the medusa-lora branch May 22, 2024 17:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants