Llama training finetuning interface #2246

howard0su · 2023-07-17T01:50:08Z

Like to get early feedback to adding fine tunning interface to llama.h.

Leverage same code for compute graph
leverage same model class to store the weights including LoRA weights

Questions to get feedback:

shall we have finetune context instead of llama_context today?
shall we have simple function to complete hide training? I feel it may miss the feature like using grammar CFG to finetune model.

ggerganov

Looks fine, though it's hard to tell what's the best approach.
I see you are applying LoRA at runtime. Can you make a prototype for loading the optional LoRA tensors and measure what is the performance difference compared to applying the LoRA once at the start?

ggerganov · 2023-07-21T10:05:22Z

llama.h

    LLAMA_API int llama_eval_export(struct llama_context * ctx, const char * fname);

+    // Enable finetune on the context, flags indicate what type of finetune
+    LLAMA_API int llama_enable_finetune(struct llama_context * ctx, enum llama_finetune_type flags);


Suggested change

LLAMA_API int llama_enable_finetune(struct llama_context * ctx, enum llama_finetune_type flags);

LLAMA_API int llama_finetune_enable(struct llama_context * ctx, enum llama_finetune_type flags);

howard0su added 2 commits July 6, 2023 21:12

Add new APIs

f607bd1

More changes

2c93852

howard0su changed the title ~~Llama train interface~~ Llama training finetuning interface Jul 17, 2023

ggerganov reviewed Jul 21, 2023

View reviewed changes

daboe01 mentioned this pull request Jul 22, 2023

Train Text from scratch #1652

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Llama training finetuning interface #2246

Llama training finetuning interface #2246

Uh oh!

howard0su commented Jul 17, 2023

Uh oh!

ggerganov left a comment

Uh oh!

ggerganov Jul 21, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	LLAMA_API int llama_enable_finetune(struct llama_context * ctx, enum llama_finetune_type flags);
	LLAMA_API int llama_finetune_enable(struct llama_context * ctx, enum llama_finetune_type flags);

Llama training finetuning interface #2246

Are you sure you want to change the base?

Llama training finetuning interface #2246

Uh oh!

Conversation

howard0su commented Jul 17, 2023

Uh oh!

ggerganov left a comment

Choose a reason for hiding this comment

Uh oh!

ggerganov Jul 21, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants