Fine-Tuning LLM for Code Style Analysis

This repository contains the source code and resources for the research paper titled "Fine-Tuning LLM for Code Style Analysis: An Approach Augmented with DFA". Study explores the integration of Deterministic Finite Automata (DFA) into the fine-tuning process of Large Language Models (LLMs), specifically focusing on Llama-2 7B and Llama-3 8B models.

The primary objective is to improve the models' accuracy in distinguishing between PEP-8 compliant and non-compliant code indentation, particularly under conditions of limited training data.

Installation

$ git clone https://github.com/aholovko/pycodestyle-llm.git
$ cd pycodestyle-llm
$ python3.11 -m venv venv
$ source venv/bin/activate
$ pip install -r requirements.txt

Usage

Run zero-shot evaluation

$ python zero-shot-eval.py --model llama2 --iterations 10

Run W&B sweeps to search the hyperparameter space

$ wandb sweep sweep.yaml
$ wandb agent <sweep_id> -p pycodestyle-llm -e one-cleancode

Fine-tuning model with specific parameters

$ python llama-tune.py --batch_size=16 --epochs=4 --learning_rate=0.001 --lora_alpha=64 --lora_dropout=0.05 --lora_r=8 --model=llama3 --output_dir=llama-tuned-local

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
data		data
notebooks		notebooks
.gitignore		.gitignore
README.md		README.md
dfa_trainer.py		dfa_trainer.py
llama-tune.py		llama-tune.py
mps-assert.py		mps-assert.py
pep8_checker.py		pep8_checker.py
requirements.txt		requirements.txt
sweep.yaml		sweep.yaml
zero-shot-eval.py		zero-shot-eval.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Fine-Tuning LLM for Code Style Analysis

Installation

Usage

Run zero-shot evaluation

Run W&B sweeps to search the hyperparameter space

Fine-tuning model with specific parameters

About

Uh oh!

Releases

Packages

Uh oh!

Languages

aholovko/pycodestyle-llm

Folders and files

Latest commit

History

Repository files navigation

Fine-Tuning LLM for Code Style Analysis

Installation

Usage

Run zero-shot evaluation

Run W&B sweeps to search the hyperparameter space

Fine-tuning model with specific parameters

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages