Skip to content

zaixizhang/GeneBreaker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GeneBreaker: Jailbreak Attacks against DNA Language Models with Pathogenicity Guidance

arxiv paper

Overview

GeneBreaker Overview

Introduction

GeneBreaker is the first framework to systematically evaluate jailbreak vulnerabilities in DNA foundation models. As DNA foundation models become increasingly capable of designing synthetic functional DNA sequences, understanding and mitigating their potential to generate harmful, pathogenic, or toxin-producing genes is critical for biosecurity.

Paper Abstract

DNA, encoding the genetic instructions for all living organisms, fuels groundbreaking advancements in genomics and synthetic biology. Recently, DNA Foundation Models have achieved the success of designing synthetic functional DNA sequences, but their susceptibility to jailbreaking remains underexplored, leading to potential concern of generating harmful sequences such as pathogenic or toxin-producing genes. In this paper, we introduce GeneBreaker, the first framework to systematically evaluate jailbreak vulnerabilities in DNA foundation models. GeneBreaker employs (1) an LLM agent with customized bioinformatics prompts to design high-homology, non-pathogenic prompts, (2) a beam search guided by PathoLM and log-probability heuristics to steer generation toward pathogen-like sequences, and (3) a BLAST-based evaluation pipeline against a curated Human Pathogen Database (JailbreakDNABench) to detect successful jailbreaks.

Evaluated on our curated JailbreakDNABench, GeneBreaker consistently successfully jailbreaks the latest Evo series models across 6 viral categories (up to 60% Attack Success Rate for Evo2-40B). Further case studies on SARS-CoV-2 spike protein and HIV-1 envelope protein demonstrate the sequence and structural fidelity of jailbreak outputs, while evolutionary modeling of SARS-CoV-2 underscores biosecurity risks. Our findings also reveal that scaling DNA foundation models amplifies dual-use risks, motivating enhanced safety alignment and tracing mechanisms.


GeneBreaker Pipeline

GeneBreaker consists of the following key components:

  1. LLM Agent with Bioinformatics Prompts
    • Designs high-homology, non-pathogenic prompts to initiate model generation.
  2. Beam Search Guided by PathoLM and Log-Probability Heuristics
    • Steers the DNA foundation model to generate sequences with high similarity to known pathogens.
  3. Evaluation Pipeline
    • Compares generated sequences against the curated Human Pathogen Database (JailbreakDNABench) to detect successful jailbreaks, defined as DNA sequences or translated protein sequences with >90% similarity to target pathogen sequences.

Benchmark: JailbreakDNABench

JailbreakDNABench Benchmark


Usage Example: Jailbreaking Evo2 to Generate HIV-like Sequences

1. Download Patho-LM Model

First, download and extract the Patho-LM model checkpoint:

gdown 'https://drive.google.com/uc?id=17o5RZoW7mHrk59xV_iuh6Z_SLhuNoTxi' -O Patho-LM.zip
unzip Patho-LM.zip

2. Configure Patho-LM Path

Update the Patho-LM model path in auto_jailbreak_hiv.py (see the comment at line 530) to point to your extracted Patho-LM directory. For example:

model_name = "../Patho-LM/finetuned_ckpt/"  # Change this to your Patho-LM model path

3. Run the Jailbreak Script

You can now run the script to generate and evaluate HIV-like sequences:

python auto_jailbreak_hiv.py --skip_chatgpt

What this script does:

  • Uses the GeneBreaker pipeline to prompt Evo2 for sequence generation.
  • Guides the generation toward HIV-1 envelope protein-like sequences.
  • Evaluates the generated sequences for similarity to known HIV sequences (achieving up to 92.50% DNA similarity and 87.79% protein similarity).

Case Study

Case Study


Acknowledgements

About

Jailbreak Evo

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages