SAST-AI-Workflow

🎯 Project Overview

SAST-AI-Workflow is an LLM-based tool designed to detect and flag suspected vulnerabilities through SAST (Static Application Security Testing). It inspects suspicious lines of code in a given repository and reviews in depth whether each reported error is legitimate. The workflow draws on existing SAST reports, source code analysis, CWE data, and other known examples.

Purpose

The SAST-AI-Workflow can be integrated into the vulnerability detection process as an AI-assisted tool. It offers enhanced insights that may be overlooked during manual verification, while also reducing the time required by engineers.

As an initial step, we applied the workflow to the SAST scanning of the RHEL systemd project (source: systemd GitHub). We intend to extend this approach to support additional C-based projects in the future.

📐 Architecture

Input Sources

SAST HTML Reports:
Processes scan results from SAST HTML reports.
Source Code:
The pipeline requires access to the exact source code that was used to generate the SAST HTML report.
Verified Data:
Incorporates known error cases for continuous learning and better results.
CWE Information:
Embeds additional CWE (Common Weakness Enumeration) data to enrich the context used for vulnerability analysis.
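
To make the four input sources concrete, the following is a minimal sketch of how they might be combined into one context object per finding. It is not the project's actual data model: the class names, field names, and the `build_context` helper are all hypothetical, and the dictionaries stand in for whatever the real report parser and data loaders produce.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class SastFinding:
    """One issue taken from a SAST report (field names are hypothetical)."""
    cwe_id: str
    file_path: str
    line: int
    message: str


@dataclass
class AnalysisContext:
    """Everything gathered for a single finding before it is reviewed."""
    finding: SastFinding
    source_snippet: str
    cwe_description: str
    known_examples: List[str] = field(default_factory=list)


def extract_snippet(source: str, line: int, radius: int = 2) -> str:
    """Return the reported line (1-based) plus `radius` lines on each side."""
    lines = source.splitlines()
    start = max(line - 1 - radius, 0)
    end = min(line + radius, len(lines))
    return "\n".join(lines[start:end])


def build_context(
    finding: SastFinding,
    sources: Dict[str, str],         # file path -> file content
    cwe_data: Dict[str, str],        # CWE id -> description
    verified: Dict[str, List[str]],  # CWE id -> known, already reviewed cases
) -> AnalysisContext:
    """Join the report entry with source code, CWE text, and verified data."""
    return AnalysisContext(
        finding=finding,
        source_snippet=extract_snippet(sources[finding.file_path], finding.line),
        cwe_description=cwe_data.get(finding.cwe_id, ""),
        known_examples=verified.get(finding.cwe_id, []),
    )
```

The lookup by exact file path reflects the requirement above: if the source tree differs from the one that was scanned, the reported line numbers no longer point at the right code.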

Embeddings & Vector Store

Converts input data (verified data, source code) into embeddings using a HuggingFace sentence-transformer model (all-mpnet-base-v2) and stores them in an in-memory vector store (FAISS).
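
The embed-then-search idea can be illustrated with a small, dependency-free sketch. It deliberately swaps in stand-ins: token counts replace the all-mpnet-base-v2 embeddings, and a plain Python list with brute-force cosine similarity replaces FAISS. The class and function names are hypothetical and only show the shape of the step, not the project's code.

```python
import math
import re
from collections import Counter
from typing import List, Tuple


def embed(text: str) -> Counter:
    """Toy embedding: lowercase token counts (stand-in for a sentence transformer)."""
    return Counter(re.findall(r"[a-z_]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class InMemoryVectorStore:
    """Keeps (text, vector) pairs in memory and ranks them against a query."""

    def __init__(self) -> None:
        self._items: List[Tuple[str, Counter]] = []

    def add(self, text: str) -> None:
        self._items.append((text, embed(text)))

    def search(self, query: str, k: int = 2) -> List[str]:
        """Return the k stored texts most similar to the query."""
        q = embed(query)
        ranked = sorted(self._items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [text for text, _ in ranked[:k]]


store = InMemoryVectorStore()
store.add("strcpy into fixed buffer without length check")
store.add("pointer freed twice in cleanup path")
store.add("integer overflow in size computation")
top_hit = store.search("buffer overflow from strcpy", k=1)
```

Because the store lives only in memory, it has to be rebuilt from the verified data and source code on every run, which is the trade-off an in-memory index makes for simplicity.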

LLM Integration

Uses either NVIDIA's API via the ChatNVIDIA integration or an LLM deployed on the Red Hat OpenShift AI platform to query the vector store and review potential SAST errors.
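
A rough sketch of this review step is shown below, assuming the model is reachable through a simple callable that takes a prompt and returns text. The prompt wording, the `review_finding` helper, and the verdict format are all hypothetical; the real workflow's prompts and client code may look quite different. Passing the retriever and the model in as callables keeps the sketch independent of which backend (ChatNVIDIA or an OpenShift AI deployment) is used.

```python
from typing import Callable, Dict, List

# Hypothetical prompt; the wording is an assumption, not the project's template.
PROMPT_TEMPLATE = """You are reviewing a finding reported by a static analysis tool.

Finding:
{finding}

Related context retrieved from the vector store:
{context}

Answer with FALSE POSITIVE or TRUE POSITIVE, followed by a short justification."""


def build_prompt(finding: str, context_chunks: List[str]) -> str:
    """Fill the template with the finding and the retrieved context."""
    context = "\n---\n".join(context_chunks) if context_chunks else "(no context found)"
    return PROMPT_TEMPLATE.format(finding=finding, context=context)


def review_finding(
    finding: str,
    retrieve: Callable[[str], List[str]],  # e.g. a vector store search
    llm: Callable[[str], str],             # e.g. a wrapper around a chat model
) -> Dict[str, object]:
    """Retrieve context, ask the model, and parse its verdict."""
    prompt = build_prompt(finding, retrieve(finding))
    answer = llm(prompt)
    verdict = answer.strip().upper().startswith("FALSE POSITIVE")
    return {"finding": finding, "is_false_positive": verdict, "justification": answer}


# Usage with stub callables standing in for the vector store and the model.
result = review_finding(
    "Possible buffer overflow in copy_name()",
    retrieve=lambda query: ["copy_name() checks the length before calling strcpy"],
    llm=lambda prompt: "False positive: the length is validated before the copy.",
)
```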

Evaluation

Applies metrics from the Ragas library to assess the quality of model outputs.
Note: SAST-AI-Workflow is primarily focused on identifying false alarms (False Positives).

📊 Evaluation & Metrics

Model responses are evaluated using the following metrics:

Response Relevancy:
Ensures that the generated answers are directly related to the query.
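
Ragas documents this metric as the mean cosine similarity between the original question and a set of questions generated back from the model's answer. The sketch below reproduces only that final scoring step with plain Python lists. It does not call Ragas, the function names are hypothetical, and the vectors in the usage line are made up for illustration rather than real embeddings.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two dense vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def response_relevancy(
    question_vec: Sequence[float],
    generated_question_vecs: Sequence[Sequence[float]],
) -> float:
    """Average similarity of the original question to the generated questions."""
    if not generated_question_vecs:
        return 0.0
    scores = [cosine_similarity(question_vec, g) for g in generated_question_vecs]
    return sum(scores) / len(scores)


# Made-up vectors: one generated question matches the original, one is unrelated.
score = response_relevancy([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
```

A score close to 1 suggests the answer addresses the question that was asked, while a low score points to an answer that drifts off topic or is incomplete.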

🔌 Installation & Setup

Please refer to the how-to-run guideline.
