PCB Defect Detection System for Quality Control

Author: Dawson Burgess, Computer Science Department, University of Idaho, Moscow, ID, United States
Email: [email protected], [email protected]

Overview

This repository contains the code, proposal, and research paper for the project "PCB Defect Detection System for Quality Control," developed as part of CS555: Machine Vision at the University of Idaho. The system uses computer vision and deep learning to detect manufacturing defects in Printed Circuit Boards (PCBs), such as missing holes, broken traces, and soldering issues. It compares Convolutional Neural Networks (CNNs) with traditional machine learning models (SVM, Random Forest, Logistic Regression), achieving up to 96% accuracy with a deeper CNN.

Key Features

Data Preprocessing: Parses XML annotations, preprocesses images with histogram equalization and Gaussian blur.
Models: Baseline CNN, Deeper CNN, and classical ML models for defect classification.
Visualization: Confusion matrices, ROC/PR curves, PCA feature space, and misclassified samples.
Dataset: Utilizes the PCB Defect Dataset from Kaggle.

Project Structure

final_project_cs555_computer_vision.py: Main script for data preparation, model training, and evaluation.
PCB_Defect_Detection_System_for_Quality_Control.pdf: Research paper detailing methodology and results.
Final_Project_Proposal_1.pdf: Initial project proposal outlining goals and approach.

Dataset

The PCB Defect Dataset is sourced from Kaggle and available here. It includes high-resolution PCB images and XML annotations for six defect types: missing hole, mouse bite, open circuit, short, spur, and spurious copper. The dataset was parsed into a CSV file for streamlined processing, with bounding box coordinates used to crop defect regions.

Preprocessing Steps:

Parsed XML annotations into a pandas DataFrame.
Cropped defect regions, resized to 224x224 pixels, and normalized to [0, 1].
Applied histogram equalization and Gaussian blur to enhance defect visibility.

Methodology

Data Preparation

Converted XML annotations to a CSV format with bounding box coordinates and class labels.
Split data into 80% training and 20% testing sets.
Preprocessed images for CNNs (TensorFlow) and feature extraction for classical ML (scikit-learn).

Models

Baseline CNN:
- Architecture: 2 Conv2D layers, 2 MaxPooling2D layers, Dense layers (128, 6).
- Performance: 94% accuracy.
Deeper CNN:
- Architecture: 6 Conv2D layers, 3 MaxPooling2D layers, Dense layers (256, 6).
- Performance: 96% accuracy.
Classical ML Models:
- SVM (RBF Kernel): 90% accuracy.
- Random Forest: 91% accuracy.
- Logistic Regression: 82% accuracy.

Visualization Techniques

Confusion Matrices: Assessed class-wise performance (e.g., confusion_matrix_deeper_cnn.png).
ROC/PR Curves: Evaluated model robustness (roc_curves_deeper_cnn.png).
Training History: Plotted accuracy/loss trends (training_history_baseline_cnn.png).
PCA Visualization: Explored feature space separability (sklearn_results_pca.png).
Misclassified Samples: Highlighted challenging cases (misclassified_examples_deeper_cnn.png).

Results

Deeper CNN: Top performer with 96% accuracy, high precision/recall across all classes.
Baseline CNN: Achieved 94% accuracy, slightly less robust than the deeper model.
Classical ML: Random Forest (91%) outperformed SVM (90%) and Logistic Regression (82%), but lagged behind CNNs.
Key Insight: CNNs excelled at learning subtle defect patterns, with the deeper model showing balanced performance across classes like spur and spurious copper.

Challenges and Limitations

Visual Similarity: Defects like spur vs. spurious copper were harder to distinguish.
Data Variability: Limited exploration of lighting/design variations.
Time Constraints: Shifted focus from traditional image processing to end-to-end deep learning.

Future Work

Enhance preprocessing with advanced augmentation (e.g., rotations, brightness adjustments).
Integrate attention mechanisms or real-time detection with live feeds.
Expand dataset with diverse PCB designs for better generalization.

Installation and Usage

Clone the repository:

git clone https://github.com/yourusername/pcb-defect-detection.git

Download the dataset:

Access the PCB Defect Dataset here. Place it in the data/ directory or update file paths in final_project_cs555_computer_vision.py.
Install dependencies:
```
pip install -r requirements.txt
```

Run the script:

python final_project_cs555_computer_vision.py

View outputs: Check figures/ for visualizations and model_output_summary for metrics.

License

This project is licensed under the MIT License.

Contact

For inquiries, contact Dawson Burgess at [email protected] or [email protected].

Citation

If you use this project, please cite: Dawson Burgess. (2024). PCB Defect Detection System for Quality Control. University of Idaho.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
CITATION.cff		CITATION.cff
CS555_DB_FinalProjectProposal1.pdf		CS555_DB_FinalProjectProposal1.pdf
CS555_DB_Final_Paper.pdf		CS555_DB_Final_Paper.pdf
LICENSE		LICENSE
README.md		README.md
final_project.py		final_project.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PCB Defect Detection System for Quality Control

Overview

Key Features

Project Structure

Dataset

Methodology

Data Preparation

Models

Visualization Techniques

Results

Challenges and Limitations

Future Work

Installation and Usage

License

Contact

Citation

About

Uh oh!

Releases

Packages

Languages

License

pegasora/PCB-Defect-Detection-System-for-Quality-Control

Folders and files

Latest commit

History

Repository files navigation

PCB Defect Detection System for Quality Control

Overview

Key Features

Project Structure

Dataset

Methodology

Data Preparation

Models

Visualization Techniques

Results

Challenges and Limitations

Future Work

Installation and Usage

License

Contact

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages