PyTorch implementation of RetinaNet object detection, as described in Focal Loss for Dense Object Detection by Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He and Piotr Dollár.
Currently, this repo achieves 33.7% mAP at 600px resolution with a ResNet-50 backbone. The published result is 34.0% mAP. The difference is likely due to the use of the Adam optimizer instead of SGD with weight decay.
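For reference, a minimal sketch of the two optimizer configurations being compared; the learning rate and weight decay values below are illustrative, not necessarily the settings used in this repo or the paper:

```python
import torch.nn as nn
import torch.optim as optim

model = nn.Linear(10, 2)  # stand-in for the RetinaNet model

# This repo's choice: Adam (learning rate here is illustrative).
optimizer_adam = optim.Adam(model.parameters(), lr=1e-5)

# The paper's choice: SGD with momentum and weight decay
# (common detection defaults, not taken from this repo).
optimizer_sgd = optim.SGD(model.parameters(), lr=1e-2, momentum=0.9, weight_decay=1e-4)
```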
- Clone this repo
- Install the required packages:
  apt-get install tk-dev python-tk
- Install the Python packages:
  pip install cffi
  pip install pandas
  pip install pycocotools
  pip install cython
  pip install opencv-python
  pip install requests
- Build the NMS extension.
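As a quick sanity check (a hedged sketch, not part of the repo), the following confirms the Python dependencies import correctly before training:

```python
# Verify that the core dependencies are importable.
import torch
import cv2
import pandas
import pycocotools
import requests

print("torch", torch.__version__)
print("opencv", cv2.__version__)
```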
The network can be trained using the train.py script. Currently, two dataloaders are available: COCO and CSV. For training on COCO, use

python train.py coco <path/to/coco>

For training on a custom dataset with annotations in CSV format (see below), use

python train.py csv <path/to/annotations.csv> <path/to/classes.csv>

To visualize the network detections, use test.py.
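As an independent illustration (not the repo's test.py), here is a minimal sketch of drawing detection boxes with OpenCV; the image path, boxes, labels and scores below are placeholders:

```python
import cv2

# Placeholder detections in (x1, y1, x2, y2, class_name, score) form.
detections = [(837, 346, 981, 456, "cow", 0.92)]

image = cv2.imread("/data/imgs/img_001.jpg")  # placeholder path
for x1, y1, x2, y2, name, score in detections:
    # Draw the box and label it with the class name and confidence.
    cv2.rectangle(image, (x1, y1), (x2, y2), (0, 255, 0), 2)
    cv2.putText(image, f"{name} {score:.2f}", (x1, max(y1 - 5, 0)),
                cv2.FONT_HERSHEY_SIMPLEX, 0.6, (0, 255, 0), 2)
cv2.imwrite("detections.jpg", image)
```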
The CSVGenerator provides an easy way to define your own datasets. It uses two CSV files: one file containing annotations and one file containing a class name to ID mapping.
The CSV file with annotations should contain one annotation per line. Images with multiple bounding boxes should use one row per bounding box. Note that indexing for pixel values starts at 0. The expected format of each line is:
path/to/image.jpg,x1,y1,x2,y2,class_name
Some images may not contain any labeled objects. To add these images to the dataset as negative examples, add an annotation where x1, y1, x2, y2 and class_name are all empty:

path/to/image.jpg,,,,,
A full example:
/data/imgs/img_001.jpg,837,346,981,456,cow
/data/imgs/img_002.jpg,215,312,279,391,cat
/data/imgs/img_002.jpg,22,5,89,84,bird
/data/imgs/img_003.jpg,,,,,
This defines a dataset with 3 images. img_001.jpg contains a cow. img_002.jpg contains a cat and a bird. img_003.jpg contains no interesting objects/animals.
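As a hedged illustration (reusing the paths and boxes from the example above), a small Python sketch that writes such an annotations file, including a negative example with all fields left empty:

```python
import csv

# (image_path, x1, y1, x2, y2, class_name); empty strings mark a negative example.
annotations = [
    ("/data/imgs/img_001.jpg", 837, 346, 981, 456, "cow"),
    ("/data/imgs/img_002.jpg", 215, 312, 279, 391, "cat"),
    ("/data/imgs/img_002.jpg", 22, 5, 89, 84, "bird"),
    ("/data/imgs/img_003.jpg", "", "", "", "", ""),
]

with open("annotations.csv", "w", newline="") as f:
    csv.writer(f).writerows(annotations)
```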
The class name to ID mapping file should contain one mapping per line. Each line should use the following format:
class_name,id
Indexing for classes starts at 0. Do not include a background class as it is implicit.
For example:
cow,0
cat,1
bird,2
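Similarly, a small sketch (illustrative, not part of the repo) that writes the class mapping file with IDs starting at 0 and no background class:

```python
import csv

class_names = ["cow", "cat", "bird"]  # order determines the IDs

with open("classes.csv", "w", newline="") as f:
    writer = csv.writer(f)
    for class_id, name in enumerate(class_names):  # IDs start at 0
        writer.writerow([name, class_id])
```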
- Significant amounts of code are borrowed from the keras-retinanet implementation
- The NMS module used is from the pytorch-faster-rcnn implementation