pytorch-deep-image-matting

Non-official pytorch implementation of deep image matting

Performance

Test with method whole and max_size=1600.
training epoch = 25.
SAD normalized by 1000.
best epoch is the epoch of the best result while training.
input img is normalized by mean=[0.485, 0.456, 0.406] and std=[0.229, 0.224, 0.225].
erode the alpha for trimap as well as dialte(for 0 and 1 balance) & crop the patch based on the center point(randomly selected from where alpha > 0 and alpha < 1)
The performance is close to paper.

model	MSE	SAD	best epoch	note	link
paper-stage0	0.019	59.6
paper-stage1	0.017	54.6
paper-stage3	0.014	50.4
my-stage0	0.035	72.9	22	with crop error and with no normalized input	download
my-stage0	0.031	70.7	19	with no normalized input	download
my-stage0	0.027	69.1	12	fix crop error and with normalized input	download
my-stage0	0.020	62.0	14	erode as well as dialte & center crop patch
my-stage1				ongoing

Dependencies

pip install -r requirements.txt

Dataset

You should prepare dataset VOC-2012 and COCO-2017 first.
Please email the author for access to raw matting dataset.
Composite the dataset for training and test
```
python tools/composite.py
```

Pretrained model

python tools/chg_model.py

Train

bash train.sh

Training args
- size_h, size_w: final input size of image, 320x320 in paper.
- crop_h, crop_w: random crop size of image, 320x320, 480x480, 640x640 in paper.
- alphaDir, fgDir, bgDir, imgDir: directories of training matting dataset, which is generated by tool/composite.py.
- batchSize: batch size, perhaps batchSize=1 can get better result.
- nEpochs: training epochs.
- step: epochs for decay 1/10 of learning rate.
- lr: learning rate.
- resume: checkpoint for continue training.
- pretrain: pretrained model generated by tool/chg_model.py or checkpoints generated by training.
- saveDir: checkpoints saved directory.
- printFreq: print frequency.
- ckptSaveFreq: checkpoint saved frequency.
- wl_weight: loss weight mentioned in paper.
- stage: 0 for simple alpha loss encode-decoder training, 1 for encoder-decoder training, 2 for encoder-decoder fixed refinement training, 3 for encoder-decoder refinement training.
- testFreq: test frequency
- testImgDir, testAlaphaDir, testTrimapDir: directories of test matting dataset, which is generated by tool/composite.py.
- testResDir: test results saved directory.
- crop_or_resize: test method: crop, resize, whole(as paper).
- max_size: the max input size when test with whole method.

Deploy

bash deploy.sh

Training args
- size_h, size_w: input size when test with resize or crop method.
- crop_h, crop_w: random crop size of image, 320x320, 480x480, 640x640 in paper.
- imgDir, trimapDir, alphaDir: directories of test matting dataset, which is generated by tool/composite.py.
- resume: checkpoint for test.
- saveDir: test results saved directory.
- stage: 0, 1 for encoder-decoder test, 2, 3 for encoder-decoder refinement test
- crop_or_resize: test method: crop, resize, whole(as paper).
- max_size: the max input size when test with whole method.

Visualization

origin image / prediction result(sad=72.9) / alpha ground truth
boy-1518482_1920_12.png
sieve-641426_1920_1.png
light-bulb-376930_1920_11.png
spring-289527_1920_15.png
dandelion-1335575_1920_1.png

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
core		core
result/example		result/example
tools		tools
.gitignore		.gitignore
README.md		README.md
deploy.sh		deploy.sh
requirements.txt		requirements.txt
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

pytorch-deep-image-matting

Performance

Dependencies

Dataset

Pretrained model

Train

Deploy

Visualization

About

Uh oh!

Releases

Packages

Languages

wangxu19920419/pytorch-deep-image-matting

Folders and files

Latest commit

History

Repository files navigation

pytorch-deep-image-matting

Performance

Dependencies

Dataset

Pretrained model

Train

Deploy

Visualization

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages