The Scan Tailor version that merges the features of the Scan Tailor Featured and Scan Tailor Enhanced versions,
brings new ones and fixes.
Scan Tailor is an interactive post-processing tool for scanned pages.
It performs operations such as:
- page splitting,
- deskewing,
- adding/removing borders,
- selecting content
- ... and others.
You give it raw scans, and you get pages ready to be printed or assembled into a PDF
or DjVu file. Scanning, optical character recognition,
and assembling multi-page documents are out of scope of this project.
This feature has been improved. See page area feature description.
This feature has been reworked and is now a part of the page area feature.
This feature has been reworked. See Scan Tailor Advanced fixes & improvements for more information.
This feature has been reworked. See rectangular picture shape feature description.
This feature had performance and drawing issues and has been reworked.
Auto dewarping vertical half correction
This patch corrects the original auto-dewarping in half
the cases when it fails. If the vertical content boundary angle (calculated by auto-dewarping)
exceeds an empirical value (2.75 degrees from vertical), the patch adds a new point to
the distortion model (with the coordinates equal to the neighboring points) to make
this boundary vertical. The patch works ONLY for the linear end of the top (bottom)
horizontal line of the blue mesh (and not for the opposite curved end).
You can move the topmost (bottom-most) horizontal blue line of the dewarping mesh up and
down as a whole - if you grab it at the most left (right) red point - holding down the CTRL key.
You can create the rectangular picture zones - holding down the CTRL key.
You can move the (rectangular) picture zones corners in an orthogonal manner - holding down the CTRL key.
This feature has been reworked and is now a part of new zone interaction modes feature.
This feature had performance issues and has been optimized.
This feature has been reworked. See rectangular picture shape feature description.
*Other features of this version, such as Export, Dont_Equalize_Illumination_Pic_Zones, Original_Foreground_Mixed
has't been moved due to dirty realization. Their functionality is fully covered by
full control over settings on output and
splitting output features.
Portability.
The setting is stored in the folder with a program.
Page splitting had an influence on output only in b&w mode with dewarping disabled.
Now it works in all the modes.
Page layout and all the other views now consider splitting settings.
Corresponding improvements are done to thumbnails.
Changed Scan Tailor behavior on page split stage.
Optimized memory usage on the output stage.
Reworking on multi column thumbnails view feature from ver. Enhanced.
Now thumbnails are shown evenly.
Added option to control highlighting (with red asterisks) the thumbnails of pages with high deviation.
The option refreshes the thumbnails instantly.
Support for processing of images with light content on dark background.
Now that kind of images can correctly be handled on all the stages. Many book covers are examples of such images.
Deviation feature reworked.
Fixed other bugs of official, Enhanced and Featured versions and made lots of other improvements.
You can choose a desired color scheme in settings.
This significantly increases the speed of processing. The count of threads to use can be
adjusted while processing.
Warning! More threads requires more memory to use. Exclude situations of that to be overflowed.
This feature enables to control cut margins, normalizing illumination before binarization,
normalizing illumination in color areas options, Savitzky-Golay and morphological smoothing on output
in any mode (of course, those setting that can be applied in the current mode).
Now outside pixels can be filled with the background color of the page.
Added filling setting with the following options:
1. Background: estimate the background and fill outside pixels with its color.
2. White: always fill with white.
There are two options in settings dialog: B&W and color compression.
1. The B&W one has None, LZW, Deflate and CCITT G4 (Default) options.
2. The color one has None, LZW (Default), Deflate and JPEG options.
Sauvola and Wolf binarization algorithms have been added. They can be applied when
normalizing illumination does not help.
The feature allows to split the mixed output scans into the pairs of a foreground (letters)
and background (images) layer.
You can choose between B&W or color (original) foreground.
It can be useful:
- for the further DjVu encoding,
- to apply different filters to letters and images, which when being applied to the whole
image gives worse results.
- to apply a binarization to the letters from a third party app without affecting the images.
Note: That does not rename files to 0001, 0002... It can be made by a third party app, for example
Bulk Rename Utility
It allows to preserve the original image background in the format ready for the further processing, when BW foreground is used.
It can be used to encode into DjVu the pages with the complex background using the semi-auto "split layers" method which gives much higher quality results than DjVu auto segmenter.
Also this feature can be used to extract high contrast elements of gradient images into the foreground layer by using second processing of the layer with pictures ("background").
Properties of the original background:
* Original background images are saved into "original_background" folder in "out" directory.
* Pure black (#000000) and white (#ffffff) colors of original background image are reserved into #010101 and #fefefe, respectively.
* Picture zones are marked with black when the BW content is marked with white. This property allow to use "select by color" feature of an image editor to select needed areas for their further processing, for example, apply blur to white holes and their nearest areas to get an effective compression level of the background layer in DjVu.
* Filling zones feature also removes trash and speckles from the original background when applied to the foreground layer.
Color segmentation allows to split the image into color segments and colorize b&w mask.
Posterization allows to reduce the number of colors of the image by grouping similar colors.
The main use of posterization is to be applied to segmented image to get an indexed image, that can then be encoded into DjVu as the foreground layer. It allows to create high-quality DjVu files with color text and elements having maximal compression level.
Posterization can also be used in color mode and can be applied to usual color gradient images for different purposes, for example, to increase their compression efficiency.
"Quadro" picture shape mode from Featured was merged with Rectangular one from Enhanced. Also removed restriction of ver. Featured on deleting all the auto zones. Before it resulted in resetting all the auto zones back.
Added sensitivity option. If sensitivity equals 25%, the results will be the same as they were in old "Quadro" mode, if 100% - as in old "Rectangular".
This feature is a further development of the auto margins and page box features from version Enhanced.
Selection content stage changes:
1. A page box implementation reworked. Now it's interactive and can be adjusted by the same way as a content box is done.
2. The page rectangle does not require refreshing page and won't be reset on the content area changes.
3. The page rectangle is now drawn in the thumbnails.
4. Implemented the new feature of applying the page box set manually to the other pages. For pages of a different size ST automatically corrects page box place. The same change has been made to applying the content box feature.
5. Added a feature of dragging both content and page areas by using Shift+LMB combination.
6. Added width and height parameters to regulate the page box size in manual mode.
7. A page area is now allowed to be out of the page bounds in manual mode.
Page layout stage changes:
1. Auto margins option has been moved out of alignment settings and does no more force to use only the original layout.
2. Auto margins feature now considers page box changes on the 4th stage.
3. Corresponding bug fixes.
Owing to the changes above, there appeared a new way to set margins for books and documents, pages of which have similar formatting. At content selection stage we just set page box and content box to a page, applying those content and page boxes to the pages with similar formatting, use auto margins for those pages at page layout stage instead of adjusting the margins relative to the content box, and then go to the output.
Shift+LMB when the cursor is over a zone - drag the zone.
Ctrl+Shift+LMB when the cursor is over a zone - drag the zone copying.
Del when the cursor is over a zone - delete the zone.
Ctrl when dragging a zone vertex - make the angle of the vertex right.
Ctrl+Alt+Click - copy the latest created zone to the cursor position.
Ctrl when creating a new zone - switch to rectangular mode.
Alt when creating a new zone - switch to lasso (free drawing) mode.
The save is precise and considers the images transformations.
Also added a feature of swithing the output tabs by using Ctrl+1..5 keys combinations.
The settings are available in the main window menu.
Available units: pixels, millimetres, centimetres and inches.
The system affects every aspect of the program, so, for example, it's now possible to adjust margins in pixels, but not only in millimetres or inches.
[L] or [R] - left or right page, if the page has been splitted).This feature is also affected by measurement units system.
The system allows to manage the default filter settings for every stage.
Those filter parameters will be set as defaults for any new project created.
For example, it allows to set your own default margins standard, but not default 5, 10, 5, 10 mm, and so for the other parameters.
The system consider the units settings from the measurement units system. Units are stored in the profile and ST automatically converts the values if needed.
Now group boxes containing filter options can be collapsed/expanded.
The collapse status is preserved between restarts of the application.
Go to this repository and follow the instructions given there.