The DDS3 Dataset consists of 8,000 mosaic images with a size of 400x400 pixels and their respective masks.

Mosaic images were generated with positive and negative patches taken from the digital fields of slides S01 to S15. Masks consists of a binary image with bacilli pixels inwhite and background pixels black.

The training set is composed of 6000 images, with 3000 mosaic images with about 50% positive and 50% negative patches, 1500 mosaic images with over 90% negative patches, and 1500 mosaic images with 100% negative patches.

The validation set is composed of 1000 mosaic images with about 50% positive and 50% negative patches.

The testing set is composed of 1000 mosaic images with about 50% positive and 50% negative patches. The flowchart below shows the organization of the data.

 

 

 

 

Figure 1. Example of mosaic image with 400x400 pixels and respective binary mask.

 

 

ABBREVIATIONS, NOMENCLATURE AND FILE DIRECTORY

  ABBREVIATIONS

S → Slide
→ Tile (a digital microscopy field)
TR →Tile Region (a set of 164 Tiles or digital fields)
St → Stack (a set of 11 digital images acquired from a microscopic field at different focal depths)
EDF→ Extended Depth of Focus image (a digital image obtained by the fusion of the stack of 11 images)
→ Marking (Annotation)
NP → Negative Patch
PP→ Positive Patch
MSC→ Mosaic Image

 

 

Figure 2. File Directory