2.4. Number of Floors Detector¶
On a randomly selected set of in-the-wild building images from New Jersey’s Bergen, Middlesex, and Moris Counties, the model attains an F1-score of 86%. Here, in-the-wild building images are defined as street-level photos that may contain multiple buildings and are captured with random camera properties. confusion_nFloorWildv2
is the confusion matrix of the model inferences on the aforementioned in-the-wild test set.
If the test images are constrained such that a single building exists in each image, the building is viewed with minimal obstructions, and the images are captured such that the image plane is nearly parallel to the frontal plane of the building facade, the F1-score of the model is determined as 94.7%. confusion_nFloorClean
shows the confusion matrix for the pretrained model on a test set generated according to these constraints.
Table 2.4.1 shows a sample of images removed from the in-the-wild test set that were found to display weak resemblance of the visual cues necessary for a valid number of floor predictions.