The process of automatically masking objects from complex backgrounds is extremely beneficial when trying to utilize those objects for computer vision research, such as object detection, autonomous driving, pedestrian tracking, etc. Therefore, a robust method of segmentation is imperative towards ongoing research between the Digital Imaging and Remote Sensing Laboratory at the Rochester Institute of Technology and the Savannah River National Laboratory directed at the volume estimation of condense water vapor plumes emanating from mechanical draft cooling towers. Instance segmentation was performed on a custom data set consisting of RGB imagery with the Matterport Mask R-CNN implementation,1 where condensed water vapor plumes were masked out from mixed backgrounds for the purpose of 3D reconstruction and volume estimation. This multi-class Mask R-CNN was trained to detect cooling tower structure and plumes with and without data augmentation to study the effects on a preliminary data set, in addition to a model trained with a single plume class. The average precision and intersection over union metrics across all models were shown to not be statistically different. While each model is capable of detecting and segmenting plumes in the preliminary data set, all models essentially perform the task with the same efficacy. This indicates some level of bias in the preliminary data set, demonstrating the need for more variance in the form of additional annotated imagery. The single plume class model tested within 7% for mAP, AP, and IoU when compared to the other two models, demonstrating the ability of Mask R-CNN to detect and segment these dynamically-changing plumes without any spatial dependence on the stationary cooling tower structure. This ongoing research includes a long-term data collection campaign where imagery of condensed water vapor plumes will be continuously gathered over an 18-month period so as to include imagery examples under many different meteorological and environmental conditions, seasonal variations, and illumination changes that will occur over an annual cycle. Including this data in future training of the Mask R-CNN implementation is expected to reduce any bias that may exist in the current data set.
The vast majority of agricultural remote sensing applications that utilize multispectral imagery require several pre-processing techniques in order to provide a basis on which to accurately analyze data and provide meaningful information to the grower. This research takes these techniques and compresses them into a fully-automated data processing pipeline. This pipeline is implemented using a BeamIO TileDriver workflow, converting raw digital count to direct-georectified reflectance, ready for further processing to provide a geolocated information product for the grower.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.