At present, medical endoscopy is the main procedure for exploring the internal cavities of the human body. Developing methods for integrating image processing and endoscopic visualization improves image quality and accurately identifies cancerous abnormalities. This paper aims to improve the efficiency detection of gastrointestinal disease detection in the endoscopic video. A recognition method is proposed, which is based on a 3-D binary difference of micro blocks and consists of the following steps: 1) a sequence of frames is divided into 3-D cuboids; 2) inside each cuboid, patches of different sizes are built, which are used to obtain volumetric local binary templates; 3) the Hamming distance is calculated between a randomly selected pair of cuboids within each patch of the video clip, which is sequentially written to a separate vector descriptor; 4) for recognition, the method of machine learning is used. Experimental studies show that disease detection is performed accurately on the test dataset (Kvasir).
|