Osteoarthritis is a prevalent degenerative knee joint disease, causing considerable pain and mobility issues, thus impacting the independence and quality of life of millions. Deep learning and Machine Learning techniq...
Multimedia technology is advancing every day, and online editing software makes it very easy to duplicate, distribute, and modify digital images. Image security and privacy are therefore critical aspects of the multimedia revolution, and digital image watermarking offers a means of image authentication. Several traditional watermarking approaches protect images in the spatial and transform domains, but these techniques are less resistant to repeated attacks. Deep learning-based watermarking has recently gained traction, greatly improving the security of images in a variety of common applications. This study presents a robust and secure digital watermarking method for multimedia content protection and authentication. The watermark image is first transformed using the hybrid wavelet transform and then encrypted using a chaos encryption algorithm. The cover image is simultaneously subjected to neighborhood-based feature extraction. Using the extracted features, a novel Adaptive Gannet Optimization Algorithm (AGOA) determines the optimal embedding location. The watermark image is then embedded and extracted within the identified optimal region using a hybrid generative adversarial network-based long short-term memory (GAN-LSTM) approach. Decryption and inverse transformation are then applied to recover the original watermark image. The proposed method is compared against several previous methods, such as DNN, Deep-ANN, and Deep-CNN. This technique improves multimedia content protection and authentication by providing robust and secure watermarking. The proposed method for digital image watermarking produced a peak signal-to-noise ratio of 46.412 and a mean square error of 24.512. Therefore, the proposed method performs well in digital image wa
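The two quality metrics quoted in the abstract above can be illustrated with a minimal sketch. It assumes 8-bit images (peak value 255) and the conventional definitions of mean square error and peak signal-to-noise ratio; the abstract does not state which normalization the authors used, and the function names `mse`, `psnr_from_mse`, and `psnr` are hypothetical.

```python
import numpy as np

def mse(original, watermarked):
    """Mean square error between two images of equal shape."""
    a = np.asarray(original, dtype=np.float64)
    b = np.asarray(watermarked, dtype=np.float64)
    return float(np.mean((a - b) ** 2))

def psnr_from_mse(mse_value, peak=255.0):
    """PSNR in dB for a given MSE; `peak` = 255 assumes 8-bit pixels."""
    if mse_value == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(peak ** 2 / mse_value))

def psnr(original, watermarked, peak=255.0):
    """PSNR in dB between a cover image and its watermarked version."""
    return psnr_from_mse(mse(original, watermarked), peak)

# Toy example: a flat 4x4 cover image with one pixel changed by 4 levels.
cover = np.full((4, 4), 100, dtype=np.uint8)
marked = cover.copy()
marked[0, 0] = 104
print(mse(cover, marked), psnr(cover, marked))
```

Under this 8-bit convention, `psnr_from_mse(24.512)` evaluates to roughly 34.2 dB, so the reported pair of values (46.412 and 24.512) was presumably obtained under a different normalization or averaging scheme that the abstract does not specify.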
The stability and security of the Internet's routing infrastructure, largely dependent on the Border Gateway Protocol (BGP), are critical for ensuring reliable global connectivity. Anomalous routing behavior can d...
Edge computing (EC) has emerged as a viable solution for resource-intensive Internet of Things (IoT) applications seeking low-latency services at the network edge. However, the limited computing power of edge servers ...
This research study focuses on video categorization, which is a crucial area of computer vision with uses in entertainment, education, and surveillance. Convolutional Neural Networks (CNNs) are used in a two-stage app...
Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination, hindering accurate three-dimensional lesion reconstruction by surgical *** study proposes a novel end-to-end disparity estimation model to address these *** approach combines a Pseudo-Siamese neural network architecture with pyramid dilated convolutions, integrating multi-scale image information to enhance robustness against lighting *** study introduces a Pseudo-Siamese structure-based disparity regression model that simplifies left-right image comparison, improving accuracy and *** model was evaluated using a dataset of stereo endoscopic videos captured by the Da Vinci surgical robot, comprising simulated silicone heart sequences and real heart video *** results demonstrate significant improvement in the network's resistance to lighting interference without substantially increasing ***, the model exhibited faster convergence during training, contributing to overall performance *** study advances endoscopic image processing accuracy and has potential implications for surgical robot applications in complex environments.
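The idea of gathering multi-scale context with pyramid dilated convolutions can be sketched in one dimension. This is only an illustration under stated assumptions: the abstract does not give the dilation rates, kernel sizes, or how the branches are fused, so the rates `(1, 2, 4)`, the stacking of branch outputs, and the function names `dilated_conv1d` and `pyramid_features` are hypothetical.

```python
import numpy as np

def dilated_conv1d(signal, kernel, dilation):
    """Same-length 1D correlation with `dilation` samples between kernel taps."""
    signal = np.asarray(signal, dtype=np.float64)
    n, k = len(signal), len(kernel)
    span = (k - 1) * dilation            # distance between first and last tap
    pad_left = span // 2
    padded = np.pad(signal, (pad_left, span - pad_left))
    out = np.zeros(n)
    for i, w in enumerate(kernel):
        start = i * dilation
        out += w * padded[start:start + n]
    return out

def pyramid_features(signal, kernel, rates=(1, 2, 4)):
    """Apply the same kernel at several dilation rates and stack the results."""
    return np.stack([dilated_conv1d(signal, kernel, r) for r in rates])

# An impulse shows which input positions each branch can "see".
impulse = np.zeros(9)
impulse[4] = 1.0
features = pyramid_features(impulse, kernel=[1.0, 1.0, 1.0])
print(features)
```

With a three-tap kernel, a branch at dilation rate d covers a window of 2d + 1 samples, so larger rates see wider context without adding weights.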
This research work focuses on food recognition, specifically the identification of ingredients from food images. The developed model includes two stages: 1) feature extraction; 2) classification. Initia...
The dandelion algorithm (DA) is a recently developed intelligent optimization algorithm for function optimization problems. Many of DA's parameters must be set by experience, and such settings might not be appropriate for all optimization problems. A self-adapting and efficient dandelion algorithm is proposed in this work to reduce the number of DA's parameters and simplify its structure. Only the normal sowing operator is retained, while the other operators are discarded. An adaptive seeding radius strategy is designed for the core dandelion. The results show that the proposed algorithm achieves better performance on the standard test functions with less time consumption than its competitive peers. In addition, the proposed algorithm is applied to feature selection for credit card fraud detection (CCFD), and the results indicate that it can obtain higher classification and detection performance than the state-of-the-art methods.
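A search loop built around a single sowing operator and an adaptive radius might look like the sketch below. It is not the authors' update rule, which the abstract does not give: the grow-on-improvement and shrink-on-failure factors, the uniform sampling of seeds, the sphere test function, and every name here (`adaptive_sowing_search`, `grow`, `shrink`) are assumptions chosen for illustration.

```python
import numpy as np

def sphere(x):
    """Standard sphere test function, minimized at the origin."""
    return float(np.sum(x ** 2))

def adaptive_sowing_search(f, dim=5, n_seeds=20, iters=200,
                           lower=-5.0, upper=5.0,
                           grow=1.1, shrink=0.9, seed=0):
    """Keep one core solution, sow seeds around it, and adapt the radius."""
    rng = np.random.default_rng(seed)
    core = rng.uniform(lower, upper, dim)
    best = f(core)
    history = [best]
    radius = (upper - lower) / 2.0
    for _ in range(iters):
        # Sowing: scatter seeds uniformly within the current radius.
        seeds = core + rng.uniform(-radius, radius, size=(n_seeds, dim))
        seeds = np.clip(seeds, lower, upper)
        values = np.array([f(s) for s in seeds])
        i = int(values.argmin())
        if values[i] < best:
            # Assumed rule: widen the radius after an improvement.
            core, best = seeds[i], float(values[i])
            radius *= grow
        else:
            # Assumed rule: narrow the radius when no seed improves.
            radius *= shrink
        history.append(best)
    return core, best, history

core, best, history = adaptive_sowing_search(sphere)
print(best)
```

Because the core is replaced only when a seed is strictly better, the best value never worsens from one iteration to the next; the radius is the only quantity that adapts, which is the kind of parameter reduction the abstract describes.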
Smart applications are becoming more powerful and cheaper owing to advances in sensor technology. In this chapter, we consider a smart greenhouse application. The important parameters of the smart gree...
Video question answering (VideoQA) is a challenging yet important task that requires a joint understanding of low-level video content and high-level textual semantics. Despite the promising progress of existing efforts, recent studies revealed that current VideoQA models tend to over-rely on superficial correlations rooted in dataset bias while overlooking the key video content, thus leading to unreliable results. Effectively understanding and modeling the temporal and semantic characteristics of a given video for robust VideoQA is crucial but, to our knowledge, has not been well investigated. To fill this research gap, we propose a robust VideoQA framework that effectively models cross-modality fusion and forces the model to focus on the temporal and global content of videos when making a QA decision instead of exploiting shortcuts in datasets. Specifically, we design a self-supervised contrastive learning objective to contrast the positive and negative pairs of multimodal input, where the fused representation of the original multimodal input is enforced to be closer to that of the intervened input based on video perturbation. We expect the fused representation to focus more on the global context of videos rather than some static keyframes. Moreover, we introduce an effective temporal order regularization to enforce the inherent sequential structure of videos for video representation. We also design a Kullback-Leibler divergence-based perturbation invariance regularization of the predicted answer distribution to improve the robustness of the model against temporal content perturbation of videos. Our method is model-agnostic and readily compatible with various VideoQA backbones. Extensive experimental results and analyses on several public datasets show the advantage of our method over the state-of-the-art methods in terms of both accuracy and robustness.
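The Kullback-Leibler divergence-based perturbation invariance term can be illustrated with a small sketch. It assumes the model produces answer logits for both the original and the perturbed video, and that the penalty is KL(p_original || p_perturbed); the abstract specifies neither the direction of the divergence nor its weighting, and the function names are hypothetical.

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over the last axis."""
    z = np.asarray(logits, dtype=np.float64)
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) per example, averaged over the batch."""
    p = np.clip(p, eps, 1.0)
    q = np.clip(q, eps, 1.0)
    per_example = np.sum(p * (np.log(p) - np.log(q)), axis=-1)
    return float(np.mean(per_example))

def perturbation_invariance_loss(logits_original, logits_perturbed):
    """Penalize changes in the answer distribution caused by video perturbation."""
    return kl_divergence(softmax(logits_original), softmax(logits_perturbed))

# Toy batch: two questions, four candidate answers each.
logits_clean = np.array([[2.0, 0.5, 0.1, -1.0], [0.0, 1.5, 0.2, 0.3]])
logits_shuffled = np.array([[1.2, 0.9, 0.1, -0.5], [0.1, 1.4, 0.2, 0.3]])
print(perturbation_invariance_loss(logits_clean, logits_shuffled))
```

The term is zero when the perturbation leaves the predicted answer distribution unchanged and grows as the two distributions diverge, so minimizing it alongside the QA loss pushes predictions to be stable under the perturbation.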