With advancements in technology, the study of data hiding (DH) in images has become increasingly important. In this paper, we introduce a novel data hiding scheme that employs a voting strategy to predict pixels base...
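As a rough illustration of neighbour-voting pixel prediction of the kind this abstract describes, the minimal sketch below predicts a pixel from its four adjacent neighbours by plurality vote with a median fallback; the neighbourhood, tie-breaking rule, and all names are assumptions for illustration, not the paper's actual scheme.

```python
import numpy as np

def vote_predict(img, i, j):
    """Predict pixel (i, j) from its four adjacent neighbours.

    Hypothetical illustration: each neighbour 'votes' with its value; the
    most frequent value wins, and ties fall back to the median vote.
    """
    votes = [img[i - 1, j], img[i + 1, j], img[i, j - 1], img[i, j + 1]]
    values, counts = np.unique(votes, return_counts=True)
    if counts.max() > 1:                  # a plurality of neighbours agree
        return int(values[np.argmax(counts)])
    return int(np.median(votes))          # no agreement: use the median vote

# Toy usage on a small grayscale block
rng = np.random.default_rng(0)
block = rng.integers(0, 256, size=(5, 5)).astype(int)
pred = vote_predict(block, 2, 2)
err = int(block[2, 2]) - pred             # prediction error an embedder could expand
print(pred, err)
```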
People who have trouble communicating verbally are often dependent on sign language, which can be difficult for most people to understand, making interaction with them difficult. A Sign Language Recognition (SLR) system takes an input expression from a hearing- or speaking-impaired person and outputs it in the form of text or voice to a normal person. Existing work on Sign Language Recognition systems has some drawbacks, such as a lack of large datasets and of datasets with a range of backgrounds and skin tones. This research focuses on Sign Language Recognition to overcome these drawbacks. Most importantly, we use our proposed Convolutional Neural Network (CNN) model, “ConvNeural”, to train on our datasets. We also develop our own datasets, “BdSL_OPSA22_STATIC1” and “BdSL_OPSA22_STATIC2”, both of which have ambiguous backgrounds. “BdSL_OPSA22_STATIC1” and “BdSL_OPSA22_STATIC2” include images of Bangla characters and numerals, a total of 24,615 and 8,437 images, respectively. The “ConvNeural” model outperforms the pre-trained models, with an accuracy of 98.38% for “BdSL_OPSA22_STATIC1” and 92.78% for “BdSL_OPSA22_STATIC2”. For the “BdSL_OPSA22_STATIC1” dataset, we obtain precision, recall, F1-score, sensitivity, and specificity of 96%, 95%, 95%, 99.31%, and 95.78%, respectively. Similarly, for the “BdSL_OPSA22_STATIC2” dataset, we achieve precision, recall, F1-score, sensitivity, and specificity of 90%, 88%, 88%, 100%, and 100%, respectively.
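To make the classification setup concrete, here is a minimal CNN sketch for static sign-image classification; the layer sizes, 64x64 input resolution, and class count are assumptions for illustration and are not the published “ConvNeural” architecture.

```python
import torch
import torch.nn as nn

class SmallSignCNN(nn.Module):
    """Minimal CNN sketch for static sign-image classification."""
    def __init__(self, num_classes: int):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),    # 64 -> 32
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),   # 32 -> 16
            nn.Conv2d(64, 128, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),  # 16 -> 8
        )
        self.classifier = nn.Sequential(
            nn.Flatten(), nn.Linear(128 * 8 * 8, 256), nn.ReLU(),
            nn.Dropout(0.5), nn.Linear(256, num_classes),
        )

    def forward(self, x):
        return self.classifier(self.features(x))

# Forward pass on a dummy batch of 64x64 RGB sign images
model = SmallSignCNN(num_classes=36)   # number of Bangla characters + numerals assumed
logits = model(torch.randn(4, 3, 64, 64))
print(logits.shape)                    # torch.Size([4, 36])
```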
Automatic skin lesion subtyping is a crucial step for diagnosing and treating skin cancer and acts as a first-level diagnostic aid for medical experts. Although deep learning is, in general, very effective in image processing tasks, there are notable areas of the processing pipeline in the dermoscopic image regime that can benefit from refinement. Our work identifies two such areas for improvement. First, most benchmark dermoscopic datasets for skin cancers and lesions are highly imbalanced due to the relative rarity or commonality of specific lesion types. Deep learning methods tend to exhibit biased performance in favor of the majority classes on such datasets, leading to poor generalization. Second, dermoscopic images can contain irrelevant information in the form of skin color, hair, veins, etc.; hence, limiting the information available to a neural network by retaining only relevant portions of an input image has been successful in prompting the network towards learning task-relevant features and thereby improving its performance. Hence, this research work augments the skin lesion characterization pipeline in the following ways. First, it balances the dataset to overcome sample-size biases. Two balancing methods, the synthetic minority oversampling technique (SMOTE) and reweighting, are applied, compared, and analyzed. Second, a lesion segmentation stage is introduced before classification, in addition to a preprocessing stage, to retain only the region of interest. A baseline segmentation approach based on Bi-Directional ConvLSTM U-Net is improved using conditional adversarial training for enhanced segmentation performance. Finally, the classification stage is implemented using EfficientNets, where the B2 variant is used to benchmark and choose between the balancing and segmentation techniques, and the architecture is then scaled through to B7 to analyze the performance boost in lesion classification. From these experiments, we find
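The two balancing strategies compared in this abstract can be illustrated with a small sketch on synthetic tabular data (the paper applies them to image data; the dataset, imbalance ratio, and variable names here are stand-ins):

```python
import numpy as np
from collections import Counter
from sklearn.datasets import make_classification
from sklearn.utils.class_weight import compute_class_weight
from imblearn.over_sampling import SMOTE   # pip install imbalanced-learn

# Synthetic stand-in for an imbalanced lesion dataset (95% vs 5%)
X, y = make_classification(n_samples=2000, n_features=20,
                           weights=[0.95, 0.05], random_state=0)
print("original:", Counter(y))

# Option 1: SMOTE - synthesise new minority samples by interpolating neighbours
X_res, y_res = SMOTE(random_state=0).fit_resample(X, y)
print("after SMOTE:", Counter(y_res))

# Option 2: reweighting - keep the data, scale each class's loss contribution
classes = np.unique(y)
weights = compute_class_weight(class_weight="balanced", classes=classes, y=y)
print("per-class loss weights:", dict(zip(classes, np.round(weights, 2))))
```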
This study compares the performance of active and semi-active suspension systems with that of the passive system. Firstly, a mathematical model of a quarter vehicle for passive, active, and semi-active systems...
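For reference, a minimal quarter-car sketch of the passive baseline is shown below (a standard two-degree-of-freedom sprung/unsprung mass model; all parameter values and the road input are illustrative assumptions, not the study's data, and the active/semi-active variants would add a controlled force term to the suspension force):

```python
import numpy as np
from scipy.integrate import solve_ivp

# Quarter-car parameters (illustrative values)
ms, mu = 300.0, 40.0          # sprung / unsprung mass [kg]
ks, cs = 18_000.0, 1_200.0    # suspension stiffness [N/m] and damping [N s/m]
kt = 180_000.0                # tyre stiffness [N/m]

def road(t):
    """Road profile: a 5 cm step bump at t = 1 s."""
    return 0.05 if t >= 1.0 else 0.0

def passive_quarter_car(t, x):
    zs, vs, zu, vu = x                        # body pos/vel, wheel pos/vel
    f_susp = ks * (zs - zu) + cs * (vs - vu)  # passive spring-damper force
    f_tyre = kt * (zu - road(t))              # tyre deflection force
    return [vs, -f_susp / ms, vu, (f_susp - f_tyre) / mu]

sol = solve_ivp(passive_quarter_car, (0.0, 4.0), [0, 0, 0, 0], max_step=1e-3)
print("peak body displacement [m]:", float(np.max(np.abs(sol.y[0]))))
```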
Video question answering (VideoQA) is a challenging yet important task that requires a joint understanding of low-level video content and high-level textual semantics. Despite the promising progress of existing efforts, recent studies revealed that current VideoQA models mostly tend to over-rely on the superficial correlations rooted in the dataset bias while overlooking the key video content, thus leading to unreliable results. Effectively understanding and modeling the temporal and semantic characteristics of a given video for robust VideoQA is crucial but, to our knowledge, has not been well investigated. To fill the research gap, we propose a robust VideoQA framework that can effectively model the cross-modality fusion and enforce the model to focus on the temporal and global content of videos when making a QA decision instead of exploiting the shortcuts in datasets. Specifically, we design a self-supervised contrastive learning objective to contrast the positive and negative pairs of multimodal input, where the fused representation of the original multimodal input is enforced to be closer to that of the intervened input based on video perturbation. We expect the fused representation to focus more on the global context of videos rather than some static keyframes. Moreover, we introduce an effective temporal order regularization to enforce the inherent sequential structure of videos for video representation. We also design a Kullback-Leibler divergence-based perturbation invariance regularization of the predicted answer distribution to improve the robustness of the model against temporal content perturbation of videos. Our method is model-agnostic and can be easily compatible with various VideoQA backbones. Extensive experimental results and analyses on several public datasets show the advantage of our method over the state-of-the-art methods in terms of both accuracy and robustness.
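A minimal sketch of the KL-divergence-based perturbation-invariance idea described above: the answer distribution predicted from a perturbed video is pushed toward the distribution predicted from the original video. The exact perturbation, loss weighting, and function names are assumptions, not the paper's implementation.

```python
import torch
import torch.nn.functional as F

def perturbation_invariance_loss(logits_orig, logits_pert):
    """KL regulariser: keep the perturbed-video answer distribution close
    to the original-video answer distribution."""
    p = F.softmax(logits_orig.detach(), dim=-1)       # reference distribution (no grad)
    log_q = F.log_softmax(logits_pert, dim=-1)        # perturbed prediction
    return F.kl_div(log_q, p, reduction="batchmean")  # KL(p || q), batch-averaged

# Toy usage: random "answer logits" for a batch of 8 questions over 1000 answers
logits_orig = torch.randn(8, 1000)
logits_pert = logits_orig + 0.1 * torch.randn(8, 1000)  # stand-in for temporal perturbation
print(float(perturbation_invariance_loss(logits_orig, logits_pert)))
```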
The rapid development of the Internet has led to the widespread dissemination of manipulated facial images, significantly impacting people's daily lives. With the continuous advancement of Deepfake technology, generated counterfeit facial images have become increasingly difficult to distinguish, and there is an urgent need for a more robust and convincing detection method. Current detection methods mainly operate in the spatial domain or transform the spatial domain into other domains for analysis. With the emergence of transformers, some researchers have also combined traditional convolutional networks with transformers for detection. This paper explores the artifacts left by Deepfakes in various domains and, based on this exploration, proposes a detection method that uses the steganalysis rich model to extract high-frequency noise that complements spatial features. We design two main modules, built on traditional convolutional neural networks, to fully exploit the interaction between these two aspects. The first is a multi-scale mixed feature attention module, which introduces artifacts from the high-frequency noise into spatial textures, thereby enhancing the model's learning of spatial texture features. The second is a multi-scale channel attention module, which reduces the impact of background noise by weighting the features. Our proposed method was evaluated on mainstream datasets, and extensive experimental results demonstrate its effectiveness in detecting Deepfake-forged faces, outperforming the majority of existing methods.
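The high-frequency noise stream mentioned above is typically obtained by convolving the image with fixed steganalysis rich model (SRM) high-pass kernels. The sketch below uses three commonly cited SRM residual filters on a grayscale crop; the paper's exact filter bank and input handling may differ.

```python
import torch
import torch.nn as nn

# Three standard SRM high-pass kernels (a subset of the full 30-filter bank)
srm = torch.tensor([
    [[0, 0, 0, 0, 0], [0, -1, 2, -1, 0], [0, 2, -4, 2, 0],
     [0, -1, 2, -1, 0], [0, 0, 0, 0, 0]],
    [[-1, 2, -2, 2, -1], [2, -6, 8, -6, 2], [-2, 8, -12, 8, -2],
     [2, -6, 8, -6, 2], [-1, 2, -2, 2, -1]],
    [[0, 0, 0, 0, 0], [0, 0, 0, 0, 0], [0, 1, -2, 1, 0],
     [0, 0, 0, 0, 0], [0, 0, 0, 0, 0]],
], dtype=torch.float32)
srm = srm / torch.tensor([4.0, 12.0, 2.0]).view(3, 1, 1)   # usual normalisation

class SRMNoiseExtractor(nn.Module):
    """Frozen convolution turning a grayscale face crop into 3 noise residual maps."""
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(1, 3, kernel_size=5, padding=2, bias=False)
        self.conv.weight.data = srm.unsqueeze(1)   # shape (3 out, 1 in, 5, 5)
        self.conv.weight.requires_grad_(False)     # fixed filters, not learned

    def forward(self, x):
        return self.conv(x)

noise = SRMNoiseExtractor()(torch.rand(2, 1, 256, 256))  # dummy face crops
print(noise.shape)                                        # torch.Size([2, 3, 256, 256])
```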
Summarizing lengthy text involves distilling crucial information into a concise form by covering the key events in the source text. Previous researchers mostly explored supervised approaches for the task, but due ...
Differential Evolution (DE) is a potent stochastic evolutionary optimization algorithm garnering increasing research attention. Over the years, it has been found applicable in solving diverse real-world problems. DE e...
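For readers unfamiliar with DE, a compact textbook-style DE/rand/1/bin sketch follows (mutation, binomial crossover, greedy selection); the control parameters and the toy objective are illustrative choices, not taken from any specific variant in the surveyed literature.

```python
import numpy as np

def differential_evolution(f, bounds, pop_size=30, F=0.8, CR=0.9, iters=200, seed=0):
    """Classic DE/rand/1/bin minimiser sketch."""
    rng = np.random.default_rng(seed)
    lo, hi = np.asarray(bounds, dtype=float).T
    dim = len(bounds)
    pop = rng.uniform(lo, hi, size=(pop_size, dim))
    fit = np.array([f(x) for x in pop])
    for _ in range(iters):
        for i in range(pop_size):
            idx = [j for j in range(pop_size) if j != i]
            a, b, c = pop[rng.choice(idx, 3, replace=False)]
            mutant = np.clip(a + F * (b - c), lo, hi)   # mutation: base + scaled difference
            cross = rng.random(dim) < CR
            cross[rng.integers(dim)] = True             # guarantee at least one mutant gene
            trial = np.where(cross, mutant, pop[i])     # binomial crossover
            f_trial = f(trial)
            if f_trial <= fit[i]:                       # greedy selection
                pop[i], fit[i] = trial, f_trial
    return pop[fit.argmin()], fit.min()

# Minimise the 2-D sphere function as a toy example
best_x, best_f = differential_evolution(lambda x: float(np.sum(x**2)), [(-5, 5), (-5, 5)])
print(best_x, best_f)
```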
Autism Spectrum Disorder (ASD) is a complicated neuro-developmental disorder characterized by abnormal activities related to brain development. ASD generally affects the physical impression of the face as well as the growth of ...
The user’s intent to seek online information has been an active area of research in user profiling. User profiling considers user characteristics, behaviors, activities, and preferences to sketch user intentions and interests. User characteristics can help capture implicit and explicit preferences and intentions for effective user-centric and customized content. The user’s complete online experience in seeking information is a blend of activities such as searching, verifying, and sharing it on social media. However, a combination of multiple behaviors in profiling users has yet to be explored. This research takes a novel approach and explores user intent types based on multidimensional online behavior in information seeking. It explores information search, verification, and dissemination behavior and identifies diverse types of users based on their online engagement using machine learning. This research proposes a generic user profile template that explains user characteristics based on internet experience and uses it as ground truth for data annotation. User feedback is based on online behavior and practices collected using a survey instrument. The participants include both males and females from different occupation sectors and age groups. The data collected are subject to feature engineering, and the significant features are presented to unsupervised machine learning methods to identify user intent classes, or profiles, and their characteristics. Different techniques are evaluated, and the K-Means clustering method successfully generates five user groups observing different user characteristics, with an average silhouette of 0.36 and a distortion score of … . The average is computed to identify user intent type characteristics. The user intent classes are then further generalized to create a user intent template with an Inter-Rater Reliability of 75%. This research successfully extracts different user types based on th
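The clustering and evaluation step described above can be sketched as follows; the feature matrix is a random stand-in for the engineered survey features, and the cluster count of five mirrors the abstract, while everything else (scaling, metrics reported as "distortion" via inertia) is an assumption for illustration.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

# Stand-in for the engineered survey features (respondents x behavioural features)
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 12))            # hypothetical feature matrix
X = StandardScaler().fit_transform(X)     # scale before distance-based clustering

km = KMeans(n_clusters=5, n_init=10, random_state=0).fit(X)   # five user groups
labels = km.labels_

print("silhouette:", round(silhouette_score(X, labels), 3))    # cohesion vs separation
print("distortion (inertia):", round(km.inertia_, 1))          # within-cluster SSE
print("cluster sizes:", np.bincount(labels))
```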