检索结果-内蒙古大学图书馆

2nd IEEE International Conference on Advances in Information Technology, ICAIT 2024

作者： Devi, M. Shyamala Hepzhibha Rachel, S. Janani, G. Irene, B. Joy Praisy, E. Humaira, S. Department of Computer Science and Engineering Panimalar Engineering College Tamilnadu Chennai India Department of Artificial Intelligence and Data Science Panimalar Engineering College Tamilnadu Chennai India

ISBN: (纸本)9798350383867

Assessments have demonstrated that the human lip and its motions provide a wealth of knowledge about the identity and substance of communication. However, due to large differences in illumination condition, head perspective, and background, obtaining strong and precise lip image segmentation in natural settings remains difficult. This paper recommends Deep Masked Input UNet that categorizes the Gender based on the lip segmentation with high precision. For implementation, 10,132 face images from the KAGGLE Lip Segmentation dataset were used. The dataset includes 5066 images of the face and 5066 segmented lip images. The proposed Deep Masked Input UNet starts by masking the original picture with a segmented lip image to create masked face images with lips segments. Utilizing the Relu Activation function, Deep Masked Input UNet with contracting and expansion was enabled. The masked lip images are fitted to the proposed Deep Masked Input UNet and traditional CNN models. The results show that the suggested Deep Masked Input UNet model performs better in lip segmentation and gender classification, with a high accuracy of 98.95%. © 2024 IEEE.

关键词： Image segmentation

来源：评论

学校读者我要写书评

暂无评论

Adaptive Cyber Defense Technique Based on Multiagent Reinforcement Learning Strategies

引用

Intelligent Automation & Soft Computing 2023年第6期36卷 2757-2771页

作者： Adel Alshamrani Abdullah Alshahrani Department of Cybersecurity College of Computer Science and EngineeringUniversity of JeddahJeddahSaudi Arabia Department of Computer Science and Artificial Intelligence College of Computer Science and EngineeringUniversity of JeddahJeddahSaudi Arabia

The static nature of cyber defense systems gives attackers a sufficient amount of time to explore and further exploit the vulnerabilities of information technology *** this paper,we investigate a problem where multiagent sys-tems sensing and acting in an environment contribute to adaptive cyber *** present a learning strategy that enables multiple agents to learn optimal poli-cies using multiagent reinforcement learning(MARL).Our proposed approach is inspired by the multiarmed bandits(MAB)learning technique for multiple agents to cooperate in decision making or to work *** study a MAB approach in which defenders visit a system multiple times in an alternating fash-ion to maximize their rewards and protect their *** find that this game can be modeled from an individual player’s perspective as a restless MAB *** discover further results when the MAB takes the form of a pure birth process,such as a myopic optimal policy,as well as providing environments that offer the necessary incentives required for cooperation in multiplayer projects.

关键词： Multiarmed bandits reinforcement learning multiagents intrusion detection systems

来源：评论

学校读者我要写书评

暂无评论

Farm-Level Smart Crop Recommendation Framework Using Machine Learning

引用

Annals of data science 2025年第1期12卷 117-140页

作者： Bhola, Amit Kumar, Prabhat Department of Computer Science and Engineering National Institute of Technology Patna Bihar Patna India

Agriculture is the primary source of food, fuel, and raw materials and is vital to any country’s economy. Farmers, the backbone of agriculture, primarily rely on instinct to determine what crops to plant in any given season. They are comfortable following customary farming practices and standards and are oblivious to the fact that crop yield is highly dependent on current environmental and soil conditions. Crop recommendations involve multifaceted factors such as weather, soil quality, crop production, market demand, and prices, making it crucial for farmers to make well-informed decisions. An improper or imprudent crop recommendation can affect them, their families, and the entire agricultural sector. Modern technologies like artificial intelligence, machine learning, and data science have emerged as efficient solutions to combat issues like declining crop production and lower profits. This research proposes a Smart Crop Recommendation framework that leverages machine learning to empower farmers to make informed decisions about optimal crop selection. The framework consists of two phases: crop filtration and yield prediction. Crops are filtered in the first phase using an artificial neural network based on local input parameters. The second phase estimates yield for filtered crops, considering the season, farm area, and location data. The final recommendation provides farmers with crops aimed at maximizing profit. The remarkable 99.10% accuracy of the framework is demonstrated through experimentation using artificial neural networks and the 0.99 R2 error metric for the random forest. The uniqueness of this framework lies in its distinctive focus on the farm level and its consideration of the challenges and various agricultural features that change over time. The experimental results affirm the effectiveness of the framework, and its lightweight nature enhances its practicality, making it an efficient real-time recommendation solution. © The Author(s), under exclusi

关键词： Crops

来源：评论

学校读者我要写书评

暂无评论

RTSA: A Run-Through Sparse Attention Framework for Video Transformer

引用

IEEE Transactions on computers 2025年第6期74卷 1949-1962页

作者： Wang, Xuhang Song, Zhuoran Qi, Chunyu Liu, Fangxin Jiang, Li Liang, Xiaoyao Naifeng, Jing Shanghai Jiao Tong University Department of Computer Science and Engineering Shanghai200240 China

In the realm of video understanding tasks, Video Transformer models (VidT) have recently exhibited impressive accuracy improvements in numerous edge devices. However, their deployment poses significant computational challenges for hardware. To address this, pruning has emerged as a promising approach to reduce computation and memory requirements by eliminating unimportant elements from the attention matrix. Unfortunately, existing pruning algorithms face a limitation in that they only optimize one of the two key modules on VidT's critical path: linear projection or self-attention. Regrettably, due to the variation in battery power in edge devices, the video resolution they generate will also change, which causes both linear projection and self-attention stages to potentially become bottlenecks, the existing approaches lack generality. Accordingly, we establish a Run-Through Sparse Attention (RTSA) framework that simultaneously sparsifies and accelerates two stages. On the algorithm side, unlike current methodologies conducting sparse linear projection by exploring redundancy within each frame, we extract extra redundancy naturally existing between frames. Moreover, for sparse self-attention, as existing pruning algorithms often provide either too coarse-grained or fine-grained sparsity patterns, these algorithms face limitations in simultaneously achieving high sparsity, low accuracy loss, and high speedup, resulting in either compromised accuracy or reduced efficiency. Thus, we prune the attention matrix at a medium granularity—sub-vector. The sub-vectors are generated by isolating each column of the attention matrix. On the hardware side, we observe that the use of distinct computational units for sparse linear projection and self-attention results in pipeline imbalances because of the bottleneck transformation between the two stages. To effectively eliminate pipeline stall, we design a RTSA architecture that supports sequential execution of both sparse linear pro

关键词： Vectors

来源：评论

学校读者我要写书评

暂无评论

Secure multimedia communication: advanced asymmetric key authentication with grayscale visual cryptography

引用

Mathematical Biosciences and engineering 2024年第3期21卷 4762-4778页

作者： Liu, Tao Vairagar, Shubhangi Adagale, Sushadevi Karthick, T. Karunya, Catherine Esther Blesswin, A. John Mary, G. Selva Tianjin Sino-German University of Applied Sciences Tianjin300350 China Department of Artificial Intelligence and Data Science Dr. D. Y. Patil Institute of Technology Pimpri Pune411018 India Department of Computer Engineering KJEI's Trinity Academy of Engineering Pune411048 India Department of Data Science and Business Systems School of Computing SRM Institute of Science and Technology Kattankulathur603203 India Computer Science and Engineering School of Computing SRM Institute of Science and Technology Kattankulathur603203 India Directorate of Learning and Development SRM Institute of Science and Technology Kattankulathur603203 India

The secure authentication of user data is crucial in various sectors, including digital banking, medical applications and e-governance, especially for images. Secure communication protects against data tampering and forgery, thereby bolstering the foundation for informed decision-making, whether managing traffic, enhancing public safety, or monitoring environmental conditions. Conventional visual cryptographic protocols offer solutions, particularly for color images, though they grapple with challenges such as high computational demands and reliance on multiple cover images. Additionally, they often require third-party authorization to verify the image integrity. On the other hand, visual cryptography offers a streamlined approach. It divides images into shares, where each pixel represented uniquely, thus allowing visual decryption without complex computations. The optimized multi-tiered authentication protocol (OMTAP), which is integrated with the visual sharing scheme (VSS), takes secure image sharing to the next level. It reduces share count, prioritizes image fidelity and transmission security, and introduces the self-verification of decrypted image integrity through asymmetric key matrix generators, thus eliminating external validation. Rigorous testing has confirmed OMTAP's robustness and broad applicability, thereby ensuring that decrypted images maintain their quality with a peak signal-to-noise ratio (PSNR) of 40 dB and full integrity at the receiver's end. © 2024 American Institute of Mathematical sciences. All rights reserved.

关键词： Medical applications

来源：评论

学校读者我要写书评

暂无评论

Optimised hybrid classification approach for rice leaf disease prediction with proposed texture features

引用

Journal of Control and Decision 2024年第1期11卷 84-97页

作者： Sakhamuri Sridevi K.Kiran Kumar Department of Computer Science and Engineering Koneru Lakshmaiah Education FoundationVaddeswaramIndia

This paper aims to frame a new rice disease prediction model that included three major ***,median filtering(MF)is deployed during pre-processing and then‘proposed Fuzzy Means Clustering(FCM)based segmentation’is *** that,‘Discrete Wavelet Transform(DWT),Scale-Invariant Feature Transform(SIFT)and low-level features(colour and shape),Proposed local Binary Pattern(LBP)based features’are extracted that are classified via‘MultiLayer Perceptron(MLP)and Long Short Term Memory(LSTM)’and predicted outcomes are *** exact prediction,this work intends to optimise the weights of LSTM using Inertia Weighted Salp Swarm Optimisation(IW-SSO)***,the development of IW-SSO method is established on varied metrics.

关键词： Rice disease improved fuzzy hybrid classifiers optimised LSTM IW-SSO algorithm

来源：评论

学校读者我要写书评

暂无评论

Convergence of various computer-aided systems for breast tumor diagnosis: a comparative insight

引用

Multimedia Tools and Applications 2025年第16期84卷 16709-16756页

作者： Singh, Saket Kumar Patnaik, K. Sridhar Department of Computer Science and Engineering Birla Institute of Technology Mesra Ranchi835215 India

Breast Cancer, with an expected 42,780 deaths in the US alone in 2024, is one of the most prevalent types of cancer. The death toll due to breast cancer would be very high if it were to be totaled up globally. Early detection of breast cancer is the only way to decrease the mortality caused by it. In order to diagnose breast cancer, even the most competent and qualified pathologists and radiologists have to examine hundreds of high-resolution images, which is a massive burden on them. Compared to the number of cases, very few experts are available to manage this burden. Additionally, as humans are more prone to mistakes, the likelihood of finding false positive cases is also high. Numerous AI techniques, including machine learning and deep learning, are ideally suited to address these issues, inspiring many researchers to introduce novel computer-aided detection systems. In this study, we have comprehensively reviewed pre-existing literature aimed at developing computer-aided systems based on using machine learning, deep learning, and vision transformers to identify and classify breast cancer. We have discussed numerous imaging modalities for detecting breast cancer, along with the widely used data pre-processing approaches, machine learning and deep learning models, as well as ensemble learning methods suitable for the task. Popular datasets and their sources are also listed for future referencing. Finally, we have identified a few gaps and addressed potential future research directions with an intent of aiding researchers select approaches tailored to case-specific needs. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： computer aided diagnosis

来源：评论

学校读者我要写书评

暂无评论

MMInstruct: a high-quality multi-modal instruction tuning dataset with extensive diversity

引用

science China(Information sciences) 2024年第12期67卷 36-51页

作者： Yangzhou LIU Yue CAO Zhangwei GAO Weiyun WANG Zhe CHEN Wenhai WANG Hao TIAN Lewei LU Xizhou ZHU Tong LU Yu QIAO Jifeng DAI School of Computer Science Nanjing University School of Electronic Information and Electrical Engineering Shanghai Jiao Tong University Shanghai AI Laboratory School of Computer Science Fudan University Department of Information Engineering The Chinese University of Hong Kong SenseTime Research Department of Electronic Engineering Tsinghua University

Despite the effectiveness of vision-language supervised fine-tuning in enhancing the performance of vision large language models(VLLMs), existing visual instruction tuning datasets include the following limitations.(1) Instruction annotation quality: despite existing VLLMs exhibiting strong performance,instructions generated by those advanced VLLMs may still suffer from inaccuracies, such as hallucinations.(2) Instructions and image diversity: the limited range of instruction types and the lack of diversity in image data may impact the model's ability to generate diversified and closer to real-world scenarios outputs. To address these challenges, we construct a high-quality, diverse visual instruction tuning dataset MMInstruct,which consists of 973k instructions from 24 domains. There are four instruction types: judgment, multiplechoice, long visual question answering, and short visual question answering. To construct MMInstruct, we propose an instruction generation data engine that leverages GPT-4V, GPT-3.5, and manual correction. Our instruction generation engine enables semi-automatic, low-cost, and multi-domain instruction generation at 1/6 the cost of manual construction. Through extensive experiment validation and ablation experiments,we demonstrate that MMInstruct could significantly improve the performance of VLLMs, e.g., the model fine-tuning on MMInstruct achieves new state-of-the-art performance on 10 out of 12 benchmarks. The code and data shall be available at https://***/yuecao0119/MMInstruct.

关键词： instruction tuning multi-modal multi-domain dataset vision large language model

来源：评论

学校读者我要写书评

暂无评论

Underwater object detection based on enhanced YOLOv4 architecture

引用

Multimedia Tools and Applications 2024年第18期83卷 53759-53783页

作者： Liu, Ching-Hua Lin, Chang Hong Department of Electronic and Computer Engineering National Taiwan University of Science and Technology Taiwan

Object detection and image restoration pose significant challenges in deep learning and computer vision. These tasks are widely employed in various applications, and there is an increasing demand for specialized environments where images are prone to blur or noise, which can adversely affect subsequent results. In recent years, significant breakthroughs have been achieved in object detection performance. While some previously proposed methods prioritize high accuracy at the cost of longer inference times, others prioritize speed. Therefore, it is crucial to design an efficient network architecture that maintains both inference speed and high accuracy. This research proposes a network architecture for underwater object detection with an attention mechanism. The proposed approach differentiates itself from other methods by employing a deblurring network as a preprocessing step to restore and enhance the image quality of the underwater dataset. Additionally, in the feature extraction stage of the detection network, channel and spatial feature information are individually enhanced. These adaptive attention features are then integrated into a multi-scale feature fusion. Finally, the cross-stage local method is combined to improve the learning ability of the convolutional neural network while reducing the size of the model. Based on the experimental results, our proposed model architecture achieves leading accuracy and strikes a favorable balance in terms of model size compared to previously proposed methods. Based on our structure, the metrics of AP and AP50 reach 66.8 and 87.6, respectively. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2023.

关键词： Image reconstruction

来源：评论

学校读者我要写书评

暂无评论

The most tenuous group query

引用

Frontiers of computer science 2023年第2期17卷 197-208页

作者： Na LI Huaijie ZHU Wenhao LU Ningning CUI Wei LIU Jian YIN Jianliang XU Wang-Chien LEE School of Computer Science and Engineering Sun Yat-Sen UniversityGuangzhou 510006China Laboratory of Big Data Analysis and Processing Guangzhou 510006China School of Artificial Intelligence Sun Yat-Sen UniversityGuangzhou 510006China Department of Computer Science Anhui UniversityHefei 230601China Department of Computer Science Hong Kong Baptist UniversityHong Kong 999077China Department of Computer Science The Pennsylvania State UniversityState College 19019USA

Rtecently a lot of works have been investigating to find the tenuous groups,i.e.,groups with few social interactions and weak relationships among members,for reviewer selection and psycho-educational group ***,the metrics(e.g.,k-triangle,k-line,and k-tenuity)used to measure the tenuity,require a suitable k value to be specified which is difficult for users without background ***,in this paper we formulate the most tenuous group(MTG)query in terms of the group distance and average group distance of a group measuring the tenuity to eliminate the influence of parameter k on the tenuity of the *** address the MTG problem,we first propose an exact algorithm,namely MTGVDIS,which takes priority to selecting those vertices whose vertex distance is large,to generate the result group,and also utilizes effective filtering and pruning *** MTGVDIS is not fast enough,we design an efficient exact algorithm,called MTG-VDGE,which exploits the degree metric to sort the vertexes and proposes a new combination order,namely degree and reverse based branch and bound(DRBB).MTG-VDGE gives priority to those vertices with small *** a large p,we further develop an approximation algorithm,namely MTG-VDLT,which discards candidate attendees with high degree to reduce the number of vertices to be *** experimental results on real datasets manifest that the proposed algorithms outperform existing approaches on both efficiency and group tenuity.

关键词： tenuous group pruning strategy social network group query

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：