检索结果-内蒙古大学图书馆

2nd International Conference on Advancement in Computation and computer Technologies, InCACCT 2024

作者： Shyamala Devi, M. Irene, B. Kanagavalli, A. Kousika, A. Joy Praisy, E. Humaira, S. Panimalar Engineering College Department of Computer Science and Engineering Tamilnadu Chennai India Panimalar Engineering College Department of Artificial Intelligence and Data Science Tamilnadu Chennai India

ISBN: (纸本)9798350371314

The manufacturing and dissemination of spores is the main purpose of sporocarps which is the specialized type of configuration found in freshwater plants of the Salviniales family. Though sporocarp are mainly used for reproduction, some of them are consumable by the animals and birds for the extraction of rich nutrients. Sporocarp categorization is an extremely difficult task as various species of sporocarp share significant appearance. Some sporocarp may be poisonous that lead to brain damage and infection to the animals. This paper proposes Normalized AlexNet that classifies the edible and poisonous sporocarps with high efficiency. For the sake of this investigation, 1800 sporocarp images from the Sporocarp edible dataset were used. The best CNN model was chosen by applying the Sporocarp edible dataset to the current CNN models. With an accuracy of 89.67%, AlexNet performs better than the other CNN models. AlexNet has been chosen to fine-tune its single dense layer at this level. The AlexNet typically consists of two blocks with a single convolution including a max pooling layer, one block with three convolutions along with a max pooling layer, and one last block with three dense layers. The proposed Normalized AlexNet was now suggested by modifying the second dense layer with a single block of batch normalization, relu optimization, and drop out with a vector size of 4096. The suggested Normalized AlexNet and the current CNN have both been fitted with the Sporocarp Edible dataset. The demonstration illustrates that the Normalized AlexNet model performs better in the edibility categorization of sporocarp, with a high accuracy of 98.76%. © 2024 IEEE.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

ReLU Activated Attention UNet Encipher Decipher Framework based Species Classification of Butterfly

ReLU Activated Attention UNet Encipher Decipher Framework ba...

引用

2024 International Conference on Advancements in Power, Communication and Intelligent Systems, APCI 2024

作者： Devi, M. Shyamala Jeeva, R. Jagadheeswaran, R. Kumar V, Hemand Erwin Nicholas, M. Keerthi Vasan, P. Panimalar Engineering College Department of Computer Science and Engineering Tamilnadu Chennai India Panimalar Engineering College Department of Artificial Intelligence and Data Science Tamilnadu Chennai India

ISBN: (纸本)9798350363289

The recognition and categorization of butterflies is crucial for the preservation of butterfly species in the fields of entomology, computer vision and deep learning. Environmentalists have long utilized butterflies as model organisms in the research on the effects of habitat degradation, dispersion, and warming temperatures. This paper recommends Relu Activated Attention UNet (RAA-UNet) that categorizes the nine butterfly species with high precision. The Butterfly dataset contains 1664 butterfly images that were used in this investigation. The dataset consists of 832 photos of butterflies and 832 related images that have been segmented. The proposed RAA-UNet starts by creating masked butterfly pictures by masking the original image with a segmented butterfly image. To classify the butterfly species, the masked butterfly pictures are assigned to Attention UNet with Decoder and Encoder combined with the Attention gate and activated using Relu Activation function. The proposed RAA-UNet and conventional CNN models are fitted to the masked butterfly pictures. As demonstrated by the experiments, the RAA-UNet model works better with a high accuracy of 97.75% in the classification of butterfly species. © 2024 IEEE.

关键词： Invertebrates

来源：评论

学校读者我要写书评

暂无评论

Emotion Recognition Using Focal Network and Resnet

Emotion Recognition Using Focal Network and Resnet

引用

2024 IEEE International Conference on Information Technology, Electronics and Intelligent Communication Systems, ICITEICS 2024

作者： Meena, T. Balasai, Sigireddy Anandh Duggireddy Kumar, P. Phaneendhra Velagapudi Ramakrishna Siddartha Engineering College Department of Computer Science and Engineering India Velagapudi Ramakrishna Siddartha Engineering College Department of Artificial Intelligence and Data Science India

ISBN: (纸本)9798350382693

Image emotion recognition involves finding the emotions from visual data, usually done through convolutional neural networks (CNN) or deep neural networks (DNN). The existing methodologies are often high complex or time complex and it is not giving the good considerable results and computationally expensive due to high number of parameters. In this paper we try to accomplish a better output by using focal network which stood among other transformers architecture due to the unique property of the dynamic attention Recent studies have shown the efficacy of transformer architectures in various deep learning tasks and challenged the convolution neural networks on various tasks. but, It has a tradeoff between reduced parameter complexity and the need for extensive training data. Drawing inspiration from recent publications, the study adopts a focal network for training, demonstrating its effectiveness in enhancing the model's attention and optimizing emotional feature extraction. The proposed methodology aims to do the process of image emotion recognition by using the advantages of transformers while mitigating the computational problem associated with the deep neural networks. The incorporation of a focal network addresses attention-related challenges, and it contributes to the development of more efficient emotion recognition systems. This research offers insights into the evolving landscape of image emotion recognition methodologies and underscores the potential of transformer architectures in advancing this field. © 2024 IEEE.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

AI-Powered Mental Health Screening and Support for Homeless Children

AI-Powered Mental Health Screening and Support for Homeless ...

引用

2025 AI-Driven Smart Healthcare for Society 5.0, AdSoc5.0 2025

作者： Raga Madhuri, C.H. Bandaru, Jaya Sankar Krishna Srinu, Medisetti Vardhan, Gangadhari Midhun Anand Department of Computer Science and Engineering Velagapudi Ramakrishna Siddhartha Engineering College Vijayawada India Department of Artificial Intelligence and Data Science Velagapudi Ramakrishna Siddhartha Engineering College Vijayawada India

ISBN: (纸本)9798331536336

Mental health illness is a significant global public health threat exacerbated by the lack of effective early identification and intervention measures. This project aims to address these challenges by focusing on mental health recognition among homeless children. Individuals who are particularly vulnerable due to the loss of their parents. We created a model that screens mental health status by analyzing the responses to screening pediatric symptoms checklist (PSC) questions using an SVM algorithm. This model achieved 90% accuracy in predicting the mental health status of children. This approach makes it more accessible and effective for caretakers in orphan homes. The idea of providing some clear guidelines from the obtained results would make sure these children are well taken care of at the right time, which ultimately improves the mental health status of this group of people. © 2025 IEEE.

关键词： Pediatrics

来源：评论

学校读者我要写书评

暂无评论

Deep Learning Classification Techniques on Detecting Diabetic Retinopathy dataset 7

Deep Learning Classification Techniques on Detecting Diabeti...

引用

7th International Conference on Inventive Computation Technologies, ICICT 2024

作者： Revathi, B. Usharani, C. Kezial Elizabeth, S.K. Nagaraj, P. Nithya, D. Ramco Institute of Technology Department of Artificial Intelligence And Data Science Virudhunagar India Mangayarkarasi College of Engineering Department of Computer Science And Engineering Madurai India Kalasalingam Academy of Research And Education Department of Computer Science And Engineering Virudhunagar Krishnankoil India Theni Kammavar Sangam College of Technology Department of Computer Science And Engineering Theni India

ISBN: (纸本)9798350359299

Deep learning algorithms can summarize images to understand how to carry out necessary tasks. The purpose of this study is to compare several deep learning methods. Both experience-based and explanation-based learning are possible in deep learning. The most widely utilized algorithms, such as Convolutional Neural Networks (CNN), Multilayer Perceptron (MLP), Generative Adversarial Networks (GAN), Radial Basis Function Networks (RBFN), and Deep Belief Networks (DBN), and the Diabetic Retinopathy dataset is utilized in this study to evaluate the effectiveness of the algorithms. A comparative study of the classifiers reveals that CNN performs more accurately than the other approaches. © 2024 IEEE.

关键词： Classification (of information)

来源：评论

学校读者我要写书评

暂无评论

Auxiliary Dense Layered LeNet Deep Learning based Mango Variety Classification 2

Auxiliary Dense Layered LeNet Deep Learning based Mango Vari...

引用

2nd International Conference on Advancement in Computation and computer Technologies, InCACCT 2024

作者： Shyamala Devi, M. Janani, G. Fernandes, Jonita Maria Harshini, T. Nivetha, R. Joshika Rao, O.R. Panimalar Engineering College Department of Computer Science and Engineering Tamilnadu Chennai India Panimalar Engineering College Department of Artificial Intelligence and Data Science Tamilnadu Chennai India

ISBN: (纸本)9798350371314

In the agriculture sector, physical classification of fruits is a costly process that can produce inconsistent outcomes due to human negligence. Fruit categorization from snapshots is an extremely difficult venture, especially for mango fruits because various species of mango fruits share significant resemblance. This paper proposes Auxiliary Dense Layered LeNet (ADL-LeNet) that classifies the mango type with high accuracy. The Mango Varieties Classification and Grading dataset, which includes 1200 mango pictures was used for this work. The Mango image dataset is applied with the existing CNN model to select the best CNN model. The LeNet outperforms with the accuracy of 92.77% when compared to the other CNN model. Now LeNet is selected for finetuning the number of dense layers. Generally, the LeNet is composed of three convolutions, two subsampling followed by two dense layers. Now, the LeNet is modified by adding one extra dense layer at the end that compress the output further into vector size of 42 to suggest the proposed model ADL-LeNet. The ADL-LeNet composed of three convolutional, two subsampling and three dense layers ending with the final output layer. The Mango Image dataset has been applied with the existing CNN and proposed ADL-LeNet. The implementation shows that the proposed ADL-LeNet model outperforms with the high accuracy of 97.6% in mango type classification. © 2024 IEEE.

关键词： Fruits

来源：评论

学校读者我要写书评

暂无评论

A Review on the Applications of Machine Learning and Deep Learning Techniques for Skin Cancer Detection 4

A Review on the Applications of Machine Learning and Deep Le...

引用

4th International Conference on Sentiment Analysis and Deep Learning, ICSADL 2025

作者： Wanjari, Ketan Verma, Prateek Faculty of Engineering and Technology Department of Computer Science and Engineering Maharashtra Wardha442001 India Faculty of Engineering and Technology Department of Artificial Intelligence and Data Science Maharashtra Wardha442001 India

ISBN: (纸本)9798331523923

Skin cancer is the most commonly reported type of cancer globally and one of the few cancers that can be effectively treated if detected in its early stages. Recent advancements in artificial intelligence (AI) have significantly improved skin cancer diagnosis through Machine Learning (ML) and Deep Learning (DL) models. This review explores ML and DL methodologies, including Random Forest, Convolutional Neural Networks (CNNs), Inception Networks, ResNets, and Support Vector Machines (SVMs), which have been widely used for automated diagnosis. However, existing AI-based systems face several challenges, including the need for balanced datasets, model interpretability issues, computational complexity, and difficulties in generalizing across diverse populations. Moreover, multimodal detection systems and Explainable AI (XAI) present additional challenges in making AI-driven diagnoses more reliable and transparent. This article discusses these challenges while also highlighting emerging techniques that integrate multiple data sources and individualized diagnostic tools to enhance precision. By addressing these challenges, this review aims to provide insights into the practical advancements and future directions of AI in skin cancer detection and treatment. © 2025 IEEE.

关键词： Adversarial machine learning

来源：评论

学校读者我要写书评

暂无评论

An overview on IRS-enabled sensing and communications for 6G: architectures, fundamental limits, and joint beamforming designs

引用

science China(Information sciences) 2025年第5期68卷 170-193页

作者： Xianxin SONG Yuan FANG Feng WANG Zixiang REN Xianghao YU Ye ZHANG Fan LIU Jie XU Derrick Wing Kwan NG Rui ZHANG Shuguang CUI School of Science and Engineering (SSE) Shenzhen Future Network of Intelligence Institute (FNii-Shenzhen)and Guangdong Provincial Key Laboratory of Future Networks of Intelligence The Chinese University of Hong Kong Department of Electrical Engineering City University of Hong Kong School of Information Engineering Guangdong University of Technology Key Laboratory of Wireless-Optical Communications Chinese Academy of Sciences School of Information Science and Technology University of Science and Technology of China Computer School Beijing Information Science and Technology University National Mobile Communications Research Laboratory School of Information Science and Engineering Southeast University School of Electrical Engineering and Telecommunications University of New South Wales School of Science and Engineering Shenzhen Research Institute of Big Data The Chinese University of Hong Kong Department of Electrical and Computer Engineering National University of Singapore

This study presents an overview on intelligent reflecting surface(IRS)-enabled sensing and communication for the forthcoming sixth-generation(6G) wireless networks, in which IRSs are strategically deployed to proactively reconfigure wireless environments to improve both sensing and communication(S&C) performance. First, we exploit a single IRS to enable wireless sensing in the base station's(BS's) non-line-of-sight(NLoS) area. In particular, we present three IRS-enabled NLoS target sensing architectures with fully-passive, semi-passive, and active IRSs, respectively. We compare their pros and cons by analyzing the fundamental sensing performance limits for target detection and parameter estimation. Next, we consider a single IRS to facilitate integrated sensing and communication(ISAC), in which the transmit signals at the BS are used for achieving both S&C functionalities, aided by the IRS through reflective beamforming. We present joint transmit signal and receiver processing designs for realizing efficient ISAC, and jointly optimize the transmit beamforming at the BS and reflective beamforming at the IRS to balance the fundamental performance tradeoff between S&C. Furthermore, we discuss multi-IRS networked ISAC, by particularly focusing on multi-IRS-enabled multi-link ISAC, multi-region ISAC, and ISAC signal routing, respectively. Finally, we highlight various promising research topics in this area to motivate future work.

关键词： integrated sensing and communication (ISAC) intelligent reflecting surface (IRS) non-line-of-sight (NLoS) sensing sensing and communication tradeoff

来源：评论

学校读者我要写书评

暂无评论

A Machine Learning and Deep Learning based Approach to Generate a Speech Emotion Recognition System 18

A Machine Learning and Deep Learning based Approach to Gener...

引用

18th INDIAcom;11th International Conference on Computing for Sustainable Global Development, INDIACom 2024

作者： Sumera Vaidehi, K. Nisha, Qamar Stanley College of Engineering and Technology for Women Department of Artificial Intelligence & Data Science and Computer Engineering Hyderabad India Stanley College of Engineering and Technology for Women Department. of Computer Science Engineering Hyderabad India

ISBN: (纸本)9789380544519

The most common and general medium via which we humans convey or communicate our thoughts, emotions, feelings or ideas artlessly is by speech or articulation. Blending of this artless way of speech with the technological advancements of AI, has given rise to the importance of building emotion recognition systems from speech today. Even more, the speech/articulation emotion recognition system presented here is also to contribute in and facilitate various emerging applications of today like, in detecting persons' physiological state (as in lie detectors), also be used in forensics, medicine. The proposed work identifies/associates an appropriate label/emotion for the respective emotion from speech presented in the form of an audio file (.wav format). About 4240 audio samples are taken. 1440, 2800 samples from RAVDESS and TESS datasets are considered respectively. After this process of data collection, features are separately extracted for each audio dataset mentioned above. Energy, pitch, ZCR, co-efficient of Mel frequency ceptrum (MFCC) are some of the features considered in this study. Furthermore, clubbing and merging of 2 datasets is performed resulting in a total of 4240 rows and 24 columns (features/characteristics including 1class label) of audio samples. The resulting 4240 samples of feature dataset is split/bifurcated into training and testing set by considering 3 different possibilities/instances viz;60%-40% ratio, 70%-30% ratio, 80%-30% ratio. The models namely CNN, Random forest and Support Vector Machine are trained to classify the dataset into 8 different emotions (neutral, calm, happy, sad, angry, fearful, disgust, surprise). An attempt to implement the models using two very essential disciplines of AI i.e. Machine Learning and Deep Learning is made here. The accuracy or results are depicted by generating confusion matrices on test data for CNN, RF and SVM models (Each model is trained and test across 3 different ratios viz;60%-40%, 70%-30%, 80%-20%). C

关键词： Classification CNN Confusion matrices Extraction of features MFCC RF Speech/articulation Emotion Recognition SVM

来源：评论

学校读者我要写书评

暂无评论

Geometric prior guided hybrid deep neural network for facial beauty analysis

引用

CAAI Transactions on Intelligence Technology 2024年第2期9卷 467-480页

作者： Tianhao Peng Mu Li Fangmei Chen Yong Xu David Zhang The School of Computer Science and Technology Guizhou UniversityGuiyangChina Department of Automation Moutai InstituteRenhuaiGuizhouChina The School of Computer Science and Technology Harbin Institute of TechnologyShenzhenShenzhenChina The Information and Communication Engineering Department Dalian Minzu UniversityDalianChina The School of Data Science The Chinese University of Hong KongShenzhenShenzhenChina

Facial beauty analysis is an important topic in human *** may be used as a guidance for face beautification applications such as cosmetic *** neural networks(DNNs)have recently been adopted for facial beauty analysis and have achieved remarkable ***,most existing DNN-based models regard facial beauty analysis as a normal classification *** ignore important prior knowledge in traditional machine learning models which illustrate the significant contribution of the geometric features in facial beauty *** be specific,landmarks of the whole face and facial organs are introduced to extract geometric features to make the *** by this,we introduce a novel dual-branch network for facial beauty analysis:one branch takes the Swin Transformer as the backbone to model the full face and global patterns,and another branch focuses on the masked facial organs with the residual network to model the local patterns of certain facial ***,the designed multi-scale feature fusion module can further facilitate our network to learn complementary semantic information between the two *** model optimisation,we propose a hybrid loss function,where especially geometric regulation is introduced by regressing the facial landmarks and it can force the extracted features to convey facial geometric *** performed on the SCUT-FBP5500 dataset and the SCUT-FBP dataset demonstrate that our model outperforms the state-of-the-art convolutional neural networks models,which proves the effectiveness of the proposed geometric regularisation and dual-branch structure with the hybrid *** the best of our knowledge,this is the first study to introduce a Vision Transformer into the facial beauty analysis task.

关键词： deep neural networks face analysis face biometrics image analysis

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：