检索结果-内蒙古大学图书馆

2024 International Conference on Advances in Data engineering and Intelligent Computing Systems, ADICS 2024

作者： Karthikeyan, Nithesh Kumar Ali, Sharuksha Syed Vishnu Sekhar, R. Hindustan Institute Of Technology And Science Department Of Computer Science & Engineering Chennai India

ISBN: (纸本)9798350364828

Lung cancer stands as a formidable and prevalent threat, necessitating urgent attention to early diagnosis and precise treatment to mitigate its high fatality rates. In this context, the utilization of computed tomography (CT) scans, particularly in conjunction with advanced deep learning algorithms, emerges as a powerful strategy for effective lung cancer identification. This study introduces a specialized Convolutional Neural Network (CNN) framework meticulously designed for the early detection of lung cancer through the analysis of CT scan images. Through rigorous comparative analyses with alternative models, our research highlights the CNN's superior performance, marking a substantial improvement over conventional diagnostic technique. The results accentuate the efficacy of our proposed deep learning model, solidifying its position as a more robust and potent diagnostic tool compared to prevailing approaches for the early identification of lung cancer. Future research avenues may explore the integration of larger and more diverse datasets, ensuring the model's robustness and applicability across varied clinical scenarios, ultimately advancing the landscape of lung cancer diagnostics towards improved patient outcomes and healthcare practices. © 2024 IEEE.

关键词： Learning algorithms

来源：评论

学校读者我要写书评

暂无评论

From Pixels to Prognosis: A Deep Dive into Lung Cancer Subtype Classification using Transfer learning 2

From Pixels to Prognosis: A Deep Dive into Lung Cancer Subty...

引用

2nd IEEE International Conference on Contemporary Computing and Communications, InC4 2024

作者： Ravindra, Chaya Nalband, Abdul Haq Kumar, Govind Basheer, Salman Ravindra, Manjunath School of Computer Science Engineering REVA University Bengaluru India

ISBN: (纸本)9798350383652

Current diagnostic procedures, such as imaging tests and biopsies, are time intensive and prone to human error. As a result, we employed deep learning to uncover patterns and identify lung cancer from histology pictures. We hypothesised that combining two separate methods (transfer learning and data augmentation) would increase the efficacy of lung cancer detection. The study's dataset consisted of histology images for Lung Cancer and was quite unbalanced, with very few photos containing tumours. We hypothesised that transfer learning, which employs existing knowledge for classification and detection, would be effective on such an imbalanced dataset. To boost our learning, we employed data augmentation to incorporate changes into the dataset. This can be accomplished by implementing Sequential Keras Model Layers. We then tested the efficacy of various convolutional neural network models on this job. We achieved an accuracy of 87.50% utilising the transfer learning model MobileNetV2 as the basis model and numerous geometric changes, beating the cutting-edge convolutional neural network-based technique. We then analysed the results and published them on a website that people can access at any time and from any location. © 2024 IEEE.

关键词： Lung cancer

来源：评论

学校读者我要写书评

暂无评论

Signbridge-Audio to Sign Language Translator

Signbridge-Audio to Sign Language Translator

引用

2025 IEEE International Students' Conference on Electrical, Electronics and computer science, SCEECS 2025

作者： Shirisha, Kammadanam Deeksith, E. Mani Madhava, M. S. S Sri Surendhar, S. Karthikeyan, R. Vardhaman College of Engineering Department of Computer Science and Engineering Hyderabad India Hyderabad India

ISBN: (纸本)9798331529833

The lack of communication options for Deaf and hearing people, some may say creates a significant social disadvantage in accessing the often-bare essential services. In contrast to acoustically communicated sound patterns, sign language communicates ideas freely through manual and body language. It can be used by those who have trouble speaking, by those who can hear but cannot talk, and by regular people to interact with those with hearing impairments. By automatically converting spoken words into hand gestures, this project creates a web-based interface that allows hearing-impaired individuals to communicate with normal people in real time through sign language interpretation. Two major phases of the system Speech-to-text technology interprets oral input into textual output. Next, the text is run through Natural Language Processing (NLP) algorithms leveraging the Natural language toolkit (NLTK) to be syntactically parsed in the context of sign natural rules. The last phase translates the parsed text into sign language gestures which involve hand shapes, orientation, and body movements to convey the visual meaning of a message. This system could be used to greatly reduce the communication barriers experienced by people with hearing loss and deafness using Machine Learning (ML) to continuously improve accuracy, enhance quality of life, and provide a more inclusive society between the deaf community in our daily interactions within an ever-growing world. © 2025 IEEE.

关键词： Natural language processing systems

来源：评论

学校读者我要写书评

暂无评论

InsightNav - Empowering Desktop Navigation with computer Vision

InsightNav - Empowering Desktop Navigation with Computer Vis...

引用

2025 IEEE International Students' Conference on Electrical, Electronics and computer science, SCEECS 2025

作者： Sharma, Ayush Kumar Wakchaure, Supriya Bharat Twinkle, Tanya Vignesh, U. Vellore Institute of Technology School of Computer Science and Engineering Chennai India

ISBN: (纸本)9798331529833

InsightNav redefines desktop interaction by harnessing cutting-edge computer vision and AI technologies to deliver a transformative user experience. Beyond its intuitive gesture-based navigation, InsightNav pioneers two groundbreaking features that elevate human-computer interaction. The first, Personalized Desktop Themes Based on Emotion Detection, leverages real-time facial expression analysis to dynamically alter the desktop's visual aesthetics based on the user's mood. This innovation empowers the system to foster emotional well-being, transitioning to serene themes during stress or energizing designs to amplify motivation. The second feature, Voice-Activated Multitasking Snap Layouts, revolutionizes task management by enabling users to orchestrate optimized application layouts through simple voice commands. By uttering phrases like "Focus Mode,"users can seamlessly arrange tools such as browsers, editors, and readers, thereby accelerating productivity and streamlining workflows. Powered by MediaPipe, OpenCV, and Google Speech-to-Text, InsightNav transcends traditional interaction paradigms, offering unparalleled accessibility, efficiency, and personalization. Together, these innovations position InsightNav as a revolutionary tool, setting new benchmarks in adaptive and immersive human-computer interaction. © 2025 IEEE.

关键词： Gesture recognition

来源：评论

学校读者我要写书评

暂无评论

Random Frame: a Data Augmentation for Glass Detection 27th

Random Frame: a Data Augmentation for Glass Detection

引用

27th International Conference on Pattern Recognition, ICPR 2024

作者： Liang, Yiming Ishikawa, Hiroshi Department of Computer Science and Communications Engineering Waseda University Tokyo Japan

ISBN: (纸本)9783031801358

Glass, though ubiquitous, is difficult to recognize in an image due to its transparency. Fine-grained low-level features indicating the presence of glass, such as refraction and reflection, are weak and subtle. This causes difficulties for existing glass detection models in learning those features, pushing them to rely on more overt cues, especially the frame surrounding the glass. Consequently, they can be fooled easily by frame-like objects. Here, we propose a simple data augmentation scheme called Random Frame to address this problem. Random Frame inserts a frame into an image to create an area with a frame but no glass. The model will receive a penalty if it only relies on the frame. The performances of existing models on various datasets improve when Random Frame is applied while being trained. Our comprehensive experiments demonstrate that our data augmentation can make models utilize more low-level features with more confidence in their predictions. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Image recognition

来源：评论

学校读者我要写书评

暂无评论

Utilizing Large Language Models with Causal Reasoning and Commonsense Knowledge for Empathic Dialogue Generation 15

Utilizing Large Language Models with Causal Reasoning and Co...

引用

15th IEEE Annual Computing and Communication Workshop and Conference, CCWC 2025

作者： Zhu, Yunhao College of Computer Science and Engineering Northwest Normal University LanZhou China

ISBN: (纸本)9798331507695

Empathic dialogue plays an indispensable role in interpersonal communication. Previous methods were mainly based on carefully designed small-scale language models. With the emergence of ChatGPT, the application of large language models (LLMs) in this field has attracted great attention. Our framework combines causal reasoning, commonsense knowledge bases, and LLMs to generate logically rigorous and emotionally rich dialogues. We first use small models to understand user input, and then use LLMs to optimize responses and enhance the emotional expression and naturalness of the dialogue. Causal reasoning and commonsense reasoning help the model understand the content of the conversation and reason about the underlying information. In addition, the introduction of LLMs significantly improves the coherence and emotional depth of the conversation. Although the initial generated response may not be natural enough, the embellishment of the LLMs makes the final response closer to human communication. The framework of this study not only promotes the development of humanized dialogue systems, but also provides new ideas for future human-computer interaction. Experimental results show that our approach outperforms most of the state-of-the-art baselines and excels at generating more empathetic and contextually relevant responses. © 2025 IEEE.

关键词： Causal Reasoning Commonsense Knowledge Empathic Dialogue Generation Large Language Models

来源：评论

学校读者我要写书评

暂无评论

Visual Topic Semantic Enhanced Machine Translation for Multi-Modal Data Efficiency

引用

Journal of computer science & Technology 2023年第6期38卷 1223-1236页

作者：王超蔡思佳史北祥崇志宏 School of Computer Science and Engineering Southeast UniversityNanjing 210096China School of Architecture Southeast UniversityNanjing 210096China

The scarcity of bilingual parallel corpus imposes limitations on exploiting the state-of-the-art supervised translation *** of the research directions is employing relations among multi-modal data to enhance ***,the reliance on manually annotated multi-modal datasets results in a high cost of data *** this paper,the topic semantics of images is proposed to alleviate the above ***,topic-related images can be auto-matically collected from the Internet by search ***,topic semantics is sufficient to encode the relations be-tween multi-modal data such as texts and ***,we propose a visual topic semantic enhanced translation(VTSE)model that utilizes topic-related images to construct a cross-lingual and cross-modal semantic space,allowing the VTSE model to simultaneously integrate the syntactic structure and semantic *** the above process,topic similar texts and images are wrapped into groups so that the model can extract more robust topic semantics from a set of similar images and then further optimize the feature *** results show that our model outperforms competitive base-lines by a large margin on the Multi30k and the Ambiguous COCO *** model can use external images to bring gains to translation,improving data efficiency.

关键词： multi-modal machine translation visual topic semantics data efficiency

来源：评论

学校读者我要写书评

暂无评论

MUSE: Multi-Knowledge Passing on the Edges, Boosting Knowledge Graph Completion 23

MUSE: Multi-Knowledge Passing on the Edges, Boosting Knowled...

引用

23rd International Conference on Machine Learning and Cybernetics, ICMLC 2024

作者： Liu, Pengjie School of Computer Science and Engineering Southern University of Science and Technology Shenzhen China

ISBN: (纸本)9798331528041

Knowledge Graph Completion (KGC) aims to predict the missing information in the (head entity)-[relation]-(tail entity) triplet. Deep Neural Networks have achieved significant progress in the relation prediction task. However, most existing KGC methods focus on single features (e.g., entity IDs) and sub-graph aggregation, which cannot fully explore all the features in the Knowledge Graph (KG), and neglect the external semantic knowledge injection. To address these problems, we propose MUSE, a knowledgeaware reasoning model to learn a tailored embedding space in three dimensions for missing relation prediction through a multiknowledge representation learning mechanism. Our MUSE consists of three parallel components: 1) Prior Knowledge Learning for enhancing the triplets' semantic representation by finetuning BERT;2) Context Message Passing for enhancing the context messages of KG;3) Relational Path Aggregation for enhancing the path representation from the head entity to the tail entity. Our experimental results show that MUSE significantly outperforms other baselines on four public datasets, such as over 5.50% improvement in H@1 and 4.20% improvement in MRR on the NELL995 dataset. The code and all datasets will be released via https://***/NxxTGT/MUSE. © 2024 IEEE.

关键词： Knowledge graph

来源：评论

学校读者我要写书评

暂无评论

A Hybrid Deep Learning for Animal Species Classification using CCTV Surveillance Video: A Review 4

A Hybrid Deep Learning for Animal Species Classification usi...

引用

4th IEEE International Conference on ICT in Business Industry and Government, ICTBIG 2024

作者： More, Rahul P. Khan, Rais Abdul Hamid Sandip University School of Computer Science & Engineering Nashik India

ISBN: (纸本)9798331518981

Biodiversity is crucial for maintaining ecosystem stability, yet global biodiversity is currently in sharp decline, necessitating urgent protective measures. Wildlife monitoring and conservation, which determine biodiversity patterns, are fundamental in ecology, biogeography, and conservation biology. Advances in digital technologies, such as Closed Circuit Television (CCTV) systems, have enhanced the ability to monitor wildlife activities. The growth in wildlife data has facilitated the detection of targeted wild animals and their behaviors, but challenges remain due to the continuous data stream and complex environments. Recent developments in Deep Learning (DL) have demonstrated potential in overcoming these challenges by efficiently processing large-scale video data to detect and classify animal activities. Despite these advancements, further research is needed to refine these models for real-time applications, especially in mitigating wildlife-human conflicts and ensuring the safety of both wildlife and humans. This paper explores the integration of DL techniques in wildlife monitoring, highlighting their potential to improve biodiversity conservation efforts. In this review article we overview the deep learning approaches, focusing on detection of wild creatures to lessen the wild animal- human conflicts. We present a comprehensive overview of issues and challenges in wild life detection in order for a better understanding. © 2024 IEEE.

关键词： Invertebrates

来源：评论

学校读者我要写书评

暂无评论

AESRM-Automatic English Sign Language Recognition with Machine learning techniques

AESRM-Automatic English Sign Language Recognition with Machi...

引用

2024 IEEE Region 10 Conference, TENCON 2024

作者： Kaur, Kamalpreet Garg, Rachit Lovely Professional University Department of Computer Science and Engineering India

ISBN: (纸本)9798350350821

The hand sign recognition has one of the most important learning domains and applications in the fields of computer vision and artificial intelligence. More specifically, one type of such communication is Sign Language, which includes finger gestures, illustrations of the face and gestural movements. However, there is always a social barrier that separates the deaf community from the oral/aural society, and hence there is need to develop better and more natural channels of communication. Making a stand that is one of the potential remedies is capable of correctly identifying the performed hand gestures and translating them into corresponding letters of American Sign Language (ASL) almost immediately. For image collection in this paper, first the images are collected from a live video stream and then these images are preprocessed to form the dataset. Then, the given dataset is divided into testing and training portions set over which, various machine learning algorithms like Naive Bayes, Multinomial Naive Bayes, Random Forest, SVM, KNN, Logistic Regression, Decision Trees are built, and a model/classifier and highest accuracy of 99. 79% has been achieved by decision tree. Lastly, to identify the hand signs in the live video stream we use the predicted hand signs, and a string is produced which can be converted into the user's preferred speech. © 2024 IEEE.

关键词： Video streaming

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：