Search Results - Inner Mongolia University Library

Addressing the role and opportunities of machine learning utilization in brain tumor detection

Procedia Computer Science, 2024, vol. 245, pp. 869-878

Authors: Vallerie Delia Lesmana Holly Agustine Irma Kartika Wairooy Brilly Andro Makalew Computer Science Department School of Computer Science - Binus University Jakarta 11480 Indonesia Mobile Application & Technology Program Computer Science Department - Binus University Jakarta 11480 Indonesia

This research aims to develop a brain tumor detection model using machine learning techniques and a Convolutional Neural Network (CNN). A significant concern is the early detection and proper handling of brain tumors. The methodology consists of collecting the dataset, identifying the tools and language to use, preparing and preprocessing the data, data augmentation, splitting and label encoding, building the model architecture, compiling, training, and evaluating the model, making predictions, and comparing the model with other models. The dataset consists of 7022 MRI images, divided into training and testing subsets, in four classes: glioma, meningioma, pituitary, and no tumor. Four different CNN models were built and evaluated: VGG16, InceptionV3, ResNet50, and DenseNet121. The results show that VGG16 performed best, achieving an accuracy of 96.43%, followed by DenseNet121 (94.96%), InceptionV3 (92.40%), and ResNet50 (78.69%). Although there is still room for improvement regarding overfitting and the models' overall performance, this result is promising for enhancing early diagnosis and supporting appropriate and effective treatment for patients.
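
The "splitting and label encoding" step named in the abstract can be illustrated with a minimal sketch. Only the four class names come from the abstract. The file names, the 0.2 test fraction, the seed, and the helper names `encode_labels` and `train_test_split` are assumptions made for illustration, not the authors' pipeline.

```python
import random

# Class names come from the abstract; everything else here is illustrative.
CLASSES = ["glioma", "meningioma", "pituitary", "no tumor"]


def encode_labels(labels, classes=CLASSES):
    """Map each class name to its integer index in `classes`."""
    index = {name: i for i, name in enumerate(classes)}
    return [index[label] for label in labels]


def train_test_split(items, test_fraction=0.2, seed=0):
    """Shuffle a copy of `items` and split it in two.

    The 0.2 test fraction is an assumption; the abstract does not state the ratio.
    """
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical (file name, class name) pairs standing in for the MRI images.
samples = [(f"img_{i}.jpg", CLASSES[i % 4]) for i in range(20)]
train, test = train_test_split(samples)
train_labels = encode_labels([label for _, label in train])
```

A fixed seed keeps the split reproducible, so every model being compared is evaluated on the same held-out images.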

Keywords: Machine Learning Deep Learning CNN Brain Tumor Detection MRI VGG16 InceptionV3 ResNet50 DenseNet121

Camera-Only Perception System for Traffic Jam Assistance

2023 International Automatic Control Conference, CACS 2023

Authors: Huang, Xiao-Wei Chen, Xiu-Zhi Chen, Yen-Lin National Taipei University of Technology Master Program in Artificial Intelligence Technology Taipei Taiwan National Taipei University of Technology Department of Computer Science and Information Engineering Taipei Taiwan

ISBN: (print) 9798350306354

Traditional perception systems for TJA (Traffic Jam Assistance) are mostly implemented by fusing images with radar or lidar. As computer vision techniques become more powerful, cameras can almost replace radar and lidar in perception tasks, which reduces the hardware cost of the system. In this research, we propose a camera-only perception system for TJA that provides information about the vehicles ahead and the drivable area. The proposed system was evaluated on real-world scenario sequences and shown to be highly robust, which makes it a strong candidate for adoption in TJA development. © 2023 IEEE.

Keywords: Cameras

Robust Object Detection Model for UAV Application

2023 International Automatic Control Conference, CACS 2023

Authors: Ku, Chun Chen, Xiu-Zhi Chen, Yen-Lin National Taipei University of Technology Master Program in Artificial Intelligence Technology Taipei Taiwan National Taipei University of Technology Department of Computer Science and Information Engineering Taipei Taiwan

ISBN: (print) 9798350306354

Beyond the common issues of the object detection task, Unmanned Aerial Vehicle (UAV) applications must also handle small objects, which is one of the critical problems to be solved. YOLOv7 is a powerful network architecture that provides efficient and accurate object detection results. This paper adopts YOLOv7 as an object detection model for two kinds of targets: vehicles and ocean flotsam. By training the model on open datasets and fine-tuning it on self-collected datasets, we show through sequences collected from real-world scenarios that YOLOv7 provides robust and accurate detection of both vehicles and ocean flotsam with real-time efficiency. Based on these experimental results, we confirm that YOLOv7 can serve as the baseline for object detection model development. © 2023 IEEE.

Keywords: Object detection

Using textile capacitive sensors to train synchronized movement of hands

2024 IEEE International Conference on Consumer Electronics, ICCE 2024

Authors: Yang, Chang-Ming Wu, Shu-Cing Chen, Shih-Hung Wu, Ze-We Liao, Kuo-Cheng Chen, Chi-Chun Ming Young Biomedical Corp. Miaoli Taiwan National Chin-Yi University of Technology Program Prospective Technology of Electrical Engineering and Computer Science Taichung Taiwan National Chin-Yi University of Technology Dept. of Computer Science and Information Engineering Taichung Taiwan National Chin-Yi University of Technology Dept. of Electronic Engineering Taichung Taiwan

ISBN: (print) 9798350324136

This study introduces a capacitive sensing interactive game platform aimed at promoting emotional stability, which we have named the 'Sunrise and Sunset' game. The game primarily consists of two pieces of regular textile fabric enveloping conductive silver fabric. A microcontroller extracts the sensed capacitance values, and the game is designed to accompany the slow raising and lowering of both hands. The platform has the potential to offer a novel method of emotional management, particularly in high-stress living environments. It can serve as an effective relaxation tool, aiding individuals in emotional balance, anxiety reduction, and stress alleviation. The platform can also contribute to the promotion of mental well-being, providing an engaging and beneficial means for people to manage their emotions and moods. © 2024 IEEE.

Keywords: capacitive sensing emotional stability game relaxation tool silver fabric

AlexNet Architecture Based Convolution Neural Network for Realtime Audio to Text Translator of Bisindo Hand Sign

2023 International Seminar on Application for Technology of Information and Communication, iSemantic 2023

Authors: Sujatmiko, Dhiki Sari, Christy Atika Rachmawanto, Eko Hari Krismawan, Andi Danang Altamer, Bilal R. Alkhafaji, Mohamed Ayad University of Dian Nuswantoro Study Program in Informatics Engineering Semarang Indonesia University of Dian Nuswantoro Study Program in Animation Semarang Indonesia Mosul University Presidency University of Mosul Department of Computer Science Mosul Iraq National University of Science and Technology Department of Computer Science DhiQar Iraq

ISBN: (print) 9798350339215

Deafness is a condition that results in the loss of hearing function, hindering the reception of information, such as oral communication, that relies on the sense of hearing. Consequently, individuals with hearing impairment experience communication barriers and may have limited or no ability to respond. One solution is the use of sign language. In Indonesia, there are two known sign languages: Sibi and Bisindo. Both serve the same function but differ in their style of movement and expression. Bisindo is considered more flexible as it conveys meaning based on the Indonesian language. However, understanding of this language is still limited among many people. Therefore, a program is needed to facilitate translation between deaf individuals who use sign language and their counterparts who do not. CNN (Convolutional Neural Network) is a deep learning algorithm used to train computer systems to recognize visual input data. There are various CNN-based architectures, and one of them is AlexNet. Based on the authors' testing, the AlexNet architecture proves to be suitable for real-time sign language translation. The evaluation of the system involved a dataset of 7,800 items and 520 test instances, of which an average of 468 were translated correctly. The system thus achieved a 90% accuracy rate, reported as a 100% increase in accuracy compared to previous approaches. © 2023 IEEE.
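
The 90% figure follows directly from the counts in the abstract (468 correct out of 520 test instances). A short sketch of that arithmetic, where the helper name `accuracy` is an assumption for illustration:

```python
def accuracy(correct, total):
    """Return the fraction of test instances that were translated correctly."""
    if total <= 0:
        raise ValueError("total must be positive")
    return correct / total


# Counts taken from the abstract: 468 correct translations out of 520 tests.
acc = accuracy(468, 520)
print(f"{acc:.0%}")  # prints "90%"
```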

Keywords: Audition

Correlating the Ambient Conditions and Performance Indicators of the LoRaWAN via Surrogate Gaussian Process-Based Bidirectional LSTM Stacked Autoencoder

IEEE Transactions on Network and Service Management, 2023, vol. 20, no. 3, pp. 3413-3427

Authors: Bhat, Showkat Ahmad Huang, Nen-Fu Hussain, Imtiyaz Sajjad, Uzair National Tsing Hua University College of Electrical and Computer Science Hsinchu1300044 Taiwan National Tsing Hua University Department of Computer Science Hsinchu1300044 Taiwan National Taipei University of Technology Graduate Program in Energy and Opto-Electronic Materials Taipei10608 Taiwan National Taipei University of Technology Department of Energy and Refrigerating Airconditioning Engineering Taipei10608 Taiwan

LoRa's biggest advantage is its flexibility: the ability to increase or decrease data rate and range while decreasing or increasing sensitivity. When propagation conditions change frequently, this allows the spreading factor to be modified accordingly. Despite their efficiency and scalability, adaptive data rate algorithms fail to factor in the complex correlation between the ambient weather parameters that influence the communication channel design. In this research, a Bayesian surrogate Gaussian process-based bidirectional LSTM stacked autoencoder model (BSGP-BLSTM-SAE) is proposed to estimate channel performance indicators such as the received signal strength indicator (RSSI) and signal-to-noise ratio (SNR), and to determine the correlation between ambient weather conditions and performance indicators for the LoRaWAN network. A Bayesian optimization algorithm is used to optimize the hyper-parameters of the developed model. An experimental LoRaWAN multivariate time series dataset is used to evaluate the developed model, which upon testing and validation predicts the channel performance indicators and ambient conditions of the experimental LoRaWAN network with high accuracy. The mean absolute error of the developed model was around 0.45. Thus, the proposed model can predict the link performance indicators and thereby assist in real-time optimization of the transmission parameters to enhance network performance in LoRaWAN-based systems under different ambient conditions. © 2004-2012 IEEE.
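
The abstract reports its result as a mean absolute error (MAE), the average absolute difference between predicted and measured values. A minimal sketch of the metric follows. The RSSI readings are hypothetical values chosen for illustration and are not taken from the paper's dataset.

```python
def mean_absolute_error(y_true, y_pred):
    """Average of |measured - predicted| over paired samples."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical RSSI readings in dBm (not from the paper).
measured = [-100.0, -95.0, -110.0, -105.0]
predicted = [-100.5, -94.5, -110.5, -104.5]
mae = mean_absolute_error(measured, predicted)  # every error is 0.5, so MAE is 0.5
```

MAE is expressed in the same unit as the predicted quantity, so a value is only meaningful alongside the scale of the target variable.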

Keywords: Internet of things

The future of virtual reality: Prospect and problems

Procedia Computer Science, 2024, vol. 245, pp. 355-364

Authors: Andrew Alfonso Lie Owen Tamashi Buntoro Eko Setyo Purwanto Muhamad Keenan Ario Computer Science Department School of Computer Science Bina Nusantara University Jakarta Indonesia 11480 Mobile Application & Technology Program Computer Science Department School of Computer Science Bina Nusantara University Jakarta Indonesia 11480

Virtual Reality (VR) is a technology that allows users to interact with a simulated environment through computers, simulator tools, and other intermediaries, providing new experiences, including ones that are not possible in the real world. This research aims to identify and analyze the challenges, such as technical, social, and ethical barriers, that may arise in the development and application of VR technology. Using a mixed methods approach, we collected quantitative data from a Kaggle dataset and conducted in-depth interviews with active VR users. Our findings show that although VR has significant potential in various fields, such as education, healthcare, and entertainment, challenges such as motion sickness, high costs, and limited engaging content hinder the widespread adoption of VR technology. The study also highlights the importance of improving technologies such as visual capabilities, motion responsiveness, and audio quality to enhance the user experience. By overcoming these barriers, VR can maximize its positive impact and become a revolutionary tool in various sectors.

Keywords: Virtual Reality immersive experience technical barriers social impact ethical considerations

Quantitative analysis of sign language translation using artificial neural network model

Procedia Computer Science, 2024, vol. 245, pp. 998-1009

Authors: Fendy Wijaya Leonardo Dahendra Eko Setyo Purwanto Muhamad Keenan Ario Computer Science Department School of Computer Science Bina Nusantara University Jakarta Indonesia 11480 Mobile Application & Technology Program Computer Science Department School of Computer Science Bina Nusantara University Jakarta Indonesia 11480

Sign language is one of the techniques that support communication with deaf and speech-impaired people. Human needs are becoming more complex, and so are the needs of people with these disabilities. Therefore, given the sophistication of modern computer science, there is a need for a sign language translation system capable of converting human gestures into words or spelled letters in a natural human language, in particular the Indonesian language and SIBI sign language. This research analyzes the capability of an Artificial Neural Network (ANN) system to translate SIBI sign language into Indonesian. The analysis is performed quantitatively on the experimental results to gain descriptive insight into the generated models. Two datasets containing alphabets and words in SIBI sign language were created, and two models were generated to predict alphabets and words, respectively. The results show that the ANN model can perform sign language translation effectively and efficiently, with alphabet prediction accuracy reaching 96.15% and word prediction accuracy reaching 99.45%, and with prediction time not exceeding 0.15 seconds. Based on these results, it can be concluded that the ANN model is quantitatively suitable for implementation as an Indonesian sign language translator system.

Keywords: sign language Artificial Neural Network SIBI prediction quantitative analysis

Harnessing Deep Learning for Ocular Disease Diagnosis

Procedia Computer Science, 2024, vol. 245, pp. 914-923

Authors: Jessica Ryan Dave Andrew Nathaniel Eko Setyo Purwanto Muhamad Keenan Ario Computer Science Department School of Computer Science Bina Nusantara University Jakarta Indonesia 11480 Mobile Application & Technology Program Computer Science Department School of Computer Science Bina Nusantara University Jakarta Indonesia 11480

Vision impairment, often caused by preventable ocular diseases, can be challenging to diagnose accurately, and diagnosis is prone to human error. Automation using technology, particularly deep learning, offers a promising way to aid accurate and efficient disease detection. This study explores the use of different CNN models, specifically VGG-16, VGG-19, ResNet-50, and ResNet-152v2, for detecting ocular diseases. Simple fine-tuning is applied to these models, and their performance is compared to identify the most effective model. The purpose is to show how different models contribute to establishing reliable disease detection systems. The results reveal that most of these models perform well even with minimal fine-tuning. Among the models, ResNet-152v2 achieved the highest training accuracy of 90.36%, demonstrating its capacity to learn from the training data. In contrast, ResNet-50 offered a more balanced performance with marginally lower accuracy, making it a robust choice for general application.

Keywords: ocular diseases deep learning classification

A fine-tuned vision transformer-based on limited dataset for facial expression recognition

Procedia Computer Science, 2024, vol. 245, pp. 574-582

Authors: Rio Febrian Ronald Richie Huang Nicholas Setiono Dimas Ramdhan Andry Chowanda Computer Science Department School of Computer Science Bina Nusantara University Jakarta Indonesia 11480 Game Application & Technology Program Computer Science Department School of Computer Science Bina Nusantara University Jakarta Indonesia 11480

Interest in Facial Expression Recognition (FER) has increased over the past few decades, and several challenges have surfaced with the invention of many different FER models, which are often based on Convolutional Neural Network (CNN) architectures. Recently, an attention-based transformer model has been presented to address FER. One of the major issues with Transformers is the need for a large quantity of training data. Therefore, in this paper, we propose to learn how to fine-tune a vision transformer-based (ViT) model using a limited dataset. We use the JAFFE Dataset, which consists of only 213 images containing seven different emotions. The proposed method is evaluated using several fine-tuning methods, such as adding dropout, data augmentation, and layer freezing. We compared models implemented with 5% dropout regularization, an augmented dataset (up to 5000 images), and freezing of the initial model's layers, fine-tuning around a fourth of the last layers. The best model was achieved by fine-tuning ViT L-16, which reached 96.06% accuracy when trained with 5% dropout on the augmented dataset with the first 21 layers frozen. We also compared our model with previous work, and the results show that our model reaches the state of the art on the JAFFE dataset.
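
Layer freezing, as named in the abstract, means excluding the earliest layers from weight updates so that only the later layers are fine-tuned. The sketch below shows the bookkeeping in framework-free Python. The `Layer` class, the `freeze_first` helper, and the total of 24 layers are assumptions for illustration; the abstract states only that the first 21 layers were frozen and does not say how layers are counted.

```python
from dataclasses import dataclass


@dataclass
class Layer:
    """Hypothetical stand-in for one model layer with a trainable flag."""
    name: str
    trainable: bool = True


def freeze_first(layers, n_frozen):
    """Mark the first `n_frozen` layers as frozen and the rest as trainable."""
    if not 0 <= n_frozen <= len(layers):
        raise ValueError("n_frozen must be between 0 and len(layers)")
    for i, layer in enumerate(layers):
        layer.trainable = i >= n_frozen
    return layers


# 24 layers is an assumed total; 21 frozen layers is the figure in the abstract.
layers = [Layer(f"block_{i}") for i in range(24)]
freeze_first(layers, 21)
trainable = [layer.name for layer in layers if layer.trainable]
```

In a deep learning framework the same flag would typically correspond to disabling gradient updates for the frozen parameters.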

Keywords: Facial Expression Recognition Vision Transformer Attention Deep Learning
