This research aims to develop a brain tumor detection model by utilizing the machine learning techniques and Convolutional Neural Network (CNN). A significant matter to address is revolving around early detection and ...
详细信息
This research aims to develop a brain tumor detection model by utilizing the machine learning techniques and Convolutional Neural Network (CNN). A significant matter to address is revolving around early detection and the proper handling regarding the brain tumor. This research's methodology consists of collecting the dataset, identifying the tools and language to use, prepare and preprocessing the data, data augmentation, splitting and label encoding, building the model architecture, compiling the model, training, and evaluating the model, predicting the model, and comparing it with other models. Dataset consists of 7022 MRI images, divided into training and testing subsets; and four classes: glioma, meningioma, pituitary, and no tumor. There are four different CNN models that have been built and evaluated, namely VGG16, InceptionV3, ResNet50, and DenseNet121. The result gained shows VGG16 with the best performance achieving an accuracy rate of 96.43%, followed by DenseNet121 (94,96%), InceptionV3 (92,40%), and ResNet50 (78,69%). Although there is still room for improvement regarding overfitting and increasing the models’ overall performance, this result is promising enough to enhance early diagnosis and offer an appropriate and effective treatment for patients.
Traditional perception systems for TJA (Traffic Jam Assistance) are mostly implemented by fusing images with radar or lidar. As computer vision techniques become more powerful, cameras can almost replace the need for ...
详细信息
Not only common issues on object detection task need to be deal with, for Unmanned Aerial Vehicle (UAV) applications, small object is one of the critical problems that needs to be solved. YOLOv7 is a powerful network ...
详细信息
This study introduced a capacitive sensing interactive game platform aimed at promoting emotional stability, which we have named the 'Sunrise and Sunset' game. This game primarily consists of two pieces of reg...
详细信息
Deafness is a condition that results in the loss of hearing function, hindering the reception of information such as oral communication that relies on auditory senses. Consequently, individuals with hearing impairment...
详细信息
LoRa's biggest advantage is its flexibility, which is the ability to increase or decrease data rate and range while decreasing or increasing sensitivity. Whenever propagation conditions change frequently, this fun...
详细信息
Virtual Reality (VR) is a technology that allows users to interact with a simulated environment created by virtual reality capabilities through the intermediary of computers, simulator tools, and others, providing new...
详细信息
Virtual Reality (VR) is a technology that allows users to interact with a simulated environment created by virtual reality capabilities through the intermediary of computers, simulator tools, and others, providing new experiences and can feel experiences that are not possible in the real world. This research aims to identify and analyze what challenges may occur such as technical, social, and ethical barriers in the development and application of VR technology. Using a mixed methods approach, we collected quantitative data from Kaggle's dataset and conducted in-depth interviews with active VR users. Our findings show that although VR has significant potential in various fields, such as education, healthcare, and entertainment, challenges such as motion sickness, high costs, and limited and engaging content hinder the widespread adoption of VR technology. The study also highlights the importance of improving technologies that need to be implemented such as visual capabilities, motion responsiveness, and audio quality to enhance the user experience when using VR. By overcoming these barriers, VR can maximize its positive impact and become a revolutionary tool in various sectors.
Sign language is one of the technique to support communication with deaf and speech impaired people. Nowadays, human needs become more complex, so are the needs of people with those disabilities. Therefore, with the s...
详细信息
Sign language is one of the technique to support communication with deaf and speech impaired people. Nowadays, human needs become more complex, so are the needs of people with those disabilities. Therefore, with the sophistication of modern technology in the field of computerscience, it is necessary to have a sign language translation system which is capable to convert human gestures into words or spelling letters in a natural human language, especially Indonesian language and SIBI sign language. This research is conducted to analyze the capabilities of an Artificial Neural Network (ANN) system to translate SIBI sign language into Indonesian language. The analysis is performed quantitatively from the experiment results to gain a descriptive insights for the generated models. By creating two datasets containing alphabets and words in SIBI sign language, two models were generated to predict alphabets and words, respectively. The result shows that ANN model could effectively and efficiently perform a sign language translation with accuracy of alphabet prediction reached 96.15% and words prediction reached 99.45% in duration of prediction without exceeding 0.15 seconds. Regarding to the results, it can be concluded that ANN model quantitatively suitable to be implemented as an Indonesian sign language translator system.
Vision impairment, often caused by preventable ocular diseases can be challenging to diagnose accurately and prone to human error. Automation using technology, particularly deep learning, offers a promising solution t...
详细信息
Vision impairment, often caused by preventable ocular diseases can be challenging to diagnose accurately and prone to human error. Automation using technology, particularly deep learning, offers a promising solution to aid in accurate and efficient disease detection. This study explores the use of different CNN models specifically VGG-16, VGG-19, ResNet-50, and ResNet-152v2, for detecting ocular diseases. Simple fine-tuning is applied to these models, and their performance is compared to identify the most effective model. The purpose is to show how different models contribute to establishing reliable illness detection systems. The results reveal that most of these models perform well with even minimal fine-tuning. Among the models, ResNet-152v2 achieved the highest training accuracy of 90.36% demonstrating its capacity to learn from the training data. In contrast, ResNet-50 offered a more balanced performance with marginally lower accuracy, making it a robust choice for general application.
With the increase of interest in Facial Expression Recognition (FER) in the past few decades. Several challenges surfaced with the invention of many different FER models which are often based on Convolution Neural Net...
详细信息
With the increase of interest in Facial Expression Recognition (FER) in the past few decades. Several challenges surfaced with the invention of many different FER models which are often based on Convolution Neural Network (CNN) architectures. Recently, an attention-based transformer model has been presented to address FER. One of the major issues with Transformers is the need for a large data quantity for training. Therefore, in this paper, we propose to learn how to fine-tune a vision transformer-based (ViT) model using a limited dataset. We will be using the JAFFE Dataset, which consists of only 213 images containing seven different emotions. The proposed method is evaluated using several fine-tuning methods, such as adding dropout, data augmentation, and layer freezing. We compared the models implemented with 5% dropout regularization, augmented dataset (up to 5000 images), and freezing the initial model's layers, fine-tuning around a fourth of the last layers. The best model was achieved by fine-tuning ViT L-16 with 96.06% accuracy, trained with 5% dropout in the augmented dataset, and freezing the initial 21st layers. We also compared our model to the other previous work model and the results showed that our model reached the state-of-the art for the JAFFE dataset.
暂无评论