Brain-computer interfaces (BCIs) hold immense potential for restoring communication to individuals with severe speech impairments. This paper investigates the feasibility of real-time open-vocabulary sentence decoding...
详细信息
ISBN:
(数字)9798350355413
ISBN:
(纸本)9798350355420
Brain-computer interfaces (BCIs) hold immense potential for restoring communication to individuals with severe speech impairments. This paper investigates the feasibility of real-time open-vocabulary sentence decoding from magnetoencephalography (MEG) signals using deeplearning with a Transformer architecture. We propose a novel end-to-end model that leverages the high temporal and spatial resolution of MEG and the powerful sequence-to-sequence learning capabilities of Transformers. Our model is trained and evaluated on a large dataset of MEG recordings paired with natural language sentences. We demonstrate the effectiveness of our approach in achieving real-time, accurate, and flexible communication, significantly outperforming existing methods. Our findings pave the way for the development of more practical and user-friendly BCIs for individuals with speech disabilities.
Nowadays, accurate and fast vehicle detection technology is of great significance for constructing intelligent transportation systems in the context of the era of big data. This paper proposes an improved lightweight ...
详细信息
Nowadays, accurate and fast vehicle detection technology is of great significance for constructing intelligent transportation systems in the context of the era of big data. This paper proposes an improved lightweight YOLOX real-time vehicle detection algorithm. Compared with the original network, the detection speed and accuracy of the new algorithm have been improved with fewer parameters. First, referring to the GhostNet, we make a lightweight design of the backbone extraction network, which significantly reduces the network parameters, training cost, and inference time. Furthermore, by introducing the alpha-CIoU loss function, the regression accuracy of the bounding box (bbox) is improved, while the convergence speed of the model is also accelerated. The experimental results show that the mAP of the improved algorithm on the BIT-Vehicle dataset can reach up to 99.21% with 41.2% fewer network parameters and 12.7% higher FPS than the original network and demonstrate the effectiveness of our proposed method.
Existing Optical Mark Recognition (OMR) systems tend to be expensive and rigid in their operation, often resulting in erroneous evaluations due to strict correction protocols. This scenario airs the need for a flexibl...
详细信息
Existing Optical Mark Recognition (OMR) systems tend to be expensive and rigid in their operation, often resulting in erroneous evaluations due to strict correction protocols. This scenario airs the need for a flexible OMR system. Hence, in this work, we propose a lightweight transfer learning based Convolutional Neural Network (CNN) model, dubbed as OMRNet, which can classify answer boxes on any generalized OMR test sheet. Unlike most existing techniques that rely on imageprocessing algorithms to recognize extracted answer boxes in two classes: confirmed and empty, the OMRNet is designed to classify the answer boxes into confirmed, crossed-out, and empty categories. That is, OMRNet is facilitating the crossing out of previously answered questions and thus removing the rigidity of templates in Multiple Choice Question (MCQ) tests. We have built OMRNet on top of a MobileNetV2 backbone connected to four fully connected layers with appropriate dropouts and activation functions in between. We have evaluated OMRNet on the Multiple Choice Answer Boxes dataset available at . We have performed experiments following a 5 fold cross validation scheme, and OMRNet has achieved accuracies of 95.29%, 95.88%, 93.97%, 97.45%, and 97.20%, with an average accuracy of 95.96%. Also, the experimental results confirm that the present model performs better than the compared state-of-the-art methods and standard CNN models in terms of accuracy, execution time, and memory required to store the trained module. Moreover, we have employed a quantization technique to make the trained module more memory efficient and deployed it to a web app using our own Representational State Transfer Application Programming Interface (REST API). It makes OMRNet available via a Hypertext Transfer Protocol (HTTP) endpoint, allowing potential users to connect to it via the Internet. The source code for the work is available at the following link: .
Autism spectrum disorder (ASD) starts in the early childhood. Therefore, its diagnosis and classification at the right time would prevent the damages in long terms. EEG signals are non-invasive brain activity signals ...
详细信息
Autism spectrum disorder (ASD) starts in the early childhood. Therefore, its diagnosis and classification at the right time would prevent the damages in long terms. EEG signals are non-invasive brain activity signals with excellent temporal resolution and low costs. In this article, the goal is to propose a unified framework for early, efficient and noise robust diagnosis of ASD using EEG signals and with the help of deep transfer learning. In the proposed method, other that the proposed unified diagnosis framework, the main contribution is to use Cross Wavelet Transform (XWT) images for representation of brain signals. After pre-processing and segmentation of the signals, a reference signal is separated from the normal class. Using the reference signal, XWT images are generated. Produced images are fed as input to deep network architectures such as AlexNet, GoogleNet VGG19, ResNet-50 and ResNet-101 in a transfer learning procedure. Transfer learning is applied to make use of information from a source image classification domain while compensating the scarcity of ASD and normal subjects. The approach is evaluated on a dataset of 34 ASD samples and 11 normal case in two different without-voice and with-voice conditions. To validate the early diagnosis hypothesis, EEG signals from children older than 5 years are used as the training set and EEG signals from younger subjects are used as the validation set. Experiments on the proposed framework show that the ResNet-101 deep architecture has achieved the best classification performance. This classification performance is higher than recent reported approaches in terms of classification accuracy, sensitivity, specificity and F1 measure. The results show the effectiveness of the proposed approach in early diagnosis of autism spectrum disorder and also demonstrates the auditory impact on the diagnosis of autism. Also, having evaluated the approach on with-voice and without-voice datasets, the results denote the robustness o
A method for recognizing the contours of objects in a video data stream is proposed. The data will be uploaded using the video camera. Objects will be recognized in real-time. We will use YOLO—a method of identificat...
详细信息
Optical Coherence Tomography (OCT) uses low coherence light to provide a high spatial resolution to detect changes in the microstructure of living organisms in a non-invasive, real-time manner. A new OCT image denoisi...
详细信息
Bangladesh is one of the countries struggling to prevent road accidents, which is a global cause for concern. An early warning system that indicates road conditions can contribute to the prevention task. For this purp...
详细信息
Mobile edge computing improves data processing efficiency and reduces latency by deploying computing and storage resources at the network edge, making it suitable for real-time applications. In vehicular networks, due...
详细信息
Urinary tract infections (UTIs) are infections that affect the urinary system. It is usually caused by bacteria and pus cells. Analyzing urine samples, including examining pus cells, is a standard method for diagnosin...
详细信息
ISBN:
(纸本)9783031581731;9783031581748
Urinary tract infections (UTIs) are infections that affect the urinary system. It is usually caused by bacteria and pus cells. Analyzing urine samples, including examining pus cells, is a standard method for diagnosing and monitoring UTIs. However, manually detecting bacteria or pus cells in microscopic urine images is a time-consuming and labour-intensive task for microbiologists. Therefore, the segmentation of microscopic pus cell images will ease the process of detecting UTI. Especially low resolution microscopic images are hard to annotate;therefore, in this study, we propose an adversarial learning based semi-supervised segmentation method for segmentation of pus cell images at low resolution i.e. 40x using labeled high resolution images i.e. 100x. The proposed methodology aims to ease the process of UTI detection by automating the segmentation of pus cell images. The results of the proposed methodology demonstrate an increase in the Dice coefficient score percentage by 1%, 1.6% and 2.4% on 40x images when compared to fully supervised segmentation model trained on only 100x data using three different architectures- Unet, ResUnet++, and PSPnet, respectively.
Classification of Insects is one of the most vital and essential research, which needs to be done for, various factors like, for the protection of crops in the agricultural sector. The identification of crop pests is ...
详细信息
ISBN:
(数字)9783031585357
ISBN:
(纸本)9783031585340;9783031585357
Classification of Insects is one of the most vital and essential research, which needs to be done for, various factors like, for the protection of crops in the agricultural sector. The identification of crop pests is a difficult problem since, pest infestations cause significant crop damage and quality degradation. The majority of insect species are quite similar to one another that makes the task of detection of the insect on field crops like rice, soybeans, and other crops more challenging than a normal detection of objects. Currently, classifying insects manually is the major method used to distinguish them in crop fields, but this is a time - consuming and expensive operation. Considering the advancements in the field of deeplearning, we propose to use a pre-trained network model trained on a millions of images of imageNet dataset to do the classification task using transfer learning mechanism. An extensive experimentation was done using various pretrained models like VGG, inception, xception, ResNet, MobileNet, DenseNet and efficient net. Various insect datasets were used for classification task and model was fined tuned using transfer learning. The EfficientNet B7 model has achieved the highest accuracy 70%, 98% and 99% on IP102 (102 classes), Xie (40 classes) and Kaggle village Synthetic dataset (10 classes) respectively.
暂无评论