In the rapidly advancing field of computer vision, object detection has become crucial for various applications, including animal tracking, face detection, and surveillance systems. This study investigates the efficac...
详细信息
ISBN:
(数字)9798350370249
ISBN:
(纸本)9798350370270
In the rapidly advancing field of computer vision, object detection has become crucial for various applications, including animal tracking, face detection, and surveillance systems. This study investigates the efficacy of contemporary object detection methodologies by evaluating the performance of the You Only Look Once (YOLO) models and TensorFlow Model Zoo architectures for animal tracking. YOLO models, known for their ability to process entire images in real-time and predict bounding boxes and class probabilities simultaneously, offer significant advantages over traditional methods such as Convolutional Neural Networks (CNNs) and Fast R-CNNs. This paper compares the performance of YOLOv5 and YOLOv7, alongside TensorFlow-based models like Faster R-CNN ResNetv152 and SSD ResNet101, using a dataset of animal images. Our findings reveal that YOLOv5 outperforms other models with a mean average precision (mAP) of 9 7.5%, demonstrating superior accuracy and efficiency in object detection tasks. YOLOv7 also shows strong performance with an mAP of 96.7%, while TensorFlow Model Zoo’s Faster R-CNN and SSD models lag behind with mAPs of 81.9% and 81.6%, respectively. The results highlight the significant advancements in deep learning and object detection algorithms, particularly the advantages of YOLO’s architecture in handling complex detection tasks in real-world scenarios.
This paper presents an information-theoretic framework for unifying active learning problems: level set estimation (LSE), Bayesian optimization (BO), and their generalized variant. We first introduce a novel active le...
详细信息
Bayesian optimization (BO) has recently been extended to the federated learning (FL) setting by the federated Thompson sampling (FTS) algorithm, which has promising applications such as federated hyperparameter tuning...
详细信息
Named Data Networking (NDN) is part of an NSF research project that started in 2010 and was created to be implemented as a future Internet architecture. NDN as a network service has evolved from an Internet host-based...
详细信息
Hardware prefetching is a latency-hiding technique that hides the costly off-chip DRAM accesses. Although hardware prefetching is an extensively researched topic with many state-of-the-art data prefetchers pushing the...
详细信息
ISBN:
(纸本)9798350342543
Hardware prefetching is a latency-hiding technique that hides the costly off-chip DRAM accesses. Although hardware prefetching is an extensively researched topic with many state-of-the-art data prefetchers pushing the performance limits, prefetching for irregular applications with hard-to-predict access patterns is still a challenging problem to solve. The usage of neural networks for hardware prefetching is a promising direction, especially for predicting irregular memory access patterns. This paper presents Drishyam, a novel hardware prefetcher based on computer vision algorithms that use images to learn memory access patterns and predict future memory accesses with high accuracy and coverage. For hardware prefetching, an image is a graphical representation of memory accesses observed over time. For a sequence of memory addresses, Drishyam creates images that predict the future addresses by predicting the future OS page and a cache line offset within the OS page. Drishyam outperforms Voyager, the state-of-the-art machine learning (ML) based prefetcher, for a set of irregular benchmarks by an average of 4.7% with an average prefetch accuracy and prefetch coverage of 89.5% and 66.6%, respectively. In terms of training time, Drishyam outperforms Voyager by 225.5%.
Recognition of Bengali sign language characters is crucial for facilitating communication for the deaf and hard-of-hearing population in Bengali-speaking regions, which encompass approximately 430 million people world...
详细信息
ISBN:
(数字)9798350370249
ISBN:
(纸本)9798350370270
Recognition of Bengali sign language characters is crucial for facilitating communication for the deaf and hard-of-hearing population in Bengali-speaking regions, which encompass approximately 430 million people worldwide. Despite the significant number of individuals requiring this support, research on Bengali sign language character recognition remains underdeveloped. This article presents a novel approach to categorize Bengali sign language characters using the Ishara-Lipi dataset, based on convolutional neural networks (CNNs) and pretrained models. We evaluated our approach using metrics such as accuracy, precision, recall, F1-score, and confusion matrices. Our findings indicate that the CNN model achieved the highest performance with an accuracy of 98%, followed by VGG19 with $\mathbf{94\%}$ and ResNet variants achieving around $\mathbf{88\%}$. The proposed model demonstrates robust and efficient classification capabilities, significantly bridging the gap in existing literature. This study holds substantial promise for enhancing assistive technology, thereby improving social inclusion and quality of life for Bengalispeaking deaf and hard-of-hearing individuals.
Optimizing the design, performance, and resource efficiency of wireless networks (WNs) necessitates the ability to discern Line of Sight (LoS) and Non-Line of Sight (NLoS) scenarios across diverse applications and env...
详细信息
ISBN:
(数字)9798350362244
ISBN:
(纸本)9798350362251
Optimizing the design, performance, and resource efficiency of wireless networks (WNs) necessitates the ability to discern Line of Sight (LoS) and Non-Line of Sight (NLoS) scenarios across diverse applications and environments. Unmanned Aerial Vehicles (UAVs) exhibit significant potential in this regard due to their rapid mobility, aerial capabilities, and payload characteristics. Particularly, UAVs can serve as vital nonterrestrial base stations (NTBS) in the event of terrestrial base station (TBS) failures or downtime. In this paper, we propose CNN autoencoder resizer (CAR) as a framework that improves the accuracy of LoS/NLoS detection without demanding extra power consumption. Our proposed method increases the mean accuracy of detecting LoS/NLoS signals from 66% to 86%, while maintaining consistent power consumption levels. In addition, the resolution provided by CAR shows that it can be employed as a preprocessing tool in other methods to enhance the quality of signals.
AI is considered as most disruptive language and revolutionized various sectors with its ability analyze data with its large language model. incapability of other AI to read PDF and accept prompt and do text generatio...
详细信息
ISBN:
(数字)9798350354218
ISBN:
(纸本)9798350354225
AI is considered as most disruptive language and revolutionized various sectors with its ability analyze data with its large language model. incapability of other AI to read PDF and accept prompt and do text generation using their large language models .The work is based on proposing a new technique to integrate it into current AI. It will eliminate the need manual work of copy pasting Portable Document Format data to AI for prompt . In this can rely on Technologies Like Optical character recognition for reading images to text but challenging part is Portable Document Format to image and then image to text. this work is advancement of Optical character recognition which can eliminate the drawback of already existing technology.
Although built-in self-repair (BISR) techniques have been widely used to improve memory yield, their applications to the testing of 3D systems-on-chip (SoC) remained primarily unexplored. In this manuscript, we presen...
Although built-in self-repair (BISR) techniques have been widely used to improve memory yield, their applications to the testing of 3D systems-on-chip (SoC) remained primarily unexplored. In this manuscript, we present a multi-stage approach to implement BISR in 3D SoCs with an aim to (i) reduce test time by proposing a test scheduling technique satisfying given power constraints, (ii) reduce the number of BISR modules, and (iii) to place BISR circuitry in suitable layers for facilitating thermal dissipation. Experimental results on several SoC benchmarks show that our approach reduces both test time as well as the cost of BISR architecture in most cases.
While the availability of multispectral imagery (MSI) is increasing, there is still much to be learned regarding the spectral sensitivity of deep neural networks (DNNs). This work presents and analyzes DNNs trained to...
详细信息
ISBN:
(数字)9798350360325
ISBN:
(纸本)9798350360332
While the availability of multispectral imagery (MSI) is increasing, there is still much to be learned regarding the spectral sensitivity of deep neural networks (DNNs). This work presents and analyzes DNNs trained to perform object detection in high-resolution multispectral, satellite imagery. These DNNs are first trained and evaluated on variations of the xView dataset with varied bit-dept.s and spectral band inclusion. Inference is also performed on test imagery with different bit-dept.s or with missing spectral information in order to analyze their importance for object detection. Initial experiments show an MSI-trained DNN had an average optimal F1-score 3.7 points higher than the best performing RGB network. Additional experiments suggest significant DNN sensitivity to changes in bit-dept. and an even greater sensitivity to the loss of any given spectral band. The results given here highlight the need for future research regarding spectral biases of DNNs for object detection in high-resolution MSI.
暂无评论