At present, common object detection methods for missile borne images often perform poorly. Because the experimental data of missile borne images is difficult to obtain, the target is small, and the imaging environment...
详细信息
ISBN:
(纸本)9781665464680
At present, common object detection methods for missile borne images often perform poorly. Because the experimental data of missile borne images is difficult to obtain, the target is small, and the imaging environment is complex and random, it is a challenge to build an appropriate object detection model for such missile borne images. Based on the classic YOLOv5, the paper constructs an aerial platform image target detection model YOLO v5mb, which is suitable for missile borne images. The model can accurately detect targets in single-mode visible or infrared missile borne images. In addition, the fusion layer architecture in YOLO v5mb makes it suitable for multi-mode visible and infrared missile borne fusion images object detection.
The clustering problem, in its many variants, has numerous applications in operations research and computer science (e.g., in applications in bioinformatics, imageprocessing, social network analysis, etc.). As sizes ...
详细信息
The clustering problem, in its many variants, has numerous applications in operations research and computer science (e.g., in applications in bioinformatics, imageprocessing, social network analysis, etc.). As sizes of data sets have grown rapidly, researchers have focused on designing algorithms for clustering problems in models of computation suited for large-scale computation such as MapReduce, Pregel, and streaming models. The k-machine model (Klauck et al., (SODA 2015) [8]) is a simple, message-passing model for large-scale distributed graph processing. This paper considers three of the most prominent examples of clustering problems: the uncapacitated facility location problem, the p-median problem, and the p-center problem and presents O (1)-factor approximation algorithms for these problems running in (O) over tilde (n/k) rounds in the k-machine model. These algorithms are optimal up to polylogarithmic factors because this paper also shows (Omega) over tilde (n/k) lower bounds for obtaining polynomial-factor approximation algorithms for these problems. These are the first results for clustering problems in the k-machine model. We assume that the metric provided as input for these clustering problems is only implicitly provided, as an edge-weighted graph and in a nutshell, our main technical contribution is to show that constant-factor approximation algorithms for all three clustering problems can be obtained by learning only a small portion of the input metric. (C) 2021 Elsevier B.v. All rights reserved.
Each day, countless individuals around the globe are affected by the extensive and complex problem of traffic congestion. This raising challenge is becoming increasingly substantial across the country, driven by facto...
详细信息
BackgroundDespite significant advances in computer-aided diagnostics, onychomycosis, a widespread fungal nail infection, lacks an automated approach for objective analysis and *** study aimed to develop and validate a...
详细信息
BackgroundDespite significant advances in computer-aided diagnostics, onychomycosis, a widespread fungal nail infection, lacks an automated approach for objective analysis and *** study aimed to develop and validate automated machine learning models to accurately detect and classify onychomycosis-affected areas in *** images in this study were captured using the Scarletred (R) vision mobile App and SkinPatch, a CE certified medical device system working seamlessly together to deliver auto-color calibrated, high-resolution clinical images. Considering a total of 1687 images from 440 subjects, the research explores various degrees of onychomycosis and evaluates the infection extent in the toenails detected. We developed an advanced machine learning algorithm for precise segmentation and classification of onychomycosis-affected toenails, utilizing expert annotations and advanced post-processing techniques. Additionally, an analysis of nail growth was performed, and a comparison graph with the percentage of infection was *** advanced machine learning algorithms, we successfully detected toenails, enabling detailed analysis of intricate structures within the images. We achieved a final validation loss of 0.0236 and an F1 score of 0.8566 for accurate toenail detection, while the Random Forest algorithm demonstrated 81% accuracy in classifying and distinguishing between infected and healthy toenail areas. Our applied superpixel method furthermore improved the algorithm's precision in identifying the infected *** AI-powered image analysis method, initially focused on the big toe's toenail, shows great promise for broader validation on comprehensive datasets, enabling more detailed assessments of onychomycosis severity and disease dynamics. The potential impact of limited patient diversity, particularly with darker skin tones, needs further assessment. Proven to measure nail growth and assess trea
The proceedings contain 16 papers. The topics discussed include: performance evaluation of recent object detection models for traffic safety applications on edge;tracking of artillery shell using optical flow;action r...
ISBN:
(纸本)9781450397926
The proceedings contain 16 papers. The topics discussed include: performance evaluation of recent object detection models for traffic safety applications on edge;tracking of artillery shell using optical flow;action recognition with non-uniform key frame selector;a view direction-driven approach for automatic room mapping in mixed reality;automatic gait gender classification using convolutional neural networks;deep 3D-2D convolutional neural networks combined with Mobinenetv2 for hyperspectral image classification;attention based BiGRU-2DCNN with hunger game search technique for low-resource document-level sentiment classification;strategies of multi-step-ahead forecasting for chaotic time series using autoencoder and LSTM neural networks: a comparative study;semi-supervised defect segmentation with uncertainty-aware pseudo-labels from multi-branch network;and security analysis of visual based share authentication and algorithms for invalid shares generation in malicious model.
With the deployment of the fifth generation (5G) wireless systems gathering momentum across the world, possible technologies for 6G are under active research discussions. In particular, the role of machine learning (M...
详细信息
ISBN:
(纸本)9798350302615
With the deployment of the fifth generation (5G) wireless systems gathering momentum across the world, possible technologies for 6G are under active research discussions. In particular, the role of machine learning (ML) in 6G is expected to enhance and aid emerging applications such as virtual and augmented reality, vehicular autonomy computer vision and internet of everything. This will result in large segments of wireless data traffic comprising image, video and speech. The ML algorithms process these for classification/recognition/estimation through the learning models located on cloud servers. This requires wireless transmission of data from edge devices to the cloud server. Channel estimation, handled separately from recognition step, is critical for accurate learning performance. Toward combining the learning for both channel and the ML data, we introduce implicit channel learning to perform the ML tasks without estimating the wireless channel. Here, the ML models are trained with channel-corrupted datasets in place of nominal data. Without channel estimation, the proposed approach exhibits approximately 60% improvement in image and speech classification tasks for diverse scenarios such as millimeter wave and IEEE 802.11p vehicular channels.
image captioning involves generating a natural language description that accurately represents the content and context of an image. To achieve this, image captioning utilises various machine learning techniques and fi...
详细信息
Today, technological advancement in production of radar images can be seen with high spatial resolution and also the availability of these images' significant growth in interpretation and processing of high-resolu...
详细信息
Convolutional neural networks (CNNs) are widely used in machine learning (ML) applications such as imageprocessing. CNN requires heavy computations to provide significant accuracy for many ML tasks. Therefore, the ef...
详细信息
Convolutional neural networks (CNNs) are widely used in machine learning (ML) applications such as imageprocessing. CNN requires heavy computations to provide significant accuracy for many ML tasks. Therefore, the efficient implementations of CNNs to improve performance using limited resources without accuracy reduction is a challenge for ML systems. One of the architectures for the efficient execution of CNNs is the array-based accelerator, that consists of an array of similar processing elements (PEs). The array accelerators are popular as high-performance architecture using the features of parallel computing and data reuse. These accelerators are optimized for a set of CNN layers, not for individual layers. Using the same accelerator dimension size to compute all CNN layers with varying shapes and sizes leads to the resource underutilization problem. We propose a flexible and scalable architecture for array-based accelerator that increases resource utilization by resizing PEs to better match the different shapes of CNN layers. The low-cost partial reconfiguration improves resource utilization and performance, resulting in a 23.2% reduction in computational times of GoogLeNet compared to the state-of-the-art accelerators. The proposed architecture decreases the on-chip memory access rate by 26.5% with no accuracy loss.
Fashion understanding is a challenging multi-modal task of interpreting multi aspects of fashion images. While traditional computer vision or multi-modal algorithms fall short in providing a comprehensive understandin...
详细信息
ISBN:
(纸本)9783031723438;9783031723445
Fashion understanding is a challenging multi-modal task of interpreting multi aspects of fashion images. While traditional computer vision or multi-modal algorithms fall short in providing a comprehensive understanding, Large vision-Language Model (LvLM) offers a new approach. However, directly using LvLMs presents four major limitations, highlighting the need for a fashion-specific LvLM. Existing fashion datasets also reveal limitations in providing a coherent natural input that fits the LvLMs. To address this bottleneck, we introduce the FUND dataset featuring meticulously annotated textual descriptions for fashion images. Specifically, we build a fashion knowledge base and collect fashion images in various categories online. By leveraging image segmentation model and GPT4, we refine the pre-annotations through manual modifications. Through instruct-tuning with FUND, we develop FashionGPT, a GPT-assisted LvLM based on a solid architecture with exceptional performance on fashion understanding. It is capable of generating coherent and multi-aspect descriptions for fashion images and greatly alleviates the four limitations. Extensive experiments quantitatively and qualitatively demonstrate the effectiveness of FashionGPT and the benefits of FUND, and showcase the broad applications in more tasks.
暂无评论