In order to detect COVID-19 positive cases more quickly, in this paper, a deep learning model is used and a CNN network is designed for model training. In addition, some classical networks such as Resnet, VGG and some...
In order to detect COVID-19 positive cases more quickly, in this paper, a deep learning model is used and a CNN network is designed for model training. In addition, some classical networks such as Resnet, VGG and some lightweight networks are tried to train the model and analyze their advantages and disadvantages. Finally, a Resnet network is designed. The Fire module is introduced to reduce the network depth and parameter quantity and make the network pay attention to the global information. The performance of the model on the three loss functions of CrossEntropyLoss, FocalLoss and Poly1FocalLoss is compared, and the optimal loss function is CrossEntropyLoss. The network model has achieved good results in both the training set and the verification set. The accuracy in the training set is 95%, the average accuracy in the verification set is about 80%, and the accuracy in the test set is nearly 75%. Because the model can be trained under the CPU, compared with the 84% accuracy of the original author's DenseNet test set, the precision has slightly decreased, and the recall rate has increased by 4 percentage points. However, in comparison, the speed and accuracy of the improved model have been improved under the Resnet 34 model.
Recent neural rendering approaches greatly improve image quality, reaching near photorealism. However, the underlying neural networks have high runtime, precluding telepresence and virtual reality applications that re...
详细信息
ISBN:
(纸本)9781665493468
Recent neural rendering approaches greatly improve image quality, reaching near photorealism. However, the underlying neural networks have high runtime, precluding telepresence and virtual reality applications that require high resolution at low latency. The sequential dependency of layers in deep networks makes their optimization difficult. We break this dependency by caching information from the previous frame to speed up the processing of the current one with an implicit warp. The warping with a shallow network reduces latency and the caching operations can further be parallelized to improve the frame rate. In contrast to existing temporal neural networks, ours is tailored for the task of rendering novel views of faces by conditioning on the change of the underlying surface mesh. We test the approach on view-dependent rendering of 3D portrait avatars, as needed for telepresence, on established benchmark sequences. Warping reduces latency by 70% (from 49.4ms to 14.9ms on commodity GPUs) and scales frame rates accordingly over multiple GPUs while reducing image quality by only 1%, making it suitable as part of end-to-end view-dependent 3D teleconferencing applications.
Although the single image dehazing techniques now in use work well in certain circumstances, real-world scenarios' complex and ever-changing nature makes it impossible for them to function well. This research offe...
详细信息
Referring image segmentation, the task of segmenting any arbitrary entities described in free-form texts, opens up a variety of vision applications. However, manual labeling of training data for this task is prohibiti...
ISBN:
(纸本)9798350307184
Referring image segmentation, the task of segmenting any arbitrary entities described in free-form texts, opens up a variety of vision applications. However, manual labeling of training data for this task is prohibitively costly, leading to lack of labeled data for training. We address this issue by a weakly supervised learning approach using text descriptions of training images as the only source of supervision. To this end, we first present a new model that discovers semantic entities in input image and then combines such entities relevant to text query to predict the mask of the referent. We also present a new loss function that allows the model to be trained without any further supervision. Our method was evaluated on four public benchmarks for referring image segmentation, where it clearly outperformed the existing method for the same task and recent open-vocabulary segmentation models on all the benchmarks.
Chest radiography allows a detailed inspection of a patient's thorax via an imaging modality, but requires specialized training for proper interpretation. With the advent of high performance general purpose image ...
详细信息
RAW files are the initial measurement of scene radiance widely used in most cameras, and the ubiquitously-used RGB images are converted from RAW data through image Signal processing (ISP) pipelines. Nowadays, digital ...
ISBN:
(纸本)9798350307184
RAW files are the initial measurement of scene radiance widely used in most cameras, and the ubiquitously-used RGB images are converted from RAW data through image Signal processing (ISP) pipelines. Nowadays, digital images are risky of being nefariously manipulated. Inspired by the fact that innate immunity is the first line of body defense, we propose DRAW, a novel scheme of defending images against manipulation by protecting their sources, i.e., camera-shooted RAWs. Specifically, we design a lightweight Multi-frequency Partial Fusion Network (MPF-Net) friendly to devices with limited computing resources by frequency learning and partial feature fusion. It introduces invisible watermarks as protective signal into the RAW data. The protection capability can not only be transferred into the rendered RGB images regardless of the applied ISP pipeline, but also is resilient to post-processing operations such as blurring or compression. Once the image is manipulated, we can accurately identify the forged areas with a localization network. Extensive experiments on several famous RAW datasets, e.g., RAISE, FiveK and SIDD, indicate the effectiveness of our method. We hope that this technique can be used in future cameras as an option for image protection, which could effectively restrict image manipulation at the source.
The following paper presents a workflow for biomedical imageprocessing. On the base of accomplished analysis the software for biological and biomedical imageprocessing Cell Profiler is used. The designed workflow is...
详细信息
In order to address current water challenges, scientific research on water-related issues is crucial. However, traditional techniques for selecting research topics, such as literature reviews and expert opinions, can ...
详细信息
ISBN:
(纸本)9798350326970
In order to address current water challenges, scientific research on water-related issues is crucial. However, traditional techniques for selecting research topics, such as literature reviews and expert opinions, can be time-consuming and may not provide a comprehensive overview of available information. We propose using Natural Language processing (NLP) techniques in this study to extract, align, and compare water research topics from different corpora. We apply these techniques to the research paper abstracts from the New Mexico Water Resources Research Institute (NMWRRI) and the U.S. Geological Survey (USGS) to assess these institutions' current research interests and identify potential new research directions. We use a Latent Dirichlet Allocation (LDA) model for topic extraction and a Word2Vec model for topic alignment. This study highlights the benefits of using NLP techniques to analyze trends and identify novel research directions in water studies.
In the field of remote sensing, multimodal data matching is a problem with great challenges, and the matching between SAR and visible light images is tremendously difficult, because there is obvious speckle noise in S...
详细信息
The convolutional neural network (CNN) is a potent and popular neural network types and has been crucial to deep learning in recent years. A standard CNN which is known as 2-dimensions CNN was first proposed to solve ...
详细信息
暂无评论