Medical image segmentation plays an important role in accurately identifying and isolating regions of interest within medical images. Generative approaches are particularly effective in modeling the statistical proper...
详细信息
Traffic accidents are one of the biggest challenges in a society where commuting is so important. What triggers an accident can be dependent on several subjective parameters and varies within each region, city, or cou...
详细信息
The total generalized variation extends the total variation by incorporating higher-order smoothness. Thus, it can also suffer from similar discretization issues related to isotropy. Inspired by the success of novel d...
详细信息
Longer training times pose a significant challenge in Artificial neural networks (ANNs) as it may leads to increasing the computational costs and decreasing the effectiveness of the model. Therefore, it is imperative ...
详细信息
Longer training times pose a significant challenge in Artificial neural networks (ANNs) as it may leads to increasing the computational costs and decreasing the effectiveness of the model. Therefore, it is imperative to reduce training times in ANNs to enhance the computational efficiency. The initialization of the weights between the layers in ANN plays a vital role in reducing training times. Appropriate weight initialization can help the network converge faster during the training by providing an optimum starting point for the network. Therefore, weight initialization techniques are essential for efficient training of ANNs. This paper revisits and implements different popular weight initialization techniques in ANNs and analyzes their impact on training time. Specifically, this paper implements Gaussian-based, Kaming-based, and Xavier-based weight initiation atop a popular DNN-based network. The experiments are conducted by employing a well-known dataset. The results show that the scenario when no weight initiation is applied consumed the highest training time, whereas different weight initiation techniques contribute in reducing the training times for the network.
In this paper, we present a research platform to support studying collaboration in hybrid and co-located scenarios. Mixed-presence collaboration includes various novel and exciting use cases, such as situated and imme...
详细信息
ISBN:
(数字)9798331506919
ISBN:
(纸本)9798331506926
In this paper, we present a research platform to support studying collaboration in hybrid and co-located scenarios. Mixed-presence collaboration includes various novel and exciting use cases, such as situated and immersive data analysis by multiple users. However, research in this emerging field is hindered by the technical complexity of the setups and often requires re-implementation of common features. We address this issue by contributing a toolkit and research platform for mixed-presence collaboration that serves as an extensible baseline implementation and enables fast prototyping for user studies in collaborative mixed reality. Furthermore, our platform provides adjustable parameters, such as types of avatars, audio source placement, or the amount of simulated network latency. This way, developers are supported in making design choices regarding typical, re-occurring technical challenges.
We propose a novel method for unsupervised semantic image segmentation based on mutual information maximization between local and global high-level image features. The core idea of our work is to leverage recent progr...
详细信息
Digital imaging aims to replicate realistic scenes, but Low Dynamic Range (LDR) cameras cannot represent the wide dynamic range of real scenes, resulting in under-/overexposed images. This paper presents a deep learni...
Digital imaging aims to replicate realistic scenes, but Low Dynamic Range (LDR) cameras cannot represent the wide dynamic range of real scenes, resulting in under-/overexposed images. This paper presents a deep learning-based approach for recovering intricate details from shadows and highlights while reconstructing High Dynamic Range (HDR) images. We formulate the problem as an image-to-image (I2I) translation task and propose a conditional Denoising Diffusion Probabilistic Model (DDPM) based framework using classifier-free guidance. We incorporate a deep CNN-based autoencoder in our proposed framework to enhance the quality of the latent representation of the input LDR image used for conditioning. Moreover, we introduce a new loss function for LDR-HDR translation tasks, termed Exposure Loss. This loss helps direct gradients in the opposite direction of the saturation, further improving the results’ quality. By conducting comprehensive quantitative and qualitative experiments, we have effectively demonstrated the proficiency of our proposed method. The results indicate that a simple conditional diffusion-based method can replace the complex camera pipeline-based architectures.
作者:
Stanczyk, UrszulaDepartment of Graphics
Computer Vision and Digital Systems Faculty of Automatic Control Electronics and Computer Science Silesian University of Technology Akademicka 2A Gliwice44-100 Poland
Relative or decision reducts belong with mechanisms dedicated to feature selection, and they are embedded in rough set approach to data processing. Algorithms for reduct construction typically aim at dimensionality re...
详细信息
Traffic safety on highways is supported by a variety of technical measures, including countless camera systems that are often only monitored by human operators. However, due to the sheer amount of data, safety monitor...
Traffic safety on highways is supported by a variety of technical measures, including countless camera systems that are often only monitored by human operators. However, due to the sheer amount of data, safety monitoring and accident prevention are limited by human resources. In this paper, we present an efficient system capable of extracting accurate vehicle trajectories from the vast amount of video data generated by modern highway infrastructures. Our proposed system conveniently leverages bird's eye view transformations estimated from aerial data or street marker geometry to generate geo-localized trajectories. Utilizing existing infrastructure, we demonstrate that the central data for video-based highway traffic monitoring can be reliably extracted. Remarkably, this can be achieved solely relying on uncalibrated cameras and noisy video streams.
Digital imaging aims to replicate realistic scenes, but Low Dynamic Range (LDR) cameras cannot represent the wide dynamic range of real scenes, resulting in under-/overexposed images. This paper presents a deep learni...
详细信息
暂无评论