ISBN: (print) 9781510689237
The proceedings contain 19 papers. The topics discussed include: VIVAR: learning view-invariant embedding for video action recognition; railway clearance intrusion detection using feature fusion enhancement neural network; progressive self-supervised spatio-temporal feature learning based on video sequence saliency; a student behavior recognition algorithm based on improved MobileNetV2; survey on emotion recognition systems based on facial expressions combined with neurological and cardiac signals; does display size affect short-term memory tasks?; dual photography using hierarchical orthogonal codes; SPGMS: an annotated benchmark for grain-matrix segmentation in sandstone photomicrographs; and research on the application of VR in the learning of different algorithm models and the construction of spatial interaction models.
In order to improve the testing efficiency and accuracy of the external interface and human-machine interaction page of the integrated display control system in the ground integrated test of helicopter avionics system...
This study proposes an innovative algorithm based on DCNN and multi-channel image fusion, aiming to improve the quality and efficiency of virtual scene image generation. The algorithm extracts depth information and te...
Flexible screens consist of various layered structures, including a protective layer and a display layer. The display layer with limited deformation ability is expected to be in the neutral layer to minimize stress du...
Under-display camera (UDC) is an emerging technology for full-screen displays in which the camera sits under the display. However, the current implementation of UDC causes serious image degradation. Incident light required for camera i...
The field of image processing plays a vital role in driving technological changes that result in real-time applications. Image scaling is one such fundamental method that helps to resolve storage issues and al...
ISBN: (print) 9789819794362; 9789819794379
Image captioning evaluation is of great significance in guiding caption generation, and practically valuable evaluation requires fine-grained metrics. However, most current metrics can only measure the overall quality of a caption, which means users learn nothing about the caption's specific defects when it receives a low score. As fidelity is one of the quality criteria of image captioning and has a great impact on applications, we propose WBQA, a metric that evaluates the fidelity of a caption by Weighted Boolean Question Answering. WBQA can show which part of a caption is unfaithful, because the questions that check whether the caption matches the image are expressed in natural language. Experiments show that our metric is excellent for measuring fidelity (raising the Pearson correlation coefficient from 0.31 and 0.34 to 0.52 and 0.58) and achieves state-of-the-art results on multiple image captioning evaluation datasets.
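The abstract does not say how WBQA builds, weights, or answers its questions, so the following is only a minimal sketch of the general idea of a weighted boolean question-answering score. The `BooleanCheck` type, the `weighted_fidelity` function, the weights, and the example questions are all hypothetical and are not taken from the paper; the answers are supplied by hand here, where a real system would presumably obtain them from a visual question-answering model.

```python
from dataclasses import dataclass


@dataclass
class BooleanCheck:
    question: str  # yes/no question derived from the caption (hypothetical)
    weight: float  # assumed importance of this question (hypothetical)
    answer: bool   # True if the image supports the caption on this point


def weighted_fidelity(checks):
    """Return (score, failed_questions) for a list of BooleanCheck items.

    The score is the weight of the questions answered "yes" divided by
    the total weight. This is an assumed scoring rule, not the paper's.
    """
    total = sum(c.weight for c in checks)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    passed = sum(c.weight for c in checks if c.answer)
    failed = [c.question for c in checks if not c.answer]
    return passed / total, failed


# Hand-written answers stand in for the output of a question-answering model.
checks = [
    BooleanCheck("Is there a dog in the image?", 2.0, True),
    BooleanCheck("Is the dog on a beach?", 1.0, False),
    BooleanCheck("Is the dog running?", 1.0, True),
]
score, failed = weighted_fidelity(checks)
print(score, failed)
```

Returning the failed questions alongside the score mirrors the abstract's point that a natural-language question can indicate which part of a caption is unfaithful, rather than reporting only a single number.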
Standard No.: US9769425(B1)
A video conferencing system contains one or more display adjusting components, whereby an object to be displayed can be adjusted to appropriately fit various sized display screens. A display adjusting component is contained within the sending client, which adjusts the image of the object to be appropriately displayed to one or more receiving clients. The receiving clients also contain a display adjusting component, which can further adjust the image of the object to be displayed, as necessary. The multimedia conferencing server of the video conferencing system also contains a display adjusting component, which negotiates parameters of the sending and receiving clients. Any of the display adjusting components can function alone, or in any combination together. A method, and computer-readable media which contain computer-readable instructions to perform a method, of adjusting an image for video conference display are also described.
ISBN: (print) 9789819620531; 9789819620548
Plenoptic cameras capture 3D information from the scene in the form of a dense light field, thanks to an array of micro-lenses placed in front of the sensor. These cameras have great potential for future 3D remote applications. Therefore, it is crucial to explore effective rendering methods for 3D displays using this data. Until now, only a single pipeline has been proposed; it relies on external software to convert the plenoptic image into a dense multi-view format, which makes it slow and prone to artifacts. In this paper, we consider using the plenoptic image as the input light field data for conventional rendering methods, instead of the traditional multi-view format. Hence, we modify the camera projection to a plenoptic one for 3D display image generation without the need for external software. Our method only takes a few seconds and works with any plenoptic camera model.
The fidelity of reconstruction of 3-dimensional images by reflection and transmission holograms is investigated, concentrating on the measurement and theoretical modelling of large-scale geometric distortions in holograms illuminated by white or monochromatic light. Variable parameters include: recording and replay wavelengths; object position at recording; emulsion thickness and refractive index; replay light-source angle and distance; presence of a glass substrate, its thickness and refractive index. Holograms were recorded on Agfa 8E56HD photographic plates, and measurements made of image-point directions as a function of position on the hologram, revealing considerable image distortions even for modest changes in parameter values between recording and replay. The results are accurately modelled by the theory, which gives numerical values for point-by-point image positions, without any resort to paraxial approximations.