This 2-volume LNCS conference set constitutes the proceedings of the 17th Asian Conference on computervision, in Hanoi, Vietnam, held during December 8–12, 2024. The 269 full papers included in this volume were care...
详细信息
ISBN:
(数字)9789819626441
ISBN:
(纸本)9789819626434
This 2-volume LNCS conference set constitutes the proceedings of the 17th Asian Conference on computervision, in Hanoi, Vietnam, held during December 8–12, 2024. The 269 full papers included in this volume were carefully reviewed and selected from 839 submissions.
The conference presents and discusses new problems, solutions, and technologies in computervision, machine learning, and related areas in artificial intelligence.
This book constitutes the referred proceedings of the First International Conference on Pattern Analysis and Machine Intelligence, ICPAMI 2024, held in Shanghai, China, from August 30 to September 01, 2...
详细信息
ISBN:
(数字)9789819633494
ISBN:
(纸本)9789819633487
This book constitutes the referred proceedings of the First International Conference on Pattern Analysis and Machine Intelligence, ICPAMI 2024, held in Shanghai, China, from August 30 to September 01, 2024.
The 28 papers presented here were carefully reviewed and selected from 56 submissions. These papers have been organized under the following topical sections: computervision and Pattern Recognition; Natural Language processing (NLP) and Machine Learning; Intelligent System and Optimization Algorithm.
Airborne optical sectioning, an effective aerial synthetic aperture imaging technique for revealing artifacts occluded by forests, requires precise measurements of drone poses. In this letter, we present a new approac...
详细信息
Airborne optical sectioning, an effective aerial synthetic aperture imaging technique for revealing artifacts occluded by forests, requires precise measurements of drone poses. In this letter, we present a new approach for reducing pose estimation errors beyond the possibilities of conventional Perspective-n-Point solutions by considering the underlying optimization as a focusing problem. We present an efficient image integration technique, which also reduces the parameter search space to achieve realistic processing times, and improves the quality of resulting synthetic integral images.
This work provides a solution to the challenge of small amounts of training data in Non-Destructive Ultrasonic Testing for composite components. It was demonstrated that direct simulation alone is ineffective at produ...
详细信息
This work provides a solution to the challenge of small amounts of training data in Non-Destructive Ultrasonic Testing for composite components. It was demonstrated that direct simulation alone is ineffective at producing training data that was representative of the experimental domain due to poor noise reconstruction. Therefore, four unique synthetic data generation methods were proposed which use semi-analytical simulated data as a foundation. Each method was evaluated for its performance in the classification of real experimental images when trained on a Convolutional Neural Network which underwent hyperparameter optimization using a genetic algorithm. The first method introduced task specific modifications to CycleGAN, a generative network for image -to-image translation, to learn the mapping from physics-based simulations of defect indications to experimental indications in resulting ultrasound images. The second method was based on combining real experimental defect free images with simulated defect responses. The final two methods fully simulated the noise responses at an image and signal level respectively. The purely simulated data produced a mean classification F1 score of 0.394. However, when trained on the new synthetic datasets, a significant improvement in classification performance on experimental data was realized, with mean classification F1 scores of 0.843, 0.688, 0.629, and 0.738 for the respective approaches.
Fast keypoint recognition is essential to many vision tasks. In contrast to the classification-based approaches, we directly formulate the keypoint recognition as an image patch retrieval problem, which enjoys the mer...
详细信息
Fast keypoint recognition is essential to many vision tasks. In contrast to the classification-based approaches, we directly formulate the keypoint recognition as an image patch retrieval problem, which enjoys the merit of finding the matched keypoint and its pose simultaneously. To effectively extract the binary features from each patch surrounding the keypoint, we make use of treelets transform that can group the highly correlated data together and reduce the noise through the local analysis. Treelets is a multiresolution analysis tool, which provides an orthogonal basis to reflect the geometry of the noise-free data. To facilitate the real-world applications, we have proposed two novel approaches. One is the convolutional treelets that capture the image patch information locally and globally while reducing the computational cost. The other is the higher-order treelets that reflect the relationship between the rows and columns within image patch. An efficient sub-signature-based locality sensitive hashing scheme is employed for fast approximate nearest neighbor search in patch retrieval. Experimental evaluations on both synthetic data and the real-world Oxford dataset have shown that our proposed treelets binary feature retrieval methods outperform the state-of-the-art feature descriptors and classification-based approaches.
In this first issue of volume 19 in 2022, I would like to begin by paying tribute to Dr. Matthias Carlsohn who shared the duties of editor-in-chief with me from the inception of the journal 16 years ago in 2006 till t...
详细信息
In this first issue of volume 19 in 2022, I would like to begin by paying tribute to Dr. Matthias Carlsohn who shared the duties of editor-in-chief with me from the inception of the journal 16 years ago in 2006 till the end of last year 2021.
[...]the papers associated with a topical collection would appear as they get accepted under a listing for the topical collection.
Manuscripts having text similarities with previous publications, even by the same authors, are normally rejected and not placed into the review pipeline.
[...]authors are urged to pay a very close attention to this matter by making sure that no text similarities appear in their manuscripts before they submit them to JRTIP.
This paper proposed an efficient omnidirectional surveillance system for i-habitat (intelligent habitat or smart home in another name) security. In this surveillance system, the omnidirectional scenes in a room, kitch...
详细信息
This paper proposed an efficient omnidirectional surveillance system for i-habitat (intelligent habitat or smart home in another name) security. In this surveillance system, the omnidirectional scenes in a room, kitchen, balcony, dining hall, corridor, garage or backyard within an i-habitat are first captured using wireless webcam attached to a specially designed hyperbolic optical mirror. The captured scenes are then fed into a wireless router connected to laptop computer for imageprocessing and alarm purposes. A mapping algorithm is developed to match the captured omnidirectional image into panoramic image, hence providing the observer or imageprocessing tools a complete wide angle of view. An efficient trespasser detection algorithm is designed for trespasser detection purpose, and an efficient faint detection algorithm is designed to identify/monitor if there is any member who stay in i-habitat (old-folks or ill-patients) who fall down or faint and thus, immediate attention can be carried out. Based on the experimental findings, both the proposed trespasser and faint detection algorithm achieves high accuracy.
Nestor is a real-time recognition and camera pose estimation system for planar shapes. The system allows shapes that carry contextual meanings for humans to be used as Augmented Reality (AR) tracking targets. The user...
详细信息
Nestor is a real-time recognition and camera pose estimation system for planar shapes. The system allows shapes that carry contextual meanings for humans to be used as Augmented Reality (AR) tracking targets. The user can teach the system new shapes in real time. New shapes can be shown to the system frontally, or they can be automatically rectified according to previously learned shapes. Shapes can be automatically assigned virtual content by classification according to a shape class library. Nestor performs shape recognition by analyzing contour structures and generating projective-invariant signatures from their concavities. The concavities are further used to extract features for pose estimation and tracking. Pose refinement is carried out by minimizing the reprojection error between sample points on each image contour and its library counterpart. Sample points are matched by evolving an active contour in real time. Our experiments show that the system provides stable and accurate registration, and runs at interactive frame rates on a Nokia N95 mobile phone.
Spatial augmented reality is especially interesting for the design process of a car, because a lot of virtual content and corresponding real objects are used. One important issue in such a process is that the designer...
详细信息
Spatial augmented reality is especially interesting for the design process of a car, because a lot of virtual content and corresponding real objects are used. One important issue in such a process is that the designer can trust the visualized colors on the real object, because design decisions are made on basis of the projection. In this paper, we present an interactive visualization technique which is able to exactly compute the RGB values for the projected image, so that the resulting colors on the real object are equally perceived as the real desired colors. Our approach computes the influences of the ambient light, the material, the pose and the color model of the projector to the resulting colors of the projected RGB values by using a physically based computation. This information allows us to compute the adjustment for the RGB values for varying projector positions at interactive rates. Since the amount of projectable colors does not only depend on the material and the ambient light, but also on the pose of the projector, our method can be used to interactively adjust the range of projectable colors by moving the projector to arbitrary positions around the real object. We further extend the mentioned method so that it is applicable to multiple projectors. All methods are evaluated in a number of experiments.
暂无评论