We introduce LDL, a fast and robust algorithm that localizes a panorama to a 3D map using line segments. LDL focuses on the sparse structural information of lines in the scene, which is robust to illumination changes ...
We introduce LDL, a fast and robust algorithm that localizes a panorama to a 3D map using line segments. LDL focuses on the sparse structural information of lines in the scene, which is robust to illumination changes and can potentially enable efficient computation. While previous line-based localization approaches tend to sacrifice accuracy or computation time, our method effectively observes the holistic distribution of lines within panoramic images and 3D maps. Specifically, LDL matches the distribution of lines with 2D and 3D line distance functions, which are further decomposed along principal directions of lines to increase the expressiveness. The distance functions provide coarse pose estimates by comparing the distributional information, where the poses are further optimized using conventional local feature matching. As our pipeline solely leverages line geometry and local features, it does not require costly additional training of line-specific features or correspondence matching. Nevertheless, our method demonstrates robust performance on challenging scenarios including object layout changes, illumination shifts, and large-scale scenes, while exhibiting fast pose search terminating within a matter of milliseconds. We thus expect our method to serve as a practical solution for line-based localization, and complement the well-established point-based paradigm. The code for LDL is available through the following link: https://***/82magnolia/panoramic-localization.
We present a 4-channel 15 kS/s Voltage-to-Time Converter (VTC) analog front-end (AFE) with a $0.49\mu W$ impulse-based galvanic uplink for a peripheral nerve interface. Multiple, low-noise, high-data-rate channels a...
详细信息
ISBN:
(数字)9798350387179
ISBN:
(纸本)9798350387186
We present a 4-channel 15 kS/s Voltage-to-Time Converter (VTC) analog front-end (AFE) with a
$0.49\mu W$
impulse-based galvanic uplink for a peripheral nerve interface. Multiple, low-noise, high-data-rate channels are needed to sense compound action potentials and measure their conduction velocity as they propagate down a peripheral nerve. To achieve high energy efficiency for these constraints, the AFE encodes and transmits data with time-domain charge-balanced impulses through an implantable galvanic link. Each channel consists of an integrator with charge-based sampling and amplification for rapid multiplexing. A shared VTC encodes the amplitude-domain outputs of each integrator into differential time-domain impulses. Since the timing can be synchronized with stimulation, this AFE achieves instant artifact recovery after rail-to-rail stimulation events. We designed this AFE in a 180nm CMOS process, and the simulation results show an SNDR of 60dB and noise of
$3.7\mu Vrms$
. Thanks to the new galvanic uplink protocol, this front-end only consumes
$11.28\mu W$
including wireless data transmission for four channels.
Computed tomography is a widely used imaging modality with applications ranging from medical imaging to material analysis. One major challenge arises from the lack of scanning information at certain angles, leading to...
详细信息
We present Reusable Motion prior (ReMP), an effective motion prior that can accurately track the temporal evolution of motion in various downstream tasks. Inspired by the success of foundation models, we argue that a ...
详细信息
This research aims to construct a two-dimensional image to represent an underwater geometry map with a Side Scan Sonar (SSS) mounted on a Hybrid Autonomous Underwater Glider (HAUG). Building the underwater map has two...
This research aims to construct a two-dimensional image to represent an underwater geometry map with a Side Scan Sonar (SSS) mounted on a Hybrid Autonomous Underwater Glider (HAUG). Building the underwater map has two stages of the process. The first stage is preprocessing which includes Time-varying gains (TVG), slant range correction, and ground range correction. The second is HAUG navigation, the movement of HAUG orientation to the angle and distance of SSS readings and plotting the geometry of the grid map. With these two stages, a two-dimensional grid map will be formed. Meanwhile, the 3D plot utilizes slant range and ground range correction to obtain the surface height of the bottom of the pool. From the experiments conducted to conduct mapping in the pool, two-dimensional and three-dimensional visual forms of the pool floor and pool walls were produced. The results of the estimated measurement of the width of the pool obtained 14.05 meters and the height of the pool 3.199 meters with an SSS beam angle of 58 degrees.
Conventional boost converters operating with hard-switching result in low conversion efficiency and increased electromagnetic interference emissions. In this paper, a cost-efficient passive snubber is proposed with a ...
详细信息
ISBN:
(数字)9784885523472
ISBN:
(纸本)9798350349498
Conventional boost converters operating with hard-switching result in low conversion efficiency and increased electromagnetic interference emissions. In this paper, a cost-efficient passive snubber is proposed with a few additional components: two diodes, one capacitor, and one inductor. Moreover, because these snubber components are not located on the main power processing path, they only require low ratings, resulting in improved cost-effectiveness.
Recent empirical work has shown that human children are adept at learning and reasoning with probabilities. Here, we model a recent experiment investigating the development of school-age children's non-symbolic pr...
详细信息
The exact 3D dynamics of the human body provides crucial evidence to analyze the consequences of the physical interaction between the body and the environment, which can eventually assist everyday activities in a wide...
The exact 3D dynamics of the human body provides crucial evidence to analyze the consequences of the physical interaction between the body and the environment, which can eventually assist everyday activities in a wide range of applications. However, optimizing for 3D configurations from image observation requires a significant amount of computation, whereas real-world 3D measurements often suffer from noisy observation or complex occlusion. We resolve the challenge by learning a latent distribution representing strong temporal priors. We use a conditional variational autoencoder (CVAE) architecture with a transformer to train the motion priors with a large-scale motion dataset. Then our feature follower effectively aligns the feature spaces of noisy, partial observation with the necessary input for pre-trained motion priors, and quickly recovers a complete mesh sequence of motion. We demonstrate that the transformer-based autoencoder can collect necessary spatio-temporal correlations robust to various adversaries, such as missing temporal frames, or noisy observation under severe occlusion. Our framework is general and can be applied to recover the full 3D dynamics of other subjects with parametric representations.
Distinct selectivity to the spin angular momenta of photons has garnered significant attention in recent years, for its relevance in basic science and for imaging and sensing applications. While nonlocal metasurfaces ...
详细信息
Distinct selectivity to the spin angular momenta of photons has garnered significant attention in recent years, for its relevance in basic science and for imaging and sensing applications. While nonlocal metasurfaces with strong chiral responses to the incident light have been reported, these responses are typically limited to a narrow range of incident angles. In this study, we demonstrate a nonlocal metasurface that showcases strong chirality, characterized by circular dichroism (∼0.6), over a wide range of incident angles ±5°. Its quality factor, circular dichroism and resonant frequency can be optimized by design. These findings pave the way to further advance the development of valley-selective optical cavities and augmented reality applications.
Deep Neural Networks (DNNs) are prone to learning spurious features that correlate with the label during training but are irrelevant to the learning problem. This hurts model generalization and poses problems when dep...
详细信息
暂无评论