The key challenge of multi-view indoor 3D object detection is to infer accurate geometry information from images for precise 3D detection. Previous method relies on NeRF for geometry reasoning. However, the geometry e...
ISBN:
(纸本)9798331314385
The key challenge of multi-view indoor 3D object detection is to infer accurate geometry information from images for precise 3D detection. Previous method relies on NeRF for geometry reasoning. However, the geometry extracted from NeRF is generally inaccurate, which leads to sub-optimal detection performance. In this paper, we propose MVSDet which utilizes plane sweep for geometry-aware 3D object detection. To circumvent the requirement for a large number of depth planes for accurate depth prediction, we design a probabilistic sampling and soft weighting mechanism to decide the placement of pixel features on the 3D volume. We select multiple locations that score top in the probability volume for each pixel and use their probability score to indicate the confidence. We further apply recent pixel-aligned Gaussian Splatting to regularize depth prediction and improve detection performance with little computation overhead. Extensive experiments on ScanNet and ARKitScenes datasets are conducted to show the superiority of our model. Our code is available at https://***/Pixie8888/MVSDet.
Image decomposition offers deep insights into the imaging factors of visual data and significantly enhances various advanced computer vision tasks. In this work, we introduce a novel approach to low-light image enhanc...
详细信息
Weakly-supervised medical image segmentation is gaining traction as it requires only rough annotations rather than accurate pixel-to-pixel labels, thereby reducing the workload for specialists. Although some progress ...
详细信息
Early diagnosis of osteonecrosis of the femoral head (ONFH) can inhibit the progression and improve femoral head preservation. The radiograph difference between early ONFH and healthy ones is not apparent to the naked...
详细信息
Contrastive learning is a new self-supervised representation learning technique, which is considered to have great potential to improve the performance of downstream learning tasks. Recently, some researchers have con...
详细信息
A Clifford circuit is a pivotal tool in quantum computing and has extensive applications in quantum error correction codes and topological quantum computing. Hence, it is essential to benchmark and verify the effect o...
详细信息
A Clifford circuit is a pivotal tool in quantum computing and has extensive applications in quantum error correction codes and topological quantum computing. Hence, it is essential to benchmark and verify the effect of Clifford circuits against noise and errors. Standard quantum process tomography is a fundamental technique for fully characterizing quantum dynamics, but at the cost of exponential time, space, and computation with an increasing number of qubits. Here, we propose an efficient quantum process tomography method for Clifford circuits. Combining with the stabilizer formalism, we prove theoretically that, for an n-qubit Clifford circuit, our method merely needs m ancillary qubits and ⌈n/m⌉ input stabilizer states to obtain the quantum process. Numerical simulation results show that our method could perfectly rebuild the unknown Clifford quantum circuit with fidelity over 99.99% to six qubit cases. Our work provides an efficient and practical approach to benchmark and verify Clifford circuits.
In this paper,we firstly construct several new kinds of Sidon spaces and Sidon sets by investigating some known ***,using these Sidon spaces,we will present a construction of cyclic subspace codes with cardinality τ,...
详细信息
In this paper,we firstly construct several new kinds of Sidon spaces and Sidon sets by investigating some known ***,using these Sidon spaces,we will present a construction of cyclic subspace codes with cardinality τ,q^(n)-1/q-1 and minimum distance 2k-2,whereτis a positive *** further-more give some cyclic subspace codes with size 2τ·q^(n)-1/q-1 and without changing the minimum distance 2k-2.
The Peaceman-Rachford splitting method is efficient for minimizing a convex optimization problem with a separable objective function and linear ***,its convergence was not guaranteed without extra *** et al.(SIAM ***....
详细信息
The Peaceman-Rachford splitting method is efficient for minimizing a convex optimization problem with a separable objective function and linear ***,its convergence was not guaranteed without extra *** et al.(SIAM ***.24:1011-1040,2014)proved the convergence of a strictly contractive Peaceman-Rachford splitting method by employing a suitable underdetermined relaxation *** this paper,we further extend the so-called strictly contractive Peaceman-Rachford splitting method by using two different relaxation ***,motivated by the recent advances on the ADMM type method with indefinite proximal terms,we employ the indefinite proximal term in the strictly contractive Peaceman-Rachford splitting *** show that the proposed indefinite-proximal strictly contractive Peaceman-Rachford splitting method is convergent and also prove the o(1/t)convergence rate in the nonergodic *** numerical tests on the l 1 regularized least square problem demonstrate the efficiency of the proposed method.
Erasure coding is a common redundancy scheme for tolerating failures in storage systems. Compared with replication, erasure coding saves a large amount of storage space, but incurs heavy computation overhead and thus ...
Erasure coding is a common redundancy scheme for tolerating failures in storage systems. Compared with replication, erasure coding saves a large amount of storage space, but incurs heavy computation overhead and thus is more time-consuming. To this end, we design an algorithm to find a better parity coding matrix to reduce the number of XORs in coding based on Vandermonde matrices instead of Cauchy matrices. In addition, we optimize the coding process, to accelerate the computation speed of XOR and obtain a better tradeoff between spatial locality and computation efficiency. For wide stripes which becomes increasingly interesting, we propose to decompose the coding procedure into multiple subprocedures for better utilization of spatial locality. We integrate these methods into coding procedure and implement an erasure coding library, Cerasure. Extensive experiments show that Cerasure significantly improves the coding speed. Compared with the state-of-the-art erasure coding libraries, Zerasure and SLPEC, Cerasure increases the encoding throughput by up to 109.47%.
暂无评论