The visual models pretrained on large-scale benchmarks encode general knowledge and prove effective in building more powerful representations for downstream tasks. Most existing approaches follow the fine-tuning paradigm, either by initializing or regularizing the downstream model based on the pretrained one. The former fails to retain the knowledge in the successive fine-tuning phase and is thereby prone to over-fitting; the latter imposes strong constraints on the weights or feature maps of the downstream model without considering semantic drift, often incurring insufficient optimization. To deal with these issues, we propose a novel fine-tuning framework, namely distribution regularization with semantic calibration (DR-Tune). It employs distribution regularization by enforcing the downstream task head to decrease its classification error on the pretrained feature distribution, which prevents it from over-fitting while enabling sufficient training of downstream encoders. Furthermore, to alleviate the interference caused by semantic drift, we develop the semantic calibration (SC) module to align the global shape and class centers of the pretrained and downstream feature distributions. Extensive experiments on widely used image classification datasets show that DR-Tune consistently improves the performance when combined with various backbones under different pretraining strategies. Code is available at: https://***/weeknan/DR-Tune.
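The two ingredients of the abstract above can be sketched in a few lines: a shared task head that must classify both downstream and (center-aligned) pretrained features. This is a minimal toy sketch, not the DR-Tune implementation; the feature sizes, the single-head linear classifier, and the per-class mean shift used for calibration are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def cross_entropy(logits, labels):
    """Mean cross-entropy of a linear head's logits against integer labels."""
    logits = logits - logits.max(axis=1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()

# Toy setup: 2 classes, 4-dim features; W is the shared downstream task head.
W = rng.normal(size=(4, 2))
down_feats = rng.normal(size=(8, 4))   # features from the downstream encoder
pre_feats = rng.normal(size=(8, 4))    # features from the frozen pretrained encoder
labels = np.array([0, 0, 0, 0, 1, 1, 1, 1])

# Semantic calibration (simplified here to a per-class translation): shift the
# pretrained features so their class centers coincide with the downstream ones.
calibrated = pre_feats.copy()
for c in (0, 1):
    mask = labels == c
    calibrated[mask] += down_feats[mask].mean(0) - pre_feats[mask].mean(0)

# Distribution regularization: the head must also classify the calibrated
# pretrained feature distribution, not only the downstream features.
task_loss = cross_entropy(down_feats @ W, labels)
reg_loss = cross_entropy(calibrated @ W, labels)
total_loss = task_loss + reg_loss
```

In the full method the regularization term is what keeps the head from over-fitting the small downstream set, while the calibration step prevents the semantic drift between the two encoders from corrupting that signal.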
Security and privacy issues have attracted the attention of researchers in the field of IoT as the information processing scale grows in sensor networks. Quantum computing is theoretically known as an absolutely secure way to store and transmit information, as well as a way to accelerate local or distributed classical algorithms that are hard to solve with polynomial complexity in computation or communication. In this paper, we focus on the phase estimation method that is crucial to the realization of a general multi-party computing model and can be accelerated by quantum algorithms. A novel multi-party phase estimation algorithm and the related quantum circuit are proposed by using a distributed Oracle operator with ***. The proved theoretical communication complexity of this algorithm shows that it can give the phase estimation before applying multi-party computing efficiently, without increasing any additional ***. A practical problem of multi-party dating is investigated, showing that the method can estimate the number of solutions in advance with zero communication complexity by utilizing its special statistical ***. The simulations present the correctness, validity and efficiency of the proposed estimation method.
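For orientation, the measurement statistics of the *textbook single-party* phase estimation circuit (the building block the paper's multi-party variant accelerates) can be simulated classically. This sketch assumes nothing from the paper itself; it just evaluates the standard closed-form outcome distribution of an n-qubit QPE register after the inverse QFT.

```python
import numpy as np

def qpe_distribution(phase, n_qubits):
    """Outcome distribution of textbook quantum phase estimation.

    For an n-qubit control register, outcome k occurs with probability
    |(1/2^n) * sum_j exp(2*pi*i * j * (phase - k/2^n))|^2,
    which peaks at the k closest to phase * 2^n.
    """
    N = 2 ** n_qubits
    j = np.arange(N)
    k = np.arange(N)
    # geometric sum over the control register, one amplitude per outcome k
    amps = np.exp(2j * np.pi * np.outer(phase - k / N, j)).sum(axis=1) / N
    return np.abs(amps) ** 2

probs = qpe_distribution(phase=0.375, n_qubits=4)  # 0.375 = 6/16, exactly representable
print(int(np.argmax(probs)))  # -> 6, i.e. the estimate 6/16 recovers the phase
```

When the phase is exactly representable in n bits, as here, the distribution collapses onto a single outcome; otherwise the probability mass concentrates on the two nearest grid points, which is the statistical property estimation methods exploit.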
The development of the Internet of Things (IoT) has allowed devices to collect massive amounts of data, and Artificial Intelligence (AI) provides the ability to analyze those data. Moreover, researchers adopt Distributed Machine Learning (DML) methods to train neural networks collaboratively using different users' data. However, DML suffers from privacy issues, and Federated Learning (FL) has been an effective solution. FL transfers the model instead of the data to protect privacy, but the trained models have low accuracy over local datasets due to statistical heterogeneity. Thus, personalized FL (pFL) algorithms have been proposed to handle such heterogeneous data distributions. However, the communication overhead of pFL algorithms is significant, as they require transmitting additional information. In this paper, we therefore propose Federated Learning with Combined Particle Swarm Optimization (FedCPSO). FedCPSO replaces the aggregation process of FL algorithms with PSO, and we design a velocity in PSO specifically for FL algorithms, using the best global model, the best client models, and the best neighbor models. In addition, we implement magnitude pruning to reduce the communication volume. The experimental results illustrate that FedCPSO can reduce communication volume by up to 50% with less than a 2% accuracy drop compared with the state-of-the-art (SOTA) pFL algorithm.
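The two mechanisms named in the abstract can be sketched as follows. This is a hypothetical, simplified version of the update rule: the exact velocity definition, coefficients, and pruning schedule of FedCPSO are not given here, so the coefficient names (`inertia`, `c1`-`c3`) and the `keep_ratio` are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

def pso_update(w, v, g_best, c_best, n_best, inertia=0.5, c1=1.0, c2=1.0, c3=1.0):
    """One PSO-style aggregation step for a client model w (illustrative rule).

    The velocity mixes attraction toward the best global model, the client's
    own best model, and the best neighbor model, as the abstract describes.
    """
    r1, r2, r3 = rng.random(3)
    v_new = (inertia * v
             + c1 * r1 * (g_best - w)
             + c2 * r2 * (c_best - w)
             + c3 * r3 * (n_best - w))
    return w + v_new, v_new

def magnitude_prune(w, keep_ratio=0.5):
    """Zero out the smallest-magnitude weights to cut communication volume."""
    k = int(len(w) * keep_ratio)
    threshold = np.sort(np.abs(w))[-k]
    return np.where(np.abs(w) >= threshold, w, 0.0)

w = rng.normal(size=10); v = np.zeros(10)
g = rng.normal(size=10); c = rng.normal(size=10); n = rng.normal(size=10)
w_next, v_next = pso_update(w, v, g, c, n)
sparse = magnitude_prune(w_next, keep_ratio=0.5)  # half the entries become zero
```

Only the non-zero entries of `sparse` (plus their indices) need to be transmitted, which is where the communication savings come from.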
A key challenge for LiDAR-based 3D object detection is to capture sufficient features from large-scale 3D scenes, especially for distant and/or occluded objects. Despite recent efforts by Transformers with their long-sequence modeling capability, they fail to properly balance accuracy and efficiency, suffering from inadequate receptive fields or coarse-grained holistic correlations. In this paper, we propose an Octree-based Transformer, named OcTr, to address this issue. It first constructs a dynamic octree on the hierarchical feature pyramid by conducting self-attention on the top level and then recursively propagating to the level below, restricted by the octants, which captures rich global context in a coarse-to-fine manner while keeping the computational complexity under control. Furthermore, for enhanced foreground perception, we propose a hybrid positional embedding, composed of a semantic-aware positional embedding and an attention mask, to fully exploit semantic and geometry clues. Extensive experiments are conducted on the Waymo Open Dataset and the KITTI Dataset, and OcTr achieves new state-of-the-art results.
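The coarse-to-fine selection idea can be illustrated with a toy loop: attend at the current level, keep only the highest-scoring octants, and expand just those into their eight children. This is a deliberately simplified sketch of the *principle* (the pooled-query scoring and fake child features are illustrative assumptions), not the OcTr architecture.

```python
import numpy as np

rng = np.random.default_rng(2)

def coarse_to_fine_attention(feats, levels, keep=2):
    """Toy coarse-to-fine octant selection in the spirit of an octree Transformer.

    At each level, only the `keep` octants with the highest attention scores are
    expanded into 8 children, so the active token count stays bounded instead
    of growing eightfold per level.
    """
    active = feats  # tokens at the current (coarsest) level
    for _ in range(levels):
        q = active.mean(axis=0)           # a single pooled query (simplification)
        scores = active @ q               # attention logits per octant
        top = np.argsort(scores)[-keep:]  # octants worth refining
        children = np.repeat(active[top], 8, axis=0)  # expand into 8 children each
        active = children + 0.1 * rng.normal(size=children.shape)
    return active

out = coarse_to_fine_attention(rng.normal(size=(8, 16)), levels=3, keep=2)
print(out.shape)  # -> (16, 16): keep * 8 tokens, independent of depth
```

The payoff is the same as in the abstract: global context is gathered at the coarse level, while fine-grained attention is spent only inside the octants that matter.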
Although deep learning (DL) has obtained great achievements in industry, the involvement of artificial intelligence (AI) experts in developing customized DL services raises costs and hinders its wide application in the business domain. In this research, a Web-based automatic DL service generation system is presented to address this problem. The system can generate customized DL services without involving AI experts. It adopts ontology technologies to organize DL domain knowledge and generates target services based on the user's requests posted from the front-end web page. In the empirical study, the whole workflow of the system is demonstrated, and its scalability is evaluated. The results show that our system can generate customized services correctly and has good scalability.
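The core resolution step, mapping a front-end request onto ontology-organized domain knowledge, can be sketched minimally. The concept names, properties, and request fields below are hypothetical; the paper's actual ontology schema is not specified in the abstract.

```python
# Hypothetical ontology fragment: task concept -> service properties.
ontology = {
    "image_classification": {"model": "CNN", "input": "image", "output": "label"},
    "text_sentiment": {"model": "LSTM", "input": "text", "output": "polarity"},
}

def generate_service(request: dict) -> dict:
    """Resolve a front-end request to a concrete DL service configuration."""
    task = request["task"]
    if task not in ontology:
        raise ValueError(f"unknown task concept: {task}")
    spec = ontology[task]
    return {"service": task, **spec, "dataset": request.get("dataset", "user-upload")}

svc = generate_service({"task": "image_classification", "dataset": "flowers"})
print(svc["model"])  # -> CNN
```

The point of the ontology is that the mapping from business-level requests to model choices is encoded once by domain experts, so end users never touch the DL internals.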
This paper presents ER-NeRF, a novel conditional Neural Radiance Fields (NeRF) based architecture for talking portrait synthesis that can concurrently achieve fast convergence, real-time rendering, and state-of-the-ar...
With the proliferation of cloud services, carrying out large-scale image retrieval on remote clouds has become a trend, relieving clients of the storage and computation burden. However, traditional retrieval...
Pre-trained point cloud models have found extensive applications in 3D understanding tasks like object classification and part segmentation. However, the prevailing strategy of full fine-tuning in downstream tasks leads to large per-task storage overhead for model parameters, which limits the efficiency when applying large-scale pre-trained models. Inspired by the recent success of visual prompt tuning (VPT), this paper explores prompt tuning on pre-trained point cloud models to pursue an elegant balance between performance and parameter efficiency. We find that while instance-agnostic static prompting, e.g. VPT, shows some efficacy in downstream transfer, it is vulnerable to the distribution diversity caused by various types of noise in real-world point cloud data. To conquer this limitation, we propose a novel Instance-aware Dynamic Prompt Tuning (IDPT) strategy for pre-trained point cloud models. The essence of IDPT is a dynamic prompt generation module that perceives semantic prior features of each point cloud instance and generates adaptive prompt tokens to enhance the model's robustness. Notably, extensive experiments demonstrate that IDPT outperforms full fine-tuning in most tasks with a mere 7% of the trainable parameters, providing a promising solution to parameter-efficient learning for pre-trained point cloud models. Code is available at https://***/zyh16143998882/ICCV23-IDPT.
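The contrast with static prompting can be made concrete: instead of a fixed learnable prompt shared across all inputs, a small network maps each instance's pooled features to its own prompt token. This is a toy sketch of the idea, assuming max-pooling and a two-layer generator; the actual IDPT module design differs.

```python
import numpy as np

rng = np.random.default_rng(3)

class DynamicPrompt:
    """Toy instance-aware prompt generator (sketch of the IDPT idea).

    A small network maps each point cloud's pooled token features to a
    per-instance prompt, so the prompt adapts to instance-level variation
    rather than being one static vector as in VPT.
    """
    def __init__(self, dim):
        self.w1 = rng.normal(size=(dim, dim)) * 0.1
        self.w2 = rng.normal(size=(dim, dim)) * 0.1

    def __call__(self, tokens):
        pooled = tokens.max(axis=0)                   # instance-level summary
        prompt = np.tanh(pooled @ self.w1) @ self.w2  # adaptive prompt token
        return np.vstack([prompt[None, :], tokens])   # prepend to the sequence

tokens = rng.normal(size=(64, 32))  # 64 point-patch tokens, 32-dim
gen = DynamicPrompt(dim=32)
out = gen(tokens)
print(out.shape)  # -> (65, 32): one extra prompt token
```

Only the generator's parameters are trained while the backbone stays frozen, which is where the large reduction in trainable parameters comes from.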