Search Results - Inner Mongolia University Library

arXiv, 2017

Authors: Zhou, Shuchang; Wang, Yuzhi; Wen, He; He, Qinyao; Zou, Yuheng

Affiliations: University of Chinese Academy of Sciences, Beijing 100049, China; State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; Megvii Inc., Beijing 100190, China

Quantized Neural Networks (QNNs), which use low-bitwidth numbers to represent parameters and perform computations, have been proposed to reduce computational complexity, storage size and memory usage. In QNNs, parameters and activations are uniformly quantized, so that multiplications and additions can be accelerated by bitwise operations. However, distributions of parameters in neural networks are often imbalanced, so uniform quantization determined from extremal values may underutilize the available bitwidth. In this paper, we propose a novel quantization method that ensures the balance of the distributions of quantized values. Our method first recursively partitions the parameters by percentiles into balanced bins, and then applies uniform quantization. We also introduce computationally cheaper approximations of percentiles to reduce the computation overhead introduced. Overall, our method improves the prediction accuracies of QNNs without introducing extra computation during inference, has negligible impact on training speed, and is applicable to both Convolutional Neural Networks and Recurrent Neural Networks. Experiments on standard datasets including ImageNet and Penn Treebank confirm the effectiveness of our method. On ImageNet, the top-5 error rate of our 4-bit quantized GoogLeNet model is 12.7%, which is superior to state-of-the-art QNNs. Copyright © 2017, The Authors. All rights reserved.

Keywords: Recurrent neural networks
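The partition-then-quantize idea in the abstract above can be illustrated with a minimal sketch. It is an assumption-laden toy, not the authors' implementation: the function name `balanced_quantize`, the use of exact medians via `np.median`, and the output range [-1, 1] are all hypothetical choices, and the cheaper percentile approximations mentioned in the abstract are not modeled.

```python
import numpy as np


def balanced_quantize(weights, bits):
    """Return (codes, values): integer bin codes and their uniform levels in [-1, 1]."""
    w = np.asarray(weights, dtype=np.float64)
    codes = np.zeros(w.shape, dtype=np.int64)

    def split(mask, depth):
        # Each group is halved at its median until it has been split `bits` times.
        if depth == bits or not mask.any():
            return
        median = np.median(w[mask])  # 50th percentile of the current group
        upper = mask & (w > median)
        lower = mask & (w <= median)
        # Record which half each value fell into, most significant bit first.
        codes[upper] |= 1 << (bits - 1 - depth)
        split(lower, depth + 1)
        split(upper, depth + 1)

    split(np.ones(w.shape, dtype=bool), 0)
    # Uniform step: spread the 2**bits bin codes evenly over [-1, 1].
    values = 2.0 * codes / (2 ** bits - 1) - 1.0
    return codes, values


rng = np.random.default_rng(0)
skewed = rng.exponential(size=1024)  # heavily imbalanced toy "parameters"
codes, values = balanced_quantize(skewed, bits=2)
print(np.bincount(codes, minlength=4))  # bin occupancy after balancing
```

With distinct input values, each of the `2**bits` bins receives roughly the same number of parameters, whereas uniform quantization over the raw min-max range would leave most values of a skewed distribution in the lowest bins.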

DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis

arXiv, 2024

Authors: Deng, Kaijun; Zheng, Dezhi; Xie, Jindong; Wang, Jinbao; Xie, Weicheng; Shen, Linlin; Song, Siyang

Affiliations: Computer Vision Institute, School of Computer Science and Software Engineering, Shenzhen University, China; National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, China; Guangdong Provincial Key Laboratory of Intelligent Information Processing, China; Department of Computer Science, University of Exeter, United Kingdom

Accurately synthesizing talking face videos and capturing fine facial features for individuals with long hair presents a significant challenge. To tackle these challenges in existing methods, we propose decomposed per-embedding Gaussian fields (DEGSTalk), a 3D Gaussian Splatting (3DGS)-based talking face synthesis method for generating realistic talking faces with long hair. Our DEGSTalk employs Deformable Pre-Embedding Gaussian Fields, which dynamically adjust pre-embedding Gaussian primitives using implicit expression coefficients. This enables precise capture of dynamic facial regions and subtle expressions. Additionally, we propose a Dynamic Hair-Preserving Portrait Rendering technique to enhance the realism of long hair motions in the synthesized videos. Results show that DEGSTalk achieves improved realism and synthesis quality compared to existing approaches, particularly in handling complex facial dynamics and hair preservation. Our code will be publicly available at https://***/CVI-SZU/DEGSTalk. Copyright © 2024, The Authors. All rights reserved.

Keywords: Gaussian distribution

Random Occlusion Recovery with Noise Channel for Person Re-identification

16th International Conference on Intelligent Computing, ICIC 2020

Authors: Zhang, Kun; Wu, Di; Yuan, Changan; Qin, Xiao; Wu, Hongjie; Zhao, Xingming; Zhang, Lijun; Du, Yuchuan; Wang, Hanli

Affiliations: Institute of Machine Learning and Systems Biology, School of Electronics and Information Engineering, Tongji University, Shanghai, China; Guangxi Academy of Science, Nanning 530025, China; School of Computer and Information Engineering, Nanning Normal University, Nanning 530299, China; School of Computer Science and Technology, Soochow University, Suzhou 215006, China; School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China; Fudan University, Shanghai 200433, China; Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Ministry of Education, Shanghai, China; Collaborative Innovation Center of Intelligent New Energy Vehicle and School of Automotive Studies, Tongji University, Shanghai 201804, China; The Key Laboratory of Road and Traffic Engineering of the Ministry of Education, Department of Transportation Engineering, Tongji University, Shanghai 201804, China; Department of Computer Science and Technology, the Key Laboratory of Embedded System and Service Computing, and Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai 200092, China

ISBN: (print) 9783030607982

Person re-identification, as a basic task of multi-camera surveillance systems, plays an important role in a variety of surveillance applications. However, current mainstream person re-identification models based on deep learning require a large amount of labeled data, which takes considerable time and manpower to produce. In this study, we propose a person re-identification method based on random occlusion recovery with a noise channel. We add random occlusion blocks to the original image, use a GAN model to repair it, and use the repaired image to expand the original training set. After that, the generated image is adjusted through the noise channel. Finally, we use the enhanced data set to train the baseline model. Our model achieves state-of-the-art results on the Market-1501 dataset, showing that the method is effective. © 2020, Springer Nature Switzerland AG.

Keywords: Generative adversarial networks
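The first step of the pipeline described in the abstract above, adding a random occlusion block to a training image, can be sketched as follows. This is only an illustration under stated assumptions: the function name `add_random_occlusion`, the block-size fractions, and the zero fill value are hypothetical, and the GAN repair and noise-channel stages are not modeled.

```python
import numpy as np


def add_random_occlusion(image, rng, min_frac=0.1, max_frac=0.3, fill=0):
    """Return (occluded_copy, mask) with one random rectangular block filled in."""
    h, w = image.shape[:2]
    # Block height and width are random fractions of the image size (assumed range).
    bh = max(1, int(h * rng.uniform(min_frac, max_frac)))
    bw = max(1, int(w * rng.uniform(min_frac, max_frac)))
    # Pick a top-left corner so the block lies fully inside the image.
    top = int(rng.integers(0, h - bh + 1))
    left = int(rng.integers(0, w - bw + 1))

    occluded = image.copy()
    occluded[top:top + bh, left:left + bw] = fill
    mask = np.zeros((h, w), dtype=bool)
    mask[top:top + bh, left:left + bw] = True  # True where pixels were hidden
    return occluded, mask


rng = np.random.default_rng(0)
image = np.full((128, 64, 3), 255, dtype=np.uint8)  # placeholder pedestrian crop
occluded, mask = add_random_occlusion(image, rng)
print(mask.sum(), "pixels occluded")
```

Returning the mask alongside the occluded image is a design choice for this sketch: an inpainting model typically needs to know which region to repair.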

Epitaxial growth of NaCl on Fe (100) and characterization of Fe/NaCl/Fe magnetic tunnel junctions

IEEE Conference on Nanotechnology

Authors: Yuan-Tao Ji; Qiang Li; Yong-Chao Tang; Lin Li; Guo-Xing Miao

Affiliations: State Key Laboratory for Manufacturing Systems Engineering and Systems Engineering Institute, Xi’an Jiaotong University, Xi’an, China; The Key Laboratory of Embedded System and Service Computing, Ministry of Education, Tongji University, Shanghai, China; Department of Electrical and Computer Engineering, New Jersey Institute of Technology, Newark, NJ, USA

ISBN: (print) 9781479956234

Growth of NaCl and Fe/NaCl/Fe magnetic tunnel junctions on Si (100) has been achieved using a high-vacuum electron-beam deposition system. Epitaxial tunnel junctions turn out to be prone to pinholes as well as electrode oxidation. Instead, the best tunneling magnetoresistance we have achieved in this system is on polycrystalline tunnel barriers with a thin Mg insertion, reaching 22.3% at room temperature.

Keywords: Iron; Junctions; Magnetic tunneling; Epitaxial growth; Tunneling magnetoresistance; Electrodes; Silicon

New Big Data Collecting Method Based on Compressive Sensing in WSN

International Conference on Computer Communications and Networks (ICCCN)

Authors: De-gan Zhang; Xiao-hua Liu; Yu-ya Cui; Hong-tao Peng

Affiliations: School of Computer Science & Engineering, Tianjin University of Technology, Tianjin, China; Tianjin Key Lab of Intelligent Computing & Novel Software Technology, Tianjin, China; Key Laboratory of Computer Vision and System (TJUT), Ministry of Education, Tianjin, China; National Petroleum Corporation (CNPC) Managers Training Institute, Beijing, China

Considering the clustered structure of wireless sensor networks, a new big data collection method based on compressive sensing is proposed. The collection process is as follows. Within a cluster, the sink node sets the corresponding seed vector based on the distribution of the network and then sends it to each cluster head. Each cluster head generates its own random-spacing sparse matrix from the received seed vector and collects data through compressive sensing. Between clusters, clusters forward the measurement values to the sink node along a previously built multi-hop routing tree. Performance analysis and comparison of results show that this method is superior to other methods both within a cluster and between clusters.

Keywords: Wireless sensor networks; Compressed sensing; Sparse matrices; Big Data; Routing; Energy consumption; Training
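The seed-sharing step in the abstract above can be illustrated with a small sketch: because the matrix is derived deterministically from a seed, the sink only has to send the seed, and both ends can rebuild the same sparse measurement matrix. Everything concrete here is an assumption for illustration, including the function name `measurement_matrix`, the scalar seed (the abstract speaks of a seed vector), and the ±1 sparse construction, which is a generic sparse random matrix rather than the paper's random-spacing design. Signal reconstruction at the sink is not shown.

```python
import numpy as np


def measurement_matrix(seed, m, n, nonzeros_per_column=2):
    """Build an m x n sparse +/-1 matrix that is fully determined by `seed`."""
    rng = np.random.default_rng(seed)
    phi = np.zeros((m, n))
    for col in range(n):
        # Choose which measurements this sensor reading contributes to.
        rows = rng.choice(m, size=nonzeros_per_column, replace=False)
        phi[rows, col] = rng.choice([-1.0, 1.0], size=nonzeros_per_column)
    return phi


seed = 42  # hypothetical seed distributed by the sink node
phi_head = measurement_matrix(seed, m=8, n=32)  # rebuilt at the cluster head
phi_sink = measurement_matrix(seed, m=8, n=32)  # rebuilt at the sink

readings = np.zeros(32)
readings[[3, 17]] = [1.5, -2.0]  # sparse toy sensor data
measurements = phi_head @ readings  # 8 values forwarded instead of 32
print(measurements.shape, np.array_equal(phi_head, phi_sink))
```

The cluster head transmits only the `m` measurements, so the communication saving comes from `m` being much smaller than the number of raw readings `n`.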

Frequency Scheduling For Resilient Chip Multi-Processors Operating at Near Threshold Voltage

Design, Automation & Test in Europe Conference & Exhibition

Authors: Ying Wang; Huawei Li; Xiaowei Li

Affiliations: State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, China

ISBN: (print) 9781467392280

With the recently proposed redundancy-based core salvaging technology, resilient processors can survive the threat of severe timing violations induced by near-threshold Vdd and function correctly at aggressive clock rates. In our observation, proactively disabling the weakest components that limit the core frequency can still maintain a higher throughput at Near Threshold Voltage (NTV) supply if the cores with defective components are salvaged at a low cost. In this work, a resilience-aware frequency scaling and mapping strategy that considers defective processor states in scheduling is proposed to exploit fault-tolerant architectures for higher energy efficiency. In our evaluation, typical resilient multi-core processors achieve significantly higher performance per watt than under a conventional scheduling policy.

Keywords: NTV; multi-core; scheduling; fault-tolerant

A highly-efficient and green data flow engine for solving Euler atmospheric equations

International Conference on Field Programmable Logic and Applications

Authors: Lin Gan; Haohuan Fu; Chao Yang; Wayne Luk; Wei Xue; Oskar Mencer; Xiaomeng Huang; Guangwen Yang

Affiliations: Department of Computer Science and Technology, Tsinghua University; Tsinghua National Laboratory for Information Science and Technology (TNList); Ministry of Education Key Laboratory for Earth System Modeling, Tsinghua University; Institute of Software, Chinese Academy of Sciences; Department of Computing, Imperial College London; Maxeler Technologies

Atmospheric modeling is an essential issue in the study of climate change. However, due to the complicated algorithmic and communication models, scientists and researchers are facing tough challenges in finding efficient solutions to solve the atmospheric equations. In this paper, we accelerate a solver for the three-dimensional Euler atmospheric equations through reconfigurable data flow engines. We first propose a hybrid design that achieves efficient resource allocation and data reuse. Furthermore, through algorithmic offsetting, fast memory table, and customizable-precision arithmetic, we map a complex Euler kernel into a single FPGA chip, which can perform 956 floating point operations per cycle. In a 1U-chassis, our CPU-DFE unit with 8 FPGA chips is 18.5 times faster and 8.3 times more power efficient than a multicore system based on two 12-core Intel E5-2697 (Ivy Bridge) CPUs, and is 6.2 times faster and 5.2 times more power efficient than a hybrid unit equipped with two 12-core Intel E5-2697 (Ivy Bridge) CPUs and three Intel Xeon Phi 5120d (MIC) cards.

Keywords: Mathematical model; Atmospheric modeling; Equations; Field programmable gate arrays; Computational modeling; Kernel; Three-dimensional displays

SFDA-rPPG: Source-Free Domain Adaptive Remote Physiological Measurement with Spatio-Temporal Consistency

arXiv, 2024

Authors: Xie, Yiping; Yu, Zitong; Wu, Bingjie; Xie, Weicheng; Shen, Linlin

Affiliations: Computer Vision Institute, School of Computer Science & Software Engineering, Shenzhen Institute of Artificial Intelligence and Robotics for Society, Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen 518060, China; School of Computing and Information Technology, Great Bay University, Dongguan 523000, China; National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen 518060, China; Singapore

Remote Photoplethysmography (rPPG) is a non-contact method that uses facial video to predict changes in blood volume, enabling measurement of physiological metrics. Traditional rPPG models often generalize poorly to unseen domains. Current solutions to this problem improve generalization in the target domain through Domain Generalization (DG) or Domain Adaptation (DA). However, both approaches require access to both source-domain and target-domain data, which is infeasible in scenarios with limited access to source data; accessing source-domain data also raises privacy concerns. In this paper, we propose the first Source-free Domain Adaptation benchmark for rPPG measurement (SFDA-rPPG), which overcomes these limitations by enabling effective domain adaptation without access to source domain data. Our framework incorporates a Three-Branch Spatio-Temporal Consistency Network (TSTC-Net) to enhance feature consistency across domains. Furthermore, we propose a new rPPG distribution alignment loss based on the Frequency-domain Wasserstein Distance (FWD), which leverages optimal transport to align power spectrum distributions across domains effectively and further enforces the alignment of the three branches. Extensive cross-domain experiments and ablation studies demonstrate the effectiveness of our proposed method in source-free domain adaptation settings. Our findings highlight the significant contribution of the proposed FWD loss for distributional alignment, providing a valuable reference for future research and applications. The source code is available at https://***/XieYiping66/SFDA-rPPG. Copyright © 2024, The Authors. All rights reserved.

Keywords: Photoplethysmography
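To make the idea of comparing power spectrum distributions with a Wasserstein distance concrete, here is a minimal sketch. It is not the FWD loss from the paper, whose exact form the abstract does not give: it assumes the 1-D Wasserstein-1 distance between normalized power spectra, computed as the area between their cumulative distributions, and the function names are hypothetical. Both signals are assumed to be non-constant, equally long, and sampled at the same rate.

```python
import numpy as np


def power_spectrum_distribution(signal, fs):
    """Return (freqs, p): frequency bins and the power spectrum normalized to sum to 1."""
    signal = np.asarray(signal, dtype=np.float64)
    signal = signal - signal.mean()  # drop the DC component
    power = np.abs(np.fft.rfft(signal)) ** 2
    freqs = np.fft.rfftfreq(signal.size, d=1.0 / fs)
    return freqs, power / power.sum()


def frequency_wasserstein(sig_a, sig_b, fs):
    """Wasserstein-1 distance (in Hz) between the two normalized power spectra."""
    freqs, p = power_spectrum_distribution(sig_a, fs)
    _, q = power_spectrum_distribution(sig_b, fs)
    df = freqs[1] - freqs[0]  # uniform bin spacing
    # In 1-D, W1 equals the area between the two cumulative distributions.
    return float(np.sum(np.abs(np.cumsum(p) - np.cumsum(q))) * df)


fs = 30.0  # assumed video frame rate in Hz
t = np.arange(300) / fs
pulse_60bpm = np.sin(2 * np.pi * 1.0 * t)   # 1.0 Hz toy pulse signal
pulse_120bpm = np.sin(2 * np.pi * 2.0 * t)  # 2.0 Hz toy pulse signal
print(frequency_wasserstein(pulse_60bpm, pulse_120bpm, fs))
```

Unlike a bin-by-bin difference, this distance grows with how far spectral mass has to move along the frequency axis, so two spectra whose peaks are slightly offset are treated as closer than two whose peaks are far apart.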

AIBench: Towards Scalable and Comprehensive Datacenter AI Benchmarking

1st International Symposium on Benchmarking, Measuring, and Optimization, Bench 2018

Authors: Gao, Wanling; Luo, Chunjie; Wang, Lei; Xiong, Xingwang; Chen, Jianan; Hao, Tianshu; Jiang, Zihan; Fan, Fanda; Du, Mengjia; Huang, Yunyou; Zhang, Fan; Wen, Xu; Zheng, Chen; He, Xiwen; Dai, Jiahui; Ye, Hainan; Cao, Zheng; Jia, Zhen; Zhan, Kent; Tang, Haoning; Zheng, Daoyi; Xie, Biwei; Li, Wei; Wang, Xiaoyu; Zhan, Jianfeng

Affiliations: State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; Dover, DE, United States; Beijing Academy of Frontier Sciences and Technology, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; Alibaba, Hangzhou, China; Princeton University, Princeton, United States; Wuba, Zhuxi, China; Tencent, Shenzhen, China; Baidu, Beijing, China; China RISC-V Alliance, Beijing, China; Cambricon, Shenzhen, China; Intellifusion, Shenzhen, China

ISBN: (print) 9783030328122

AI benchmarking provides yardsticks for benchmarking, measuring and evaluating innovative AI algorithms, architecture, and systems. Coordinated by BenchCouncil, this paper presents our joint research and engineering efforts with several academic and industrial partners on the datacenter AI benchmarks—AIBench. The benchmarks are publicly available from http://***/AIBench/***. Presently, AIBench covers 16 problem domains, including image classification, image generation, text-to-text translation, image-to-text, image-to-image, speech-to-text, face embedding, 3D face recognition, object detection, video prediction, image compression, recommendation, 3D object reconstruction, text summarization, spatial transformer, and learning to rank, and two end-to-end application AI benchmarks. Meanwhile, the AI benchmark suites for high performance computing (HPC), IoT, Edge are also released on the BenchCouncil web site. This is by far the most comprehensive AI benchmarking research and engineering effort. © 2019, Springer Nature Switzerland AG.

Keywords: Benchmarking

F-CNN: An FPGA-based framework for training Convolutional Neural Networks

International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Authors: Wenlai Zhao; Haohuan Fu; Wayne Luk; Teng Yu; Shaojun Wang; Bo Feng; Yuchun Ma; Guangwen Yang

Affiliations: Ministry of Education Key Laboratory for Earth System Modeling and Center for Earth System Science, Tsinghua University, China; Tsinghua National Laboratory for Information Science and Technology, China; Department of Computing, Imperial College London, UK; Department of Automatic Test and Control, Harbin Institute of Technology, China; Department of Computer Science and Technology, Tsinghua University, China

ISBN: (print) 9781509015047

This paper presents a novel reconfigurable framework for training Convolutional Neural Networks (CNNs). The proposed framework is based on reconfiguring a streaming datapath at runtime to cover the training cycle for the various layers in a CNN. The streaming datapath can support various parameterized modules which can be customized to produce implementations with different trade-offs in performance and resource usage. The modules follow the same input and output data layout, simplifying configuration scheduling. For different layers, instances of the modules contain different computation kernels in parallel, which can be customized with different layer configurations and data precision. The associated models of performance, resource usage and bandwidth can be used to derive parameters for the datapath and to guide the analysis of design trade-offs to meet application requirements or platform constraints. They enable estimation of the implementation specifications given different layer configurations, to maximize performance under the constraints on bandwidth and hardware resources. Experimental results indicate that the proposed module design targeting Maxeler technology can achieve a performance of 62.06 GFLOPS for 32-bit floating-point arithmetic, outperforming existing accelerators. Further evaluation based on training LeNet-5 shows that the proposed framework is about 4 times faster than the CPU implementation of Caffe and about 7.5 times more energy efficient than the GPU implementation of Caffe.

Keywords: Training; Field programmable gate arrays; Convolution; Computational modeling; Bandwidth; Neural networks; Runtime
