咨询与建议

限定检索结果

文献类型

  • 11 篇 会议
  • 10 篇 期刊文献

馆藏范围

  • 21 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 18 篇 工学
    • 17 篇 计算机科学与技术...
    • 13 篇 电气工程
    • 4 篇 软件工程
    • 2 篇 电子科学与技术(可...
    • 1 篇 控制科学与工程
  • 1 篇 管理学
    • 1 篇 管理科学与工程(可...

主题

  • 21 篇 algorithm-archit...
  • 5 篇 transformers
  • 5 篇 domain-specific ...
  • 4 篇 dynamic quantiza...
  • 2 篇 computational ef...
  • 2 篇 natural language...
  • 2 篇 neural networks
  • 2 篇 fully homomorphi...
  • 2 篇 computational mo...
  • 2 篇 point cloud
  • 2 篇 feature extracti...
  • 2 篇 hardware
  • 2 篇 graphics process...
  • 2 篇 neural network a...
  • 2 篇 attention
  • 2 篇 domain-specific ...
  • 1 篇 gcd
  • 1 篇 dft
  • 1 篇 head
  • 1 篇 transformer

机构

  • 4 篇 shanghai jiao to...
  • 3 篇 shanghai jiao to...
  • 2 篇 shanghai qi zhi ...
  • 2 篇 univ hasselt eng...
  • 1 篇 shanghai jiao to...
  • 1 篇 csg digital grid...
  • 1 篇 chinese acad sci...
  • 1 篇 northeastern uni...
  • 1 篇 seoul natl univ
  • 1 篇 eth zurich eth z...
  • 1 篇 institute of com...
  • 1 篇 mit eecs cambrid...
  • 1 篇 stochastic inc a...
  • 1 篇 arizona state un...
  • 1 篇 arizona state un...
  • 1 篇 zhejiang univ in...
  • 1 篇 huixi technol pe...
  • 1 篇 univ maryland de...
  • 1 篇 mit cambridge ma...
  • 1 篇 uc santa barbara...

作者

  • 3 篇 lyu dongxu
  • 3 篇 li zhenyu
  • 3 篇 chen yuzhou
  • 3 篇 he guanghui
  • 2 篇 xu ningyi
  • 2 篇 jiang li
  • 2 篇 he zhezhi
  • 2 篇 han song
  • 2 篇 zhao yilong
  • 2 篇 liu zhili
  • 2 篇 liu fangxin
  • 2 篇 zhang zhekai
  • 2 篇 xiong dongliang
  • 2 篇 jiang xiaowen
  • 2 篇 chen junjian
  • 2 篇 huang kai
  • 2 篇 wang hanrui
  • 2 篇 he weifeng
  • 2 篇 yang tao
  • 2 篇 claesen luc

语言

  • 20 篇 英文
  • 1 篇 其他
检索条件"主题词=algorithm-architecture co-design"
21 条 记 录,以下是1-10 订阅
排序:
DPACS: Hardware Accelerated Dynamic Neural Network Pruning through algorithm-architecture co-design  2023
DPACS: Hardware Accelerated Dynamic Neural Network Pruning t...
收藏 引用
28th ACM International conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
作者: Gao, Yizhao Zhang, Baoheng Qi, Xiaojuan So, Hayden Kwok-Hay Univ Hong Kong Hong Kong Peoples R China
By eliminating compute operations intelligently based on the run time input, dynamic pruning (DP) promises to improve deep neural network inference speed substantially without incurring a major impact on their accurac... 详细信息
来源: 评论
Dadu-corki: algorithm-architecture co-design for Embodied AI-powered Robotic Manipulation  25
Dadu-Corki: Algorithm-Architecture Co-Design for Embodied AI...
收藏 引用
Proceedings of the 52nd Annual International Symposium on computer architecture
作者: Yiyang Huang Yuhui Hao Bo Yu Feng Yan Yuxin Yang Feng Min Yinhe Han Lin Ma Shaoshan Liu Qiang Liu Yiming Gan Institute of Computing Technology Chinese Academy of Sciences University of Chinese Academy of Sciences Beijing China Shenzhen Institute of Artificial Intelligence and Robotics for Society Shenzhen China Meituan Beijing China Institute of Computing Technology Chinese Academy of Sciences Beijing China Tianjin University Tianjin China
来源: 评论
communication algorithm-architecture co-design for Distributed Deep Learning  21
Communication Algorithm-Architecture Co-Design for Distribut...
收藏 引用
ACM/IEEE 48th Annual International Symposium on computer architecture (ISCA)
作者: Huang, Jiayi Majumder, Pritam Kim, Sungkeun Muzahid, Abdullah Yum, Ki Hwan Kim, Eun Jung UC Santa Barbara Santa Barbara CA 93106 USA Texas A&M Univ College Stn TX USA
Large-scale distributed deep learning training has enabled developments of more complex deep neural network models to learn from larger datasets for sophisticated tasks. In particular, distributed stochastic gradient ... 详细信息
来源: 评论
AToM: Adaptive Token Merging for Efficient Acceleration of Vision Transformer
收藏 引用
IEEE TRANSACTIONS ON coMPUTERS 2025年 第5期74卷 1620-1633页
作者: Shin, Jaekang Kang, Myeonggu Han, Yunki Park, Junyoung Kim, Lee-Sup Korea Adv Inst Sci & Technol KAIST Sch Elect Engn Daejeon 34141 South Korea
Recently, Vision Transformers (ViTs) have set anew standard in computer vision (CV), showing unparalleledimage processing performance. However, their substantial com-putational requirements hinder practical deployment... 详细信息
来源: 评论
FLNA: Flexibly Accelerating Feature Learning Networks for Large-Scale Point Clouds With Efficient Dataflow Decoupling
收藏 引用
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS 2024年 第4期32卷 739-751页
作者: Lyu, Dongxu Li, Zhenyu Chen, Yuzhou Wang, Gang He, Weifeng Xu, Ningyi He, Guanghui Shanghai Jiao Tong Univ Sch Elect Informat & Elect Engn Shanghai 200240 Peoples R China Shanghai Jiao Tong Univ Qing Yuan Res Inst Shanghai 200240 Peoples R China Shanghai Jiao Tong Univ Sch Elect Informat & Elect Engn Shanghai 200240 Peoples R China Shanghai Jiao Tong Univ AI Inst MoE Key Lab Artificial Intelligence Shanghai 200240 Peoples R China
Point cloud-based 3-D perception is poised to become a key workload on various applications. It always leverages a feature learning network (FLN) before backbones to obtain uniform representation from the scattered po... 详细信息
来源: 评论
An Efficient Multi-View Cross-Attention Accelerator for Vision-Centric 3D Perception in Autonomous Driving
收藏 引用
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS 2025年
作者: Lyu, Dongxu Li, Zhenyu Xu, Yansong Wang, Gang Li, Wenjie Chen, Yuzhou Chen, Liyan He, Weifeng He, Guanghui Shanghai Jiao Tong Univ Sch Elect Informat & Elect Engn State Key Lab Micro Nano Engn Sci Shanghai 200240 Peoples R China
Vision-centric 3D perception has become a key mechanism in autonomous driving. It achieves exceptional perceptual performance mainly by introducing a novel attention, multi-view cross-attention (MVCA), for learnable f... 详细信息
来源: 评论
DTATrans: Leveraging Dynamic Token-Based Quantization With Accuracy compensation Mechanism for Efficient Transformer architecture
收藏 引用
IEEE TRANSACTIONS ON coMPUTER-AIDED design OF INTEGRATED CIRCUITS AND SYSTEMS 2023年 第2期42卷 509-520页
作者: Yang, Tao Ma, Fei Li, Xiaoling Liu, Fangxin Zhao, Yilong He, Zhezhi Jiang, Li Shanghai Jiao Tong Univ Sch Elect Informat & Elect Engn Shanghai 201100 Peoples R China Inceptio Technol Inst Shanghai 200093 Peoples R China Shanghai Qi Zhi Inst Shanghai 200232 Peoples R China Shanghai Jiao Tong Univ AI Inst MoE Key Lab Artificial Intelligence Shanghai 200240 Peoples R China
Models based on the attention mechanism, i.e., transformers, have shown extraordinary performance in natural language processing (NLP) tasks. However, their memory foot-print, inference latency, and power consumption ... 详细信息
来源: 评论
Structured Dynamic Precision for Deep Neural Networks Quantization
收藏 引用
ACM TRANSACTIONS ON design AUTOMATION OF ELECTRONIC SYSTEMS 2023年 第1期28卷 1-24页
作者: Huang, Kai Li, Bowen Xiong, Dongliang Jiang, Haitian Jiang, Xiaowen Yan, Xiaolang Claesen, Luc Liu, Dehong Chen, Junjian Liu, Zhili Zhejiang Univ Inst VLSI Design 38 Zheda Rd Hangzhou 310030 Zhejiang Peoples R China Univ Hasselt Engn Technol Elect ICT Dept B-3590 Diepenbeek Belgium China Southern Power Grid Co Ltd Guangzhou 510670 Peoples R China Sec Chip Technol Co Ltd Hangzhou 310030 Peoples R China
Deep Neural Networks (DNNs) have achieved remarkable success in various Artificial Intelligence applications. Quantization is a critical step in DNNs compression and acceleration for deployment. To further boost DNN e... 详细信息
来源: 评论
Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accelerator  30
Lightening-Transformer: A Dynamically-operated Optically-int...
收藏 引用
30th IEEE International Symposium on High-Performance computer architecture (HPCA)
作者: Zhu, Hanqing Gu, Jiaqi Wang, Hanrui Jiang, Zixuan Zhang, Zhekai Tang, Rongxing Feng, Chenghao Han, Song Chen, Ray T. Pan, David Z. Univ Texas Austin Austin TX 78712 USA MIT Cambridge MA 02139 USA Arizona State Univ Tempe AZ 85287 USA
The wide adoption and significant computing resource cost of attention-based transformers, e.g., Vision Transformers and large language models, have driven the demand for efficient hardware accelerators. While electro... 详细信息
来源: 评论
Triangle counting Accelerations: From algorithm to In-Memory computing architecture
收藏 引用
IEEE TRANSACTIONS ON coMPUTERS 2022年 第10期71卷 2462-2472页
作者: Wang, Xueyan Yang, Jianlei Zhao, Yinglin Jia, Xiaotao Yin, Rong Chen, Xuhang Qu, Gang Zhao, Weisheng Beihang Univ Sch Integrated Circuit Sci & Engn MIIT Key Lab Spintron Beijing 100191 Peoples R China Beihang Univ Sch Comp Sci & Engn State Key Lab Software Dev Environm NLSDE BDBC Beijing 100191 Peoples R China Chinese Acad Sci Inst Informat Engn Beijing 100049 Peoples R China Univ Maryland Dept Elect & Comp Engn College Pk MD 20742 USA Univ Maryland Inst Syst Res College Pk MD 20742 USA
Triangles are the basic substructure of networks and triangle counting (TC) has been a fundamental graph computing problem in numerous fields such as social network analysis. Nevertheless, like other graph computing p... 详细信息
来源: 评论