检索结果-内蒙古大学图书馆

arXiv 2024年

作者： Wu, Jialong Wang, Zhenglin Zhang, Linhai Lai, Yilong He, Yulan Zhou, Deyu School of Computer Science and Engineering Key Laboratory of Computer Network and Information Integration Ministry of Education Southeast University China Department of Informatics King’s College London United Kingdom The Alan Turing Institute United Kingdom

Key-Value (KV) cache has become a bottleneck of LLMs for long-context generation. Despite the numerous efforts in this area, the optimization for the decoding phase is generally ignored. However, we believe such optimization is crucial, especially for long-output generation tasks based on the following two observations: (i) Excessive compression during the prefill phase which requires specific full context, impairs the comprehension of the reasoning task;(ii) Deviation of heavy hitters1 occurs in the reasoning tasks with long outputs. Therefore, SCOPE, a simple yet efficient framework that separately performs KV cache optimization during the prefill and decoding phases, is introduced. Specifically, the KV cache during the prefill phase is preserved to maintain the essential information, while a novel strategy based on sliding is proposed to select essential heavy hitters for the decoding phase. Memory usage and memory transfer are further optimized using adaptive and discontinuous strategies. Extensive experiments on LONGGENBENCH show the effectiveness and generalization of SCOPE and its compatibility as a plug-in to other prefill-only KV compression methods. 2 Copyright © 2024, The Authors. All rights reserved.

关键词： Decoding

来源：评论

学校读者我要写书评

暂无评论

In-Forest: Distributed In-Network Classification with Ensemble Models

In-Forest: Distributed In-Network Classification with Ensemb...

引用

International Conference on Network Protocols

作者： Jiaye Lin Qing Li Guorui Xie Yong Jiang Zhenhui Yuan Changlin Jiang Yuan Yang International Graduate School Tsinghua University Shenzhen China Peng Cheng Laboratory Shenzhen China Department of Computer and Information Science Northumbria University Newcastle United Kingdom Department of Computer Science and Technology Tsinghua University Beijing China

A variety of model representation methods have been used in recent works to translate machine learning models into programmable switch rules to address network classification tasks at line-speed, i.e., in-network classification. These works generally deploy a complete but heavy model on a switch with limited hardware resources, causing both network-wide waste of resources and unsatisfactory accuracy. Therefore, we propose In-Forest, a general distributed in-network classification framework. Firstly, to improve accuracy with limited resources, we develop a Lightweight Ensemble Generic Optional Model (LEGO), which can be further enhanced into multiple enhanced base models with full functionality. Each switch only needs to deploy a simple base model, rather than the complete ensemble model. Thus, hardware resources required for both switches and the entire network can be significantly reduced. Secondly, as traffic traverses multiple switches, In-Forest aggregates the classification results from different enhanced base models for higher accuracy. Furthermore, we design a two-phase resource-aware model allocation strategy that assigns enhanced base models to switches under different scenarios. We use stable deep reinforcement learning to respond to dynamic traffic changes. Experimental results show that when compared to SwitchTree, Planter, and Netbeacon in two real network topologies, In-Forest can increase accuracy by up to 19.31%, while reducing the number of switch rules by 89.98%.

关键词：

来源：评论

学校读者我要写书评

暂无评论

基于语音识别的电磁调控智能超表面

引用

Engineering 2023年第3期22卷 185-190页

作者：柏林刘元可徐亮张政王强蒋卫祥仇成伟崔铁军 State Key Laboratory of Millimeter Waves School of Information Science and EngineeringSoutheast UniversityNanjing 210096China Purple Mountain Laboratories Nanjing 211111China Department of Electrical and Computer Engineering National University of SingaporeSingapore 117583Singapore

本文提出并实现了一种基于人类语音识别的智能超表面,用于对电磁波束进行可编程调控。该智能超表面平台由数字编码超表面、语音识别模块、单片机和数模转换器(DAC)电路组成,可根据预先存储的语音指令对电磁波进行智能控制。所构建的数... 详细信息

本文提出并实现了一种基于人类语音识别的智能超表面,用于对电磁波束进行可编程调控。该智能超表面平台由数字编码超表面、语音识别模块、单片机和数模转换器(DAC)电路组成,可根据预先存储的语音指令对电磁波进行智能控制。所构建的数字编码超表面包含6×6个超级子单元,每个超级子单元由4×4个嵌入了变容二极管的有源数字单元组成。语音识别模块配合DAC和单片机对语音指令进行识别,并生成对应的电压序列来控制超表面。此外,在超表面的设计过程中引入遗传算法,可有效优化超表面相位分布。为了验证智能超表面平台的性能,演示了雷达散射截面积缩减、涡旋波束生成和波束分裂三种典型功能。所提出的方案为调控电磁波提供了一种新的途径,并在电磁和声学通信之间架起了连接桥梁。

关键词： Speech recognition Programmable metasurface Genetic algorithm Smart electromagnetic manipulation

来源：评论

学校读者我要写书评

暂无评论

Enhanced CT Image Generation by GAN for Improving Thyroid Anatomy Detection

Enhanced CT Image Generation by GAN for Improving Thyroid An...

引用

2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022

作者： Shi, Jianyu Liu, Xiaohong Yang, Guoxing Wang, Guangyu Beijing University of Posts and Telecommunications State Key Laboratory of Networking and Switching Technology Beijing100876 China Tsinghua University Department of Computer Science and Technology Beijing100084 China Peng Cheng Laboratory Department of Mathematics and Theories Shenzhen518055 China

ISBN: (纸本)9781665468190

Computed tomography (CT) is one of the most imaging methods widely used to locate lesions such as nodules, tumors, and cysts, and make primary diagnosis. For clearer imaging of anatomical or lesions, contrast-enhanced CT (CECT) scans are imaging with injecting a contrast agent into a patient during examination. But there are limits to iodine contrast injections so that CECT scans are not convenient like non-contrast enhanced CT (NECT). Recently, deep learning models bring impressive results in computer vision, including image translation. So, we would like to apply image translation methods to generate CECT images from the more accessible NECT images, and evaluate the effects of generated images on image detection tasks. In this study, we propose a method called cross-modal enhancement training strategy for thyroid anatomy detection, which employs CycleGAN to translate non-constrast enhanced CT images to enhanced CT style images with content reserved. The experiments are conducted on thyroid CT images with anatomy object annotation. The experimental results show that by adding translated images into the training dataset, the performance of thyroid anatomy detection can be effectively improved. We achieve the best mAP of 82.5% compared to 73.2% in the along non-contrast enhanced CT training. © 2022 IEEE.

关键词： Object detection

来源：评论

学校读者我要写书评

暂无评论

Single Model Quality Estimation of Protein Structures via Non-negative Tensor Factorization 11th

Single Model Quality Estimation of Protein Structures via ...

引用

11th International Conference on Computational Advances in Bio and Medical sciences, ICCABS 2021

作者： Kabir, Kazi Lutful Bhattarai, Manish Alexandrov, Boian S. Shehu, Amarda Department of Computer Science George Mason University FairfaxVA22030 United States Theoretical Division Los Alamos National Laboratory Los AlamosNM87545 United States

ISBN: (纸本)9783031175305

Finding the inherent organization in the structure space of a protein molecule is central in many computational studies of proteins. Grouping or clustering tertiary structures of a protein has been leveraged to build representations of the structure-energy landscape, highlight stable and semi-stable structural states, support models of structural dynamics, and connect them to biological function. Over the years, our laboratory has introduced methods to reveal structural states and build models of state-to-state protein dynamics. These methods have also been shown competitive for an orthogonal problem known as model selection, where model refers to a computed tertiary structure. Building on this work, in this paper we present a novel, tensor factorization-based method that doubles as a non-parametric clustering method. While the method has broad applicability, here we focus and demonstrate its efficacy on the estimation of model accuracy (EMA) problem. The method outperforms state-of-the-art methods, including single-model methods that leverage deep neural networks and domain-specific insight. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： Proteins

来源：评论

学校读者我要写书评

暂无评论

The Selectivity and Competition of the Mind’s Eye in Visual Perception

The Selectivity and Competition of the Mind’s Eye in Visual...

引用

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： Edward Kim Maryam Daniali Jocelyn Rego Garrett T. Kenyon Department of Computer Science Drexel University PA Department of Biomedical and Health Informatics (DBHi) Children’s Hospital of Philadelphia PA Los Alamos National Laboratory NM

Research has shown that neurons within the brain are selective to certain stimuli. For example, the fusiform face area (FFA) region is known by neuroscientists to selectively activate when people see faces over non-face objects. While the exact mechanisms by which the primary visual system directs information to the correct higher levels of the brain are currently unknown, there are high-level neural mechanisms of perception that we can incorporate in a novel computational model - ones that utilizes lateral and top down feedback in the form of hierarchical competition. We demonstrate that these neural mechanisms provide the foundation of a novel classification framework that rivals traditional supervised learning in computer vision. Additionally, we show that the innate priors built into our architecture support out of distribution generalization on the application of face detection.

关键词：

来源：评论

学校读者我要写书评

暂无评论

RUE: A caching method for identifying and managing hot data by leveraging resource utilization efficiency

RUE: A caching method for identifying and managing hot data ...

引用

作者： Ai, Liang Deng, Yuhui Zhou, Yi Feng, Hao Department of Computer Science Jinan University Guangzhou China Wuhan National Laboratory for Optoelectronics Wuhan China The TSYS School of Computer Science Columbus State University ColumbusGA United States

In this study, we propose a caching method called RUE for dynamic large-scale data streams. We define a data model to facilitate hot data identification and management. At the heart of RUE model is hot degree that takes into account two factors data resource utilization efficiency and reuse distance, aiming to quantitatively reflect data popularity in a dynamic data stream. Based on data's hot degree, RUE classifies data into four types, each of which is assigned with an associated cache residence time. Guided by RUE model, we develop HM algorithm to identify and manage hot data in a dynamic data stream. HM algorithm is implemented by four stacks, namely, new stack, short stack, long stack, and temp stack. Moreover, an eviction and a migration algorithms are integrated into HM to facilitate block replacement and migration. To evaluate the performance of HM algorithm, we quantitatively compare the performance of RUE with three state-of-art algorithms, namely, LRU, LIRS, and ARC under various replacement policies, operations, and workloads. Experimental results show that RUE outperforms these three existing algorithms in terms of both read and write hit rates. Furthermore, we show that with the four stacks in place, the computing overhead of HM is negligible. © 2021 John Wiley & Sons Ltd.

关键词： Efficiency

来源：评论

学校读者我要写书评

暂无评论

Certifying robust graph classification under orthogonal gromov-wasserstein threats 22

Certifying robust graph classification under orthogonal grom...

引用

Proceedings of the 36th International Conference on Neural Information Processing Systems

作者： Hongwei Jin Zishun Yu Xinhua Zhang Mathematics and Computer Science Division Argonne National Laboratory Lemont IL Department of Computer Science University of Illinois Chicago Chicago IL

ISBN: (纸本)9781713871088

Graph classifiers are vulnerable to topological attacks. Although certificates of robustness have been recently developed, their threat model only counts local and global edge perturbations, which effectively ignores important graph structures such as isomorphism. To address this issue, we propose measuring the perturbation with the orthogonal Gromov-Wasserstein discrepancy, and building its Fenchel biconjugate to facilitate convex optimization. Our key insight is drawn from the matching loss whose root connects two variables via a monotone operator, and it yields a tight outer convex approximation for resistance distance on graph nodes. When applied to graph classification by graph convolutional networks, both our certificate and attack algorithm are demonstrated effective.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Privacy-friendly Synthetic Data for the Development of Face Morphing Attack Detectors

arXiv

引用

arXiv 2022年

作者： Damer, Naser Fontanillo López, César Augusto Fang, Meiling Spiller, Noemie Pham, Minh Vu Boutros, Fadi Fraunhofer Institute for Computer Graphics Research IGD Darmstadt Germany Department of Computer Science TU Darmstadt Darmstadt Germany Centre for IT & IP Law KU Leuven Leuven Belgium

The main question this work aims at answering is: "can morphing attack detection (MAD) solutions be successfully developed based on synthetic data?". Towards that, this work introduces the first synthetic-based MAD development dataset, namely the Synthetic Morphing Attack Detection Development dataset (SMDD). This dataset is utilized successfully to train three MAD backbones where it proved to lead to high MAD performance, even on completely unknown attack types. Additionally, an essential aspect of this work is the detailed legal analyses of the challenges of using and sharing real biometric data, rendering our proposed SMDD dataset extremely essential. The SMDD dataset, consisting of 30,000 attack and 50,000 bona fide samples, is publicly available for research purposes. Copyright © 2022, The Authors. All rights reserved.

关键词： Biometrics

来源：评论

学校读者我要写书评

暂无评论

Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Modeling

Back to the Future: Bidirectional Information Decoupling Net...

引用

2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022

作者： Li, Yiyang Zhao, Hai Zhang, Zhuosheng Department of Computer Science and Engineering Shanghai Jiao Tong University China Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering Shanghai Jiao Tong University China

Multi-turn dialogue modeling as a challenging branch of natural language understanding (NLU), aims to build representations for machines to understand human dialogues, which provides a solid foundation for multiple downstream tasks. Recent studies of dialogue modeling commonly employ pre-trained language models (PrLMs) to encode the dialogue history as successive tokens, which is insufficient in capturing the temporal characteristics of dialogues. Therefore, we propose Bidirectional Information Decoupling Network (BiDeN) as a universal dialogue encoder, which explicitly incorporates both the past and future contexts and can be generalized to a wide range of dialogue-related tasks. Experimental results on datasets of different downstream tasks demonstrate the universality and effectiveness of our BiDeN. The official implementation of BiDeN is available at https://***/EricLee8/BiDeN. © 2022 Association for Computational Linguistics.

关键词： Modeling languages

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：