Search Results - Inner Mongolia University Library

Midwest Symposium on Circuits and Systems (MWSCAS)

Authors: Wenxiang Wang; Ling Li; Guangfei Zhang; Dong Liu; Ji Qiu. Affiliation: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

This paper presents an Application Specific Instruction Set Processor (ASIP) tailored to high-throughput, variable-length Fast Fourier Transform (FFT), which is a key component of various Orthogonal Frequency Division Multiplexing (OFDM)-based wireless communication standards. The ASIP executes dedicated FFT instructions to process two radix-4 or four radix-2 butterfly operations every clock cycle. Furthermore, a shuffle-embedded register file and a programmable memory access coprocessor are employed to tackle the memory access bottleneck and reduce power consumption. The implementation results show that the ASIP requires only 892 clock cycles for a 1024-point FFT, outperforming the TI TMS320C64x DSP and the Tensilica ConnX ASIP by 6.74X and 2.03X, respectively. A test chip of the proposed ASIP was fabricated in a 65 nm CMOS process with a core area of 1.9 mm². It consumes 85 mW when running at its maximum frequency of 150 MHz.

Keywords: Field-flow fractionation; OFDM; Digital signal processing; IEEE 802.16 Standards
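
The throughput figures in the abstract above can be put in context with a small cycle-count estimate. The sketch below counts the butterflies in an N-point FFT and divides by the stated issue rate (two radix-4 or four radix-2 butterflies per cycle). The function names are hypothetical, and the result is only an idealized lower bound that ignores loads, stores, and control overhead.

```python
import math


def butterfly_count(n_points: int, radix: int) -> int:
    """Total radix-`radix` butterflies in an n-point FFT.

    n_points must be an exact power of radix.
    """
    stages = 0
    n = n_points
    while n > 1:
        if n % radix != 0:
            raise ValueError("n_points must be a power of radix")
        n //= radix
        stages += 1
    # Every stage processes n_points / radix butterflies.
    return stages * (n_points // radix)


def ideal_cycles(n_points: int, radix: int, butterflies_per_cycle: int) -> int:
    """Lower bound on cycles if the butterfly units are never idle."""
    return math.ceil(butterfly_count(n_points, radix) / butterflies_per_cycle)


if __name__ == "__main__":
    # Issue rates taken from the abstract: 2 radix-4 or 4 radix-2 per cycle.
    print(ideal_cycles(1024, radix=4, butterflies_per_cycle=2))
    print(ideal_cycles(1024, radix=2, butterflies_per_cycle=4))
```

Under these assumptions a 1024-point radix-4 FFT needs at least 640 cycles, so the reported 892 cycles sit above that bound, as expected once overheads not modeled here are included.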

JVM virtual method invoking optimization based on CAM table

IEEE International Conference on Networking, Architecture and Storage

Authors: Cai, Songsong; Yang, Yongqiang; Lin, Chuanwen; Liu, Qi. Affiliations: Key Laboratory of Computer System and Architecture, Chinese Academy of Sciences, Beijing, China; Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; Graduate University of Chinese Academy of Sciences, Beijing, China; Dept. of Computer Science and Technology, University of Science and Technology of China, Hefei, China; Loongson Technology Corporation Limited, Beijing, China

ISBN: (print) 9780769545097

In Java programs, virtual methods must be resolved dynamically using type information, which greatly restricts performance. Currently, the main solution is inline caching, which falls into two categories: monomorphic inline caching and polymorphic inline caching. Monomorphic inline caching is simple to implement and therefore more commonly used, but it cannot handle call sites that frequently see objects of different types. Polymorphic inline caching can solve this problem, but it is costly. This paper presents a CAM hardware table and proposes a virtual method invocation mechanism based on software and hardware co-design. It handles call sites that frequently see objects of different types without introducing additional overhead. The experimental evaluation shows that it improves the cache hit rate from 13.3% to 76.4% and improves performance on the virtual method test by 16.2%; it improves the performance of SPECjvm98 by 6.4% on average. © 2011 IEEE.

Keywords: Virtual reality
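
The difference between the two software baselines named in the abstract can be illustrated with a toy model of a single call site. This is a minimal sketch, not the paper's mechanism: the classes `MonomorphicCache`, `PolymorphicCache`, `Circle`, and `Square` are hypothetical, and the hardware CAM table itself is not modeled.

```python
class MonomorphicCache:
    """Call-site cache that remembers only the most recent receiver type."""

    def __init__(self):
        self.cached_type = None
        self.target = None
        self.hits = 0
        self.misses = 0

    def invoke(self, obj, name):
        receiver_type = type(obj)
        if receiver_type is self.cached_type:
            self.hits += 1
        else:
            # Slow path: full method lookup, then overwrite the single entry.
            self.misses += 1
            self.cached_type = receiver_type
            self.target = getattr(receiver_type, name)
        return self.target(obj)


class PolymorphicCache:
    """Call-site cache that keeps several (type -> method) entries."""

    def __init__(self, capacity=4):
        self.capacity = capacity
        self.entries = {}
        self.hits = 0
        self.misses = 0

    def invoke(self, obj, name):
        receiver_type = type(obj)
        target = self.entries.get(receiver_type)
        if target is None:
            self.misses += 1
            target = getattr(receiver_type, name)
            if len(self.entries) < self.capacity:
                self.entries[receiver_type] = target
        else:
            self.hits += 1
        return target(obj)


class Circle:
    def area(self):
        return 3.0


class Square:
    def area(self):
        return 4.0


if __name__ == "__main__":
    receivers = [Circle(), Square()] * 4  # types alternate at one call site
    mono, poly = MonomorphicCache(), PolymorphicCache()
    for r in receivers:
        mono.invoke(r, "area")
        poly.invoke(r, "area")
    print(mono.hits, mono.misses, poly.hits, poly.misses)
```

With alternating receiver types the single-entry cache never hits, while the multi-entry cache misses only once per type. In software, the multi-entry lookup typically costs a chain of type comparisons, which is the kind of cost a parallel-match hardware table could plausibly avoid.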

Automatic Inspection Method for YOLOv5s Railway Freight Vehicles Based on Channel Pruning

Intelligent Communication and Networking (ICN), International Conference on

Authors: Shisheng Wang; Chengxin Du; Yukun Meng; Fan Gao; Yujing Cai. Affiliation: Institute of Computing Technology, Academy of Railway Sciences, Beijing, China

Railway freight safety inspection is an important component of railway transportation safety production. Traditional manual outdoor inspection methods are associated with issues such as high labor intensity, low efficiency, and the potential for oversight. To address these problems, this paper proposes a real-time automated railway freight vehicle inspection method based on channel pruning, aiming to improve the detection efficiency of freight trains and alleviate the pressure of manual inspection. First, a YOLOv5s model is constructed, consisting of functional modules such as Focus, BottleneckCSP, and SPP. Subsequently, the model is compressed using a channel pruning technique, reducing its size and enabling deployment on resource-constrained devices like small-scale machines. Finally, the model is adjusted to achieve quick and precise rail freight truck detection. The experimental findings indicate that the pruned model reduces the number of model parameters by 91.9%, decreases the model size by 11.23 MB, and achieves a mAP of 98.14%, which is only 0.183% less than the model without pruning. To demonstrate that the suggested approach is preferable, a comparison is conducted with the YOLOv5x, YOLOv5n, and YOLOv5m algorithms. The comparison results demonstrate that the proposed method has significantly faster forward inference times than YOLOv5x, YOLOv5m, and YOLOv5n, with reductions of 95.6 ms, 22 ms, and 3.4 ms, respectively. The model size is also smaller than YOLOv5x, YOLOv5m, and YOLOv5n by 168.43 MB, 39.15 MB, and 2.34 MB, respectively. Moreover, the mAP is only 0.15% and 0.07% lower than YOLOv5x and YOLOv5m, respectively. These findings show that the suggested method, which can be implemented on small devices, achieves automated inspection of railway freight cars while taking detection speed and model size into account.
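
The channel pruning step described above can be sketched in a few lines. The abstract does not state which importance criterion is used, so this example assumes a per-channel scale value (for instance the magnitude of a batch-normalization scaling factor) and drops the channels with the smallest magnitudes. The function names `select_channels` and `prune_filters` are hypothetical.

```python
def select_channels(scales, prune_ratio):
    """Return sorted indices of the channels to keep.

    scales: one importance value per output channel (assumed criterion).
    prune_ratio: fraction of channels to remove, in [0, 1).
    """
    if not 0.0 <= prune_ratio < 1.0:
        raise ValueError("prune_ratio must be in [0, 1)")
    n_prune = int(len(scales) * prune_ratio)
    # Rank channels from least to most important by absolute scale.
    order = sorted(range(len(scales)), key=lambda i: abs(scales[i]))
    return sorted(order[n_prune:])


def prune_filters(filters, keep):
    """Keep only the output-channel filters whose index is in `keep`."""
    return [filters[i] for i in keep]


if __name__ == "__main__":
    scales = [0.9, 0.01, 0.5, 0.02]
    filters = ["f0", "f1", "f2", "f3"]  # stand-ins for weight tensors
    keep = select_channels(scales, prune_ratio=0.5)
    print(keep, prune_filters(filters, keep))
```

In a real network the next layer's input channels must be sliced with the same indices, and the pruned model is usually fine-tuned afterwards, which matches the adjustment step mentioned in the abstract.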

A Claim Decomposition Benchmark for Long-form Answer Verification

arXiv, 2024

Authors: Zhang, Zhihao; Fan, Yixing; Zhang, Ruqing; Guo, Jiafeng. Affiliation: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

The advancement of large language models (LLMs) has significantly boosted performance on complex long-form question answering tasks. However, one prominent issue with LLMs is that they generate "hallucinated" responses that are not factual. Consequently, attributing each claim in a response has become a common way to improve factuality and verifiability. Existing research mainly focuses on how to provide accurate citations for the response and largely overlooks the importance of identifying the claims or statements in each response. To bridge this gap, we introduce a new claim decomposition benchmark, which requires building a system that can identify atomic and checkworthy claims in LLM responses. Specifically, we present the Chinese Atomic Claim Decomposition Dataset (CACDD), which builds on the WebCPM dataset with additional expert annotations to ensure high data quality. The CACDD comprises 500 human-annotated question-answer pairs containing a total of 4956 atomic claims. We further propose a new pipeline for human annotation and describe the challenges of this task. In addition, we provide experimental results on zero-shot, few-shot and fine-tuned LLMs as baselines. The results show that claim decomposition is highly challenging and requires further exploration. All code and data are publicly available. Copyright © 2024, The Authors. All rights reserved.

Keywords: Question answering

Detecting adult image using multiple features

International Conferences on Info-tech and Info-net (ICII)

Authors: Feng Jiao; Wen Gao; Lijuan Duan; Guoqin Cui. Affiliation: The Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

This paper presents a new method to detect naked people in an image using multiple features. A skin color model is first used to roughly detect naked skin areas. The Sobel edge operator and a Gabor filter are then used to weed out pixels that are not really human skin. Images with many naked skin areas are considered likely to contain naked people. The color coherence vector and color histogram of these images are calculated, and an SVM (support vector machine) is used to determine which of these images contain naked people.

Keywords: Skin; Histograms; Image edge detection; Gabor filters; Support vector machines; Internet; Power system modeling; Humans; Coherence; Computers
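
One of the features named in the abstract, the color histogram, is simple enough to sketch directly. The example below quantizes each RGB channel into a few bins and returns a normalized histogram that could serve as an input vector for a classifier such as an SVM. The bin count and the function name `color_histogram` are assumptions, since the abstract gives no parameters.

```python
def color_histogram(pixels, bins_per_channel=4):
    """Normalized joint RGB histogram.

    pixels: iterable of (r, g, b) tuples with values in 0..255.
    Returns a list of bins_per_channel**3 frequencies that sum to 1.
    """
    pixels = list(pixels)
    b = bins_per_channel
    hist = [0] * (b ** 3)
    for r, g, bl in pixels:
        # Map 0..255 onto 0..b-1 for each channel, then flatten the index.
        idx = (r * b // 256) * b * b + (g * b // 256) * b + (bl * b // 256)
        hist[idx] += 1
    total = len(pixels)
    if total == 0:
        return [0.0] * (b ** 3)
    return [count / total for count in hist]


if __name__ == "__main__":
    image = [(255, 0, 0), (255, 0, 0), (0, 0, 255), (0, 255, 0)]
    print(color_histogram(image, bins_per_channel=2))
```

A color coherence vector refines this idea by splitting each bin into pixels that belong to large connected regions and pixels that do not. That step is omitted here.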

A Fault Diagnosis Method of Rolling Bearing of CNC Machine Tool Based on Improved Convolutional Neural Network

11th International Conference of Information and Communication Technology, ICTech 2022

Authors: Gao, Ying; Xia, Xiaojun. Affiliations: University of Chinese Academy of Sciences, Beijing 100049, China; School of Mathematics and Computer Science, Chifeng University, Chifeng 024000, China; Shenyang Institute of Computing Technology, Chinese Academy of Sciences, Shenyang 110168, China

ISBN: (digital) 9781665496940

ISBN: (print) 9781665496940

In industrial production, rolling bearing failures occur frequently in large mechanical equipment such as CNC machine tools and seriously affect the production performance and service life of the machine tools. To identify the types of faults in rolling bearings and improve equipment safety, this paper presents a fault diagnosis method based on an improved Convolutional Neural Network (CNN). The improved CNN model adds a convolutional layer before the fully connected layer, after several convolutional and pooling layers, and uses an improved stochastic gradient descent training algorithm with momentum to speed up training and enhance the serviceability of the model. Traditional fault diagnosis methods are time-consuming, costly in labor, and inefficient. The method in this paper makes the fault diagnosis process for rolling bearings of CNC machine tools more intelligent, improves diagnostic correctness, and adapts to the characteristics of big-data fault diagnosis. Finally, the rolling bearing data set of Case Western Reserve University is used for experimental verification. The experimental results show that this method achieves high recognition accuracy across various types and severities of rolling bearing faults and has good practicability and application prospects. © 2022 IEEE.

Keywords: Failure analysis
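
The abstract mentions stochastic gradient descent with momentum as the basis of its training algorithm. The sketch below shows the classical momentum update on a one-dimensional quadratic. It is not the paper's improved variant, whose details are not given here, and the function name and hyperparameters are illustrative assumptions.

```python
def sgd_momentum(grad, w0, lr=0.1, momentum=0.9, steps=300):
    """Classical momentum update: v <- mu * v - lr * g, then w <- w + v."""
    w = w0
    v = 0.0
    for _ in range(steps):
        v = momentum * v - lr * grad(w)
        w = w + v
    return w


if __name__ == "__main__":
    # Minimize f(w) = (w - 3)^2, whose gradient is 2 * (w - 3).
    w_star = sgd_momentum(lambda w: 2.0 * (w - 3.0), w0=0.0)
    print(round(w_star, 4))
```

The velocity term accumulates past gradients, which damps oscillation across steep directions and speeds progress along shallow ones. That is the usual motivation for using momentum to shorten training.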

HCMonitor: An accurate measurement system for high concurrent network services

Authors: Song, Hui; Zhang, Wenli; Liu, Ke; Shen, Yifan; Chen, Mingyu. Affiliations: State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing, China; Frontier Research Center, Peng Cheng Laboratory, Shenzhen, China

This article aims to enhance the monitoring accuracy of highly concurrent network services. As modern network services grow rapidly in data centers, tail latency has become one of the most crucial factors determining user experience. Latency measurement and anomaly detection are essential in evaluating service performance. Existing monitoring tools can be divided into two categories according to their estimation methods. First, sampling-based approaches sample network packets to lighten the measurement load. Second, full-traffic approaches such as wrk analyze all packets from the kernel network stack and fold client-side overhead into the measured response delay. We therefore propose a high-performance monitoring system named HCMonitor, which computes the server-side response latency and the per-request round-trip time. It supports full-traffic monitoring by relying on userspace processing, "zero copy" and pipelining. By using switch mirroring, the measured latency excludes the kernel network stack overhead and the client-side queuing delay. This approach provides improved accuracy, online analysis, anomaly detection and real-time display, and is transparent to network services. Our evaluations show that HCMonitor achieves over 200 times the throughput of tcpdump. Compared with wrk, the tail latency accuracy shows an increase by up to 72%–76% in high concurrent networks. © 2021 John Wiley & Sons Ltd.

Keywords: Anomaly detection
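
Tail latency, the metric the abstract centers on, is usually reported as a high percentile of per-request latency. The sketch below pairs request and response timestamps by request id and computes a nearest-rank percentile. The event format and the names `latencies_from_events` and `percentile` are hypothetical and are not HCMonitor's interfaces.

```python
import math


def latencies_from_events(events):
    """Pair request/response timestamps by request id.

    events: iterable of (timestamp, request_id, kind) with kind in
    {"req", "resp"}. Returns a list of response-minus-request latencies.
    """
    pending = {}
    latencies = []
    for ts, req_id, kind in events:
        if kind == "req":
            pending[req_id] = ts
        elif kind == "resp" and req_id in pending:
            latencies.append(ts - pending.pop(req_id))
    return latencies


def percentile(samples, p):
    """Nearest-rank percentile for 0 < p <= 100."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)
    return ordered[rank - 1]


if __name__ == "__main__":
    samples = [1] * 98 + [50, 100]  # mostly fast, two slow requests
    print(sum(samples) / len(samples), percentile(samples, 99))
```

In this toy data the mean is 2.48 while the 99th percentile is 50, which shows why averages hide the slow requests that tail-latency monitoring is meant to expose.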

GramsDet: Hardware Trojan Detection Based on Recurrent Neural Network

Asian Test Symposium (ATS)

Authors: Renjie Lu; Haihua Shen; Yu Su; Huawei Li; Xiaowei Li. Affiliations: University of Chinese Academy of Sciences; Institute of Computing Technology, Chinese Academy of Sciences

ISBN: (digital) 9781728126951

ISBN: (print) 9781728126968

Hardware Trojans (HTs) have attracted increasing attention from academia and industry because of the significant threat they pose. In this paper, we propose a novel approach, named GramsDet, to detect HTs by capturing suspicious circuit connection structures using a recurrent neural network. GramsDet assumes that HTs are usually inserted into regions with low transition probability, so the circuit fragments associated with HTs should have special connection structures. GramsDet models the target circuit using an n-gram circuit segmentation technique and implements "gate embedding" with an order-sensitive co-occurrence matrix. Then, a stacked long short-term memory network is designed to build a robust HT detection model. The experimental results on different benchmarks show that GramsDet can effectively detect Trojan logic without a "Golden model" of the circuit under detection (CUD).

Keywords: Logic gates; Integrated circuit modeling; Trojan horses; Recurrent neural networks; Natural language processing; Directed graphs; Training
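
The n-gram segmentation and order-sensitive co-occurrence counting mentioned in the abstract can be illustrated on a toy gate sequence. How GramsDet actually traverses a netlist is not described here, so this sketch simply assumes a list of gate types along one path. The function names are hypothetical.

```python
from collections import Counter


def ngrams(sequence, n):
    """All contiguous windows of length n over a sequence."""
    return [tuple(sequence[i:i + n]) for i in range(len(sequence) - n + 1)]


def ordered_cooccurrence(grams):
    """Count adjacent (left, right) pairs inside each n-gram.

    The pair (a, b) is counted separately from (b, a), which is what
    makes the resulting matrix order-sensitive.
    """
    counts = Counter()
    for gram in grams:
        for left, right in zip(gram, gram[1:]):
            counts[(left, right)] += 1
    return counts


if __name__ == "__main__":
    path = ["AND", "OR", "XOR", "AND"]  # assumed gate types along one path
    grams = ngrams(path, 2)
    print(grams)
    print(ordered_cooccurrence(grams))
```

Rows of such a count matrix (or a factorization of it) can serve as per-gate feature vectors, and the resulting sequences of vectors are the kind of input a recurrent model such as an LSTM consumes.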

ITERTL: An Iterative Framework for Fine-tuning LLMs for RTL Code Generation

arXiv, 2024

Authors: Wu, Peiyang; Guo, Nan; Xiao, Xiao; Li, Wenming; Ye, Xiaochun; Fan, Dongrui. Affiliation: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

Recently, large language models (LLMs) have demonstrated excellent performance in understanding human instructions and generating code, which has inspired researchers to explore the feasibility of generating RTL code with LLMs. However, existing approaches to fine-tuning LLMs on RTL code are typically conducted on fixed datasets, which do not fully stimulate the capability of LLMs and require large amounts of reference data. To mitigate these issues, we introduce a simple yet effective iterative training paradigm named ITERTL. During each iteration, samples are drawn from the model trained in the previous cycle. These new samples are then used for training in the current cycle. Through this iterative approach, the distribution mismatch between the model and the training samples is reduced. Additionally, the model is able to explore a broader generative space and receive more comprehensive feedback. Theoretical analyses are conducted to investigate why the approach is effective. Experimental results show that the model trained with our proposed approach can compete with and even outperform the state-of-the-art (SOTA) open-source model using nearly 37% of the reference samples, achieving pass@1 rates of 42.9% and 62.2% on two VerilogEval evaluation datasets, respectively. When using the same amount of reference samples, our method achieves relative improvements of 16.9% and 12.5% in pass@1 compared to the non-iterative method. This study facilitates the application of LLMs for generating RTL code in practical scenarios with limited data. © 2024, CC BY.

Keywords: Iterative methods
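
The abstract reports results as pass@1 and as relative improvements. The sketch below shows one common way to estimate pass@k from n generated samples of which c pass the tests, together with the relative-improvement formula. Whether the paper uses exactly this estimator is an assumption, and the function names are illustrative.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k from n samples with c correct ones.

    Computes 1 - C(n - c, k) / C(n, k): the probability that a random
    subset of k samples contains at least one correct sample.
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        return 1.0  # every size-k subset must contain a correct sample
    return 1.0 - comb(n - c, k) / comb(n, k)


def relative_improvement(new: float, baseline: float) -> float:
    """Relative gain of `new` over `baseline`, as a fraction."""
    return (new - baseline) / baseline


if __name__ == "__main__":
    print(pass_at_k(n=10, c=3, k=1))
    print(relative_improvement(0.6, 0.5))
```

For k = 1 the estimator reduces to c / n, the fraction of generated samples that pass. A relative improvement is measured against the baseline score, so it is not the same as a difference in percentage points.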

Towards More Efficient And Effective Inference: The Joint Decision Of Multi-Participants

IEEE International Conference on Image Processing

Authors: Hui Zhu; Zhulin An; Kaiqiang Xu; Xiaolong Hu; Yongjun Xu. Affiliation: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

ISBN: (digital) 9781728163956

ISBN: (print) 9781728163963

Existing approaches that improve the performance of convolutional neural networks by optimizing local architectures or deepening the networks tend to increase model size significantly. To deploy neural networks on edge devices, which are in great demand, reducing the scale of the networks is crucial. However, compressing the networks can easily degrade image processing performance. In this paper, we propose a method that is suitable for edge devices while improving the efficiency and effectiveness of inference. The joint decision of multi-participants, which mainly comprise multiple layers and multiple networks, can achieve higher classification accuracy (by up to 0.26% on CIFAR-10 and 4.49% on CIFAR-100) with a similar total number of parameters compared with classical convolutional neural networks.

Keywords: Training; Computer architecture; Convolutional neural networks; Standards; Image edge detection
