During the last decade, processor architectures have emerged with hundreds to thousands of high-speed processing cores on a single chip. These cores can work in parallel to share a workload for faster execution. This paper presents performance evaluations on such multicore and many-core devices by mapping a computationally expensive correlation kernel of a template matching process using various programming models. The work first establishes a baseline performance case with a sequential mapping of the algorithm on an Intel processor. In the second step, performance is enhanced by a parallel mapping of the kernel on a shared-memory multicore machine using the OpenMP programming model. Finally, the Normalized Cross-Correlation (NCC) kernel is scaled to a many-core K20 GPU using the CUDA programming model. At every step, correctness of the implementation is verified by comparing computed data with reference results from a high-level MATLAB implementation. Performance results are presented with various optimization techniques for the MATLAB, sequential, OpenMP, and CUDA-based implementations. The results show that the GPU-based implementation achieves 32x and 5x speed-ups over the baseline and multicore implementations, respectively. Moreover, using inter-block sub-sampling on an 8-bit 4000×4000 gray-scale reference image reduces the execution time to 2.8 s with an error growth of less than 20% for the selected templates of size 96×96.
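As context for the abstract above, the NCC measure at the heart of template matching can be sketched in a few lines of Python. This is a generic, unoptimized illustration of the correlation kernel, not the paper's MATLAB/OpenMP/CUDA implementation; the function names and the exhaustive-search strategy are ours:

```python
import numpy as np

def ncc(window, template):
    """Normalized cross-correlation between an image window and a
    template of the same shape; returns a score in [-1, 1]."""
    w = window - window.mean()
    t = template - template.mean()
    denom = np.sqrt((w * w).sum() * (t * t).sum())
    return float((w * t).sum() / denom) if denom else 0.0

def match(image, template):
    """Exhaustive search: slide the template over the image and
    return the top-left corner of the best-scoring window."""
    th, tw = template.shape
    best, best_pos = -2.0, (0, 0)
    for r in range(image.shape[0] - th + 1):
        for c in range(image.shape[1] - tw + 1):
            score = ncc(image[r:r + th, c:c + tw], template)
            if score > best:
                best, best_pos = score, (r, c)
    return best_pos, best
```

The doubly nested sliding-window loop is what makes the kernel computationally expensive and, at the same time, embarrassingly parallel: each window score is independent, which is what the OpenMP and CUDA mappings exploit.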
This work presents a high-speed implementation of the Pickup and Delivery Problem with Time Windows (PDPTW) on a GPU cluster. The problem represents a major class of logistics problems. The software is tested on an 8-node GPU cluster equipped with two Tesla M2050 (448-core) cards per node. The results show a speedup of nearly 7x for a small problem and 43x using 4 nodes for a large problem. The presentation discusses several factors that affect the performance.
ISBN:
(print) 0780379411
To speed up cloth simulation while achieving realistic simulation results, a local adaptive Catmull-Clark subdivision is adopted in this paper. During the growing phase of cloth simulation, new particles are added to the system when the cloth grids collide with rigid objects. The cloth grids are merged later when coarse grids are sufficient. A mass-spring model is developed for rectangular grids. Collision detection and response methods for local adaptive subdivision are also described. Exploiting the cache-control instruction set and SIMD instructions allows nearly 100% FPU utilization. The algorithm supports trade-offs between simulation time and realism.
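For readers unfamiliar with the mass-spring model mentioned above, a minimal explicit-Euler time step can be sketched as follows. This is a generic textbook formulation, not the paper's SIMD-optimized rectangular-grid implementation; all names and constants are illustrative:

```python
import numpy as np

def step(pos, vel, springs, rest, k, mass, dt, damping=0.98):
    """One explicit-Euler step of a mass-spring system.
    pos, vel : (N, 3) particle positions and velocities
    springs  : list of (i, j) particle-index pairs
    rest     : rest length of each spring
    k, mass  : spring stiffness and per-particle mass
    """
    force = np.zeros_like(pos)
    for (i, j), r0 in zip(springs, rest):
        d = pos[j] - pos[i]
        length = np.linalg.norm(d)
        if length > 0:
            # Hooke's law: pull the endpoints toward the rest length
            f = k * (length - r0) * (d / length)
            force[i] += f
            force[j] -= f
    vel = (vel + force / mass * dt) * damping  # damped velocity update
    return pos + vel * dt, vel
```

In an adaptive scheme like the one described, subdividing a grid adds particles and springs to these arrays, and merging removes them again; the per-spring loop is also the part that vectorizes well with SIMD.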
ISBN:
(print) 0780374886
Multiply and multiply-accumulate (MAC) instructions (see ARM DDI 0100E, ARM Architecture Reference Manual) are fundamental instructions in DSP applications. In an embedded digital signal processing (DSP) core and a high-performance enhanced-DSP-instruction processor core, an efficient implementation of multiply and MAC instructions is very important. An algorithm for the VLSI implementation of 32×32 multiply and MAC instructions using a 32×8 multiplier-accumulator is presented. The 32×32 multiplication is achieved by four 32×8 multiplications: the result of one 32×8 multiplication serves as a partial product of the next 32×8 operation, and when the results of four such multiplications have been accumulated, the 32×32 result is obtained. Only the 32×8 multiplication is implemented in hardware, as a Booth multiplier. This implementation of the multiply and MAC instructions offers a good trade-off between a serial multiplier and a parallel multiplier.
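The byte-sliced decomposition described above can be modeled behaviorally in software. The following Python sketch mirrors the datapath (four 32×8 partial products, each shifted and accumulated); it is a functional model only, not the hardware Booth multiplier:

```python
MASK32 = (1 << 32) - 1

def mul32x8(a, b8):
    """Behavioral model of the hardware 32x8 multiplier:
    a is a 32-bit unsigned operand, b8 an 8-bit multiplier slice."""
    return (a & MASK32) * (b8 & 0xFF)

def mul32x32(a, b):
    """32x32 unsigned multiplication built from four 32x8
    multiplications. Each partial product is shifted by 8*i bits
    and accumulated, so each step's accumulator result feeds the
    next, as in the described MAC datapath."""
    acc = 0
    for i in range(4):  # consume the multiplier one byte at a time
        byte = (b >> (8 * i)) & 0xFF
        acc += mul32x8(a, byte) << (8 * i)
    return acc
```

The model makes the area/latency trade-off concrete: one small 32×8 multiplier is reused over four cycles instead of instantiating a full 32×32 parallel array.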
Multiple-input multiple-output (MIMO) serves as a key technique for modern wireless communication systems but brings big challenges in data detection. Belief propagation (BP) detection is attractive for its near-optimal error rate performance and strong robustness. However, state-of-the-art BP detection algorithms suffer from either exponentially increasing computational complexity or sub-optimal error rate performance. To address this issue, this paper proposes an efficient variant of the real-domain GAI BP (RD-GAI-BP), called the improved RD-GAI-BP, which achieves better error rate performance at the expense of an acceptable complexity increase. Numerical results demonstrate that, in a MIMO scenario with N_r = 16, N_t = 8 and 64-QAM modulation, the proposed improved RD-GAI-BP earns more than 2 dB of SNR gain at a BER of 10^-3 compared with state-of-the-art RD-GAI-BP detection.
ISBN:
(digital) 9781665406949
ISBN:
(print) 9781665406956
To reduce the processing delay of sequentially running virtual network functions (VNFs) in a service function chain (SFC), network function parallelism (NFP) has been introduced, allowing VNFs of the SFC to run in parallel. Existing NFP solutions focus on improving parallelism benefits without paying much attention to resource utilization when deploying the VNFs of SFCs. We take advantage of the resource-delay dependency to propose a flexible and efficient parallelized SFC placement mechanism called FlexSFC, which determines the optimal SFC placement while reducing resource usage and meeting the end-to-end delay guarantees of the deployed SFCs. Initial results show that FlexSFC meets the end-to-end delay requirement with better resource utilization and SFC acceptance rate than state-of-the-art approaches.
In a real-time application, a transaction may be assigned a value reflecting the profit of completing it before its deadline. Satisfying both goals of maximizing the total obtained profit and minimizing the number of missed transactions at the same time is a challenge. The authors present an adaptive real-time scheduling policy named value-based processor allocation (VPA-k) for scheduling value-based transactions in a multiprocessor real-time database system. Under the VPA-k policy, transactions with higher values are given higher priorities to execute first, while at most k percent of the processors are dynamically allocated to execute urgent transactions. Through simulation experiments, VPA-k is shown to substantially outperform other scheduling policies in both maximizing the total obtained profit and minimizing the number of missed transactions under various system environments.
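The allocation idea behind VPA-k can be illustrated with a toy selection function. This is only a sketch of the split described in the abstract (a k% urgent quota plus value-ordered allocation of the rest); the paper's actual admission and urgency criteria may differ, and all field names here are hypothetical:

```python
def assign(transactions, n_procs, k):
    """Toy VPA-k-style allocation: at most k% of the processors go
    to the most urgent transactions (earliest deadlines); remaining
    processors go to the highest-value transactions."""
    urgent_quota = int(n_procs * k / 100)
    by_deadline = sorted(transactions, key=lambda t: t["deadline"])
    urgent = by_deadline[:urgent_quota]
    remaining = [t for t in transactions if t not in urgent]
    by_value = sorted(remaining, key=lambda t: -t["value"])
    return urgent + by_value[:n_procs - len(urgent)]
```

The quota caps how much capacity urgency can claim, so high-value transactions are never fully starved by a burst of low-value urgent ones, which is how the policy pursues both goals at once.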
Combining existing MIMO schemes with multiuser detectors for the uplink suffers from high computational complexity and channel ***. Therefore, in this paper we propose a MIMO multiuser detection (MUD) scheme that considerably reduces the system's computational complexity. The proposed algorithm adopts the inverse channel matrix for MIMO decoding, which is not sensitive to the coherency of ***. Because of the scattering characteristic of the MIMO channel, the inverse channel matrices are always nonsingular, so the receivers can obtain a stable spatial diversity gain. The MUD algorithm can be realized with a parallel modular *** based on a Minimum Mean Square Error (MMSE) ***. Results show that our MIMO-MUD performs much better than existing MIMO-MUD at the same order of complexity, even though the MIMO CDMA system has only two antennas at each base station and two antennas at each mobile station.
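The MMSE detection step the abstract refers to is commonly written as x̂ = (HᴴH + σ²I)⁻¹Hᴴy, where the σ²I regularization keeps the matrix to invert well conditioned. A minimal NumPy sketch of this textbook form (not the authors' parallel modular receiver) is:

```python
import numpy as np

def mmse_detect(H, y, noise_var):
    """Linear MMSE MIMO detection: solve
    (H^H H + sigma^2 I) x_hat = H^H y.
    The noise_var * I term makes the system nonsingular even for
    poorly conditioned channel matrices H."""
    n_tx = H.shape[1]
    G = H.conj().T @ H + noise_var * np.eye(n_tx)
    return np.linalg.solve(G, H.conj().T @ y)
```

As the noise variance goes to zero this reduces to the zero-forcing (channel-inversion) detector, which connects it to the inverse-channel-matrix decoding described above.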
ISBN:
(digital) 9798350371864
ISBN:
(print) 9798350371871
The article proposes a solution to restructure Convolutional Neural Network (CNN) architectures by integrating parameter quantization techniques with traditional CNN models that can be deployed on Field-Programmable Gate Array (FPGA) hardware, for evaluating the classification performance of two-dimensional data in real-world scenarios. The solution introduces an additional quantum layer before the final classification layer of the CNN to receive standardized outputs, followed by computations in the Hilbert vector space to generate probability values for assessing classification results. The quantization process helps the model quickly identify data features while exploiting the parallel computing capability of the FPGA hardware. The model is evaluated on the MNIST handwritten-digit dataset, revealing two advantages: processing time on the FPGA is four times faster than using only the Central Processing Unit (CPU) of the PYNQ-Z2 kit board, and classification accuracy is higher with the quantum layer than without it for the same number of training iterations. These results demonstrate the feasibility of hardware-accelerated AI algorithms combined with quantum algorithms in real applications.
ISBN:
(print) 9781479987962
Digital signal processors (DSPs) play an important role in signal processing, wireless communication, and many other fields. With the improvement of DSP computing performance, the memory architecture has become the bottleneck of overall DSP efficiency. A new memory architecture that can be accessed in parallel by two computation slots, the DMA controller, the debug module, and the Wishbone bus is presented in this paper. The data memory capacity is 1 MB and the instruction memory capacity is 256 KB. After synthesis, placement, and routing in a commercial 65 nm low-power process, the area of the data memory is about 8,600,600 μm² and the area of the instruction memory is about 2,140,000 μm². The delay of the data memory is 1.65 ns (SS corner), and the delay of the instruction memory is 1.78 ns (SS corner).