Search Results - Inner Mongolia University Library

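Advanced search help

Field codes:

T = title (book title, article title), A = author (responsible party), K = subject term, P = publication name, PU = publisher name, O = organization (author affiliation, degree-granting institution, patent applicant), L = Chinese Library Classification number, C = subject classification number, U = all fields, Y = year (year of publication, degree year, standard release year)

Search rules:

AND means "and"; OR means "or"; NOT means "does not contain". Operators must be uppercase, with one space on each side.

Search examples:

Example 1: (K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016 (subject term "library science" or "information science", author 范并思, years 1982 to 2016)
Example 2: P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016 (publication "Computer Applications and Software", any field containing C++ or Basic, subject term not Visual, years 2011 to 2016)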
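
The field codes and operator rules above can be sketched as a small query builder. This is only an illustrative sketch: the helper names `term`, `any_of`, and `all_of` are hypothetical and are not part of the library's system; the code only assembles the query string in the documented syntax (uppercase operators, one space on each side) and does not submit it anywhere.

```python
from typing import Iterable

# Field codes copied from the help text above.
FIELD_CODES = {"T", "A", "K", "P", "PU", "O", "L", "C", "U", "Y"}


def term(field: str, value: str) -> str:
    """Return a single FIELD=value term, rejecting unknown field codes."""
    if field not in FIELD_CODES:
        raise ValueError(f"unknown field code: {field!r}")
    return f"{field}={value}"


def any_of(terms: Iterable[str]) -> str:
    """Join terms with OR and wrap the group in parentheses."""
    return "(" + " OR ".join(terms) + ")"


def all_of(terms: Iterable[str]) -> str:
    """Join terms with AND (uppercase, one space on each side)."""
    return " AND ".join(terms)


# Rebuild Example 1 from the help text.
query = all_of([
    any_of([term("K", "图书馆学"), term("K", "情报学")]),
    term("A", "范并思"),
    term("Y", "1982-2016"),
])
print(query)
```

Under the same assumptions, the search condition used for this result list would be written as `term("K", "Algorithm acceleration")`.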



Search condition: "Subject term = Algorithm acceleration"

28 records in total; showing 21-30

Acceleration and implementation of JPEG2000 encoder on TI DSP platform

IEEE International Conference on Image Processing (ICIP 2007)

Authors: Liu, Chien-Chih; Hang, Hsueh-Ming (Natl Chiao Tung Univ, Dept Elect Engn, Hsinchu, Taiwan)

ISBN: (print) 9781424414369

JPEG2000 provides excellent compression performance and fine granularity scalability but at the cost of high computational complexity. We propose two speed-up techniques and use the TI DSP optimization tools to accelerate the Tier1 module. We eliminate the unnecessary checking cycles by recording the NBC (Need-to-Be-Coded) samples on a list. Furthermore, the sample index is reordered to facilitate fast execution. In the DSP implementation of the proposed methods, we use code acceleration techniques, cache memory allocation, and TI DSP compiler-level optimization tools. Even when the original program is compiled with the same DSP optimization tools and proper cache assignment, our fast algorithm can still reduce the computation by 45%.

Keywords: JPEG2000; DSP; algorithm acceleration

Fast bit gather, bit scatter and bit permutation instructions for commodity microprocessors


17th IEEE International Conference on Application-Specific Systems, Architectures and Processors

Authors: Hilewitz, Yedidya; Lee, Ruby B. (Princeton Univ, PALMS, Dept Elect Engn, Princeton, NJ 08544, USA)

Advanced bit manipulation operations are not efficiently supported by commodity word-oriented microprocessors. Programming tricks are typically devised to shorten the long sequence of instructions needed to emulate these complicated bit operations. As these bit manipulation operations are relevant to applications that are becoming increasingly important, we propose direct support for them in microprocessors. In particular, we propose fast bit gather (or parallel extract), bit scatter (or parallel deposit) and bit permutation instructions (including group, butterfly and inverse butterfly). We show that all these instructions can be implemented efficiently using both the fast butterfly and inverse butterfly network datapaths. Specifically, we show that parallel deposit can be mapped onto a butterfly circuit and parallel extract can be mapped onto an inverse butterfly circuit. We define static, dynamic and loop invariant versions of the instructions, with static versions utilizing a much simpler functional unit. We show how a hardware decoder can be implemented for the dynamic and loop-invariant versions to generate, dynamically, the control signals for the butterfly and inverse butterfly datapaths. The simplest functional unit we propose is smaller and faster than an ALU. We also show that these instructions yield significant speedups over a basic RISC architecture for a variety of different application kernels taken from applications domains including bioinformatics, steganography, coding, compression and random number generation.

Keywords: bit manipulations; permutations; bit scatter; bit gather; parallel extract; parallel deposit; pack; unpack; microprocessors; instruction set architecture; ISA; algorithm acceleration; bioinformatics; pattern matching; compression; steganography; cryptology
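
As a rough illustration of the two operations named in the abstract above, the following Python sketch gives bit-at-a-time reference semantics for parallel extract (bit gather) and parallel deposit (bit scatter). It assumes the conventional definitions, the same ones used by the x86 BMI2 PEXT and PDEP instructions. The function names are hypothetical, and the loops model behaviour only, not the butterfly-network hardware the paper proposes.

```python
def parallel_extract(x: int, mask: int) -> int:
    """Gather the bits of x selected by mask into the low-order bits of the result."""
    result = 0
    out_pos = 0
    while mask:
        lowest = mask & -mask      # isolate the lowest set bit of the mask
        if x & lowest:
            result |= 1 << out_pos
        out_pos += 1
        mask &= mask - 1           # clear that mask bit and continue
    return result


def parallel_deposit(x: int, mask: int) -> int:
    """Scatter the low-order bits of x to the positions of the set bits in mask."""
    result = 0
    in_pos = 0
    while mask:
        lowest = mask & -mask      # next destination position
        if (x >> in_pos) & 1:
            result |= lowest
        in_pos += 1
        mask &= mask - 1
    return result


# Extract the upper nibble of a byte, then put it back in place.
nibble = parallel_extract(0b10110010, 0b11110000)
restored = parallel_deposit(nibble, 0b11110000)
print(bin(nibble), bin(restored))
```

With these definitions, depositing an extracted value under the same mask returns the original value with all unselected bits cleared.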

THE LEARNED INEXACT PROJECT GRADIENT DESCENT ALGORITHM

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Authors: Giryes, Raja; Eldar, Yonina C.; Bronstein, Alex M.; Sapiro, Guillermo (Tel Aviv Univ, Sch Elect Engn, IL-69978 Tel Aviv, Israel; Technion IIT, Elect Engn Dept, IL-32000 Haifa, Israel; Technion IIT, Comp Sci Dept, IL-32000 Haifa, Israel; Duke Univ, Elect & Comp Engn Dept, Durham, NC 27708, USA)

ISBN: (print) 9781538646588

Accelerating iterative algorithms for solving inverse problems using neural networks has become a very popular strategy in recent years. In this work, we propose a theoretical analysis that may provide an explanation for its success. Our theory relies on the usage of inexact projections with the projected gradient descent (PGD) method. It is demonstrated in various problems including image super-resolution.

Keywords: Inverse Problems; Sparse Representation; Deep Learning; LISTA; algorithm acceleration

R³SGM: Real-time Raster-Respecting Semi-Global Matching for Power-Constrained Systems

17th International Conference on Field-Programmable Technology (FPT)

Authors: Rahnama, Oscar; Cavallari, Tommaso; Golodetz, Stuart; Walker, Simon; Torr, Philip H. S. (Univ Oxford, Oxford, England)

ISBN: (print) 9781728102139

Stereo depth estimation is used for many computer vision applications. Though many popular methods strive solely for depth quality, for real-time mobile applications (e.g. prosthetic glasses or micro-UAVs), speed and power efficiency are equally, if not more, important. Many real-world systems rely on Semi-Global Matching (SGM) to achieve a good accuracy vs. speed balance, but power efficiency is hard to achieve with conventional hardware, making the use of embedded devices such as FPGAs attractive for low-power applications. However, the full SGM algorithm is ill-suited to deployment on FPGAs, and so most FPGA variants of it are partial, at the expense of accuracy. In a non-FPGA context, the accuracy of SGM has been improved by More Global Matching (MGM), which also helps tackle the streaking artifacts that afflict SGM. In this paper, we propose a novel, resource-efficient method that is inspired by MGM's techniques for improving depth quality, but which can be implemented to run in real time on a low-power FPGA. Through evaluation on multiple datasets (KITTI and Middlebury), we show that in comparison to other real-time capable stereo approaches, we can achieve a state-of-the-art balance between accuracy, power efficiency and speed, making our approach highly desirable for use in real-time systems with limited power.

Keywords: algorithm acceleration; Depth; FPGA; Image Processing; Low Power; Real Time; Stereo; Zynq

Accelerating k-NN Algorithm with Hybrid MPI and OpenSHMEM

2nd Workshop OpenSHMEM and Related Technologies

Authors: Lin, Jian; Hamidouche, Khaled; Zhang, Jie; Lu, Xiaoyi; Vishnu, Abhinav; Panda, Dhabaleswar (Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210, USA; Pacific NW Natl Lab, Richland, WA 99352, USA)

ISBN: (print) 9783319264288; 9783319264271

Machine learning algorithms are benefiting from the continuous improvement of programming models, including MPI, MapReduce and PGAS. The k-Nearest Neighbors (k-NN) algorithm is a widely used machine learning algorithm, applied to supervised learning tasks such as classification. Several parallel implementations of k-NN have been proposed in the literature and practice. However, on high-performance computing systems with high-speed interconnects, it is important to further accelerate existing designs of the k-NN algorithm by taking advantage of scalable programming models. To improve the performance of k-NN in large-scale environments with an InfiniBand network, this paper proposes several alternative hybrid MPI+OpenSHMEM designs and performs a systemic evaluation and analysis on typical workloads. The hybrid designs leverage one-sided memory access to better overlap communication with computation than the existing pure MPI design, and propose better schemes for efficient buffer management. The implementation, based on the k-NN program from the MaTEx toolkit with MVAPICH2-X (Unified MPI+PGAS Communication Runtime over InfiniBand), shows up to 9.0% time reduction for training the KDD Cup 2010 workload over 512 cores, and 27.6% time reduction for a small workload with balanced communication and computation. Experiments with a varied number of cores show that our design can maintain good scalability.

Keywords: MPI; OpenSHMEM; Hybrid programming model; algorithm acceleration

Akcelerace algoritmů Lattice-Boltzmann pro modelování toku krve v mozku (Acceleration of Lattice-Boltzmann algorithms for modelling blood flow in the brain)

Author: Kompová, Radmila (Brno University of Technology)

This thesis deals with the implementation and possible optimizations of the lattice-Boltzmann method. The method models fluid flow by simulating the movement of fictitious particles. The thesis focuses on possible improvements to the existing tool HemeLB, which specializes in simulating blood flow in the brain. Among other things, it examines vectorization and parallelization techniques whose implementation could benefit this tool. The thesis includes the implementation of an application that compares several selected algorithms for the lattice-Boltzmann method, including their possible optimizations. It also includes tests that compare these algorithms by achieved performance, cache usage, and overall memory consumption. The best performance achieved was 150 million lattice point updates per second.

Keywords: Lattice-Boltzmann method; bloodflow modeling; HemeLB; algorithm acceleration; vectorization; parallelization; OpenMP

The Learned Inexact Project Gradient Descent Algorithm

IEEE International Conference on Acoustics, Speech and Signal Processing

Authors: Raja Giryes; Yonina C. Eldar; Alex M. Bronstein; Guillermo Sapiro (School of Electrical Engineering, Tel Aviv University, Tel Aviv, Israel 69978; Electrical Engineering Department, Technion - IIT, Haifa 32000, Israel; Computer Science Department, Technion - IIT, Haifa 32000, Israel; Electrical and Computer Engineering Department, Duke University, Durham, NC 27708)

ISBN: (print) 9781538646595

Keywords: Inverse Problems; Sparse Representation; Deep Learning; LISTA; algorithm acceleration

Performance of the Bitonic Mergesort Network on a Dataflow Computer


Telecommunications Forum Telfor

Authors: Vukasin Rankovic; Anton Kos; Saso Tomazic; Veljko Milutinovic (School of Electrical Engineering, University of Belgrade; Faculty of Electrical Engineering, University of Ljubljana)

ISBN: (print) 9781479914180

High-speed computing and growing amounts of data are driving the quest for ever faster sorting algorithms. Sorting networks that execute parallel sorting, together with the dataflow computational paradigm, are offered as a possible solution. In the presented experiments, the bitonic mergesort algorithm is implemented on an entry model of the Maxeler dataflow supercomputing system. Our results show that sorting small arrays on Maxeler achieves a speedup factor of 16 compared to the fastest sorting algorithm on a CPU. Using more advanced Maxeler systems, we expect to be able to sort larger arrays and achieve greater speedups.

Keywords: Dataflow computing; Bitonic mergesort; Parallel sorting; algorithm acceleration; Sorting network


Address: No. 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
