检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

2,694 篇 会议
58 册 图书
53 篇 期刊文献

馆藏范围

2,805 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

1,844 篇 工学
- 1,629 篇 计算机科学与技术...
- 847 篇 软件工程
- 340 篇 电气工程
- 222 篇 电子科学与技术（可...
- 209 篇 信息与通信工程
- 84 篇 控制科学与工程
- 63 篇 光学工程
- 57 篇 机械工程
- 41 篇 仪器科学与技术
- 39 篇 生物医学工程（可授...
- 38 篇 生物工程
- 31 篇 材料科学与工程（可...
- 25 篇 动力工程及工程热...
- 21 篇 化学工程与技术
- 20 篇 建筑学
- 15 篇 土木工程
- 13 篇 力学（可授工学、理...
- 12 篇 交通运输工程
499 篇 理学
- 343 篇 数学
- 113 篇 物理学
- 51 篇 系统科学
- 48 篇 生物学
- 30 篇 统计学（可授理学、...
- 26 篇 化学
173 篇 管理学
- 119 篇 管理科学与工程(可...
- 62 篇 图书情报与档案管...
- 49 篇 工商管理
40 篇 医学
- 30 篇 临床医学
- 14 篇 基础医学(可授医学...
15 篇 法学
- 15 篇 社会学
9 篇 经济学
9 篇 农学
8 篇 文学
2 篇 军事学
1 篇 教育学

主题

363 篇 parallel process...
219 篇 computer archite...
205 篇 graphics process...
146 篇 parallel archite...
136 篇 graphics process...
129 篇 hardware
116 篇 parallel algorit...
112 篇 image processing
99 篇 computational mo...
94 篇 concurrent compu...
87 篇 instruction sets
86 篇 field programmab...
83 篇 algorithm design...
79 篇 multicore proces...
77 篇 signal processin...
76 篇 parallel process...
66 篇 parallel program...
60 篇 throughput
60 篇 gpu
59 篇 kernel

机构

11 篇 natl univ def te...
6 篇 college of compu...
6 篇 school of comput...
6 篇 hosei univ dept ...
6 篇 natl univ def te...
5 篇 univ aizu dept c...
5 篇 carleton univ sc...
5 篇 school of comput...
5 篇 computer science...
5 篇 inria rennes
5 篇 city university ...
4 篇 chinese acad sci...
4 篇 univ michigan ad...
4 篇 institute of com...
4 篇 univ chinese aca...
4 篇 school of comput...
4 篇 univ jaume 1 dep...
4 篇 hainan internati...
4 篇 tech univ cluj n...
4 篇 department of co...

作者

11 篇 jack dongarra
10 篇 roman wyrzykowsk...
9 篇 konrad karczewsk...
9 篇 quintana-orti en...
7 篇 dongarra jack
7 篇 kothapalli kisho...
6 篇 hannig frank
6 篇 liu jie
6 篇 su jinshu
6 篇 nakano koji
6 篇 peng shietung
6 篇 li yamin
6 篇 chu wanming
6 篇 wyrzykowski roma...
6 篇 thulasiraman par...
5 篇 ito yasuaki
5 篇 jerzy waśniewski
5 篇 wang guojun
5 篇 geyong min
5 篇 wanlei zhou

语言

2,744 篇 英文
36 篇 其他
18 篇 中文
11 篇 俄文
2 篇 乌克兰文
1 篇 西班牙文

检索条件"任意字段=10th International Conference on Algorithms and Architectures for Parallel Processing"

共 2805 条记录，以下是311-320 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

NPV: Fast Network Policy Verification for Cloud-Native Networking

NPV: Fast Network Policy Verification for Cloud-Native Netwo...

引用

international conference on Distributed Computing Systems

作者： Shunbin Dong Yumin Xie Jin Zhao Kun Qiu School of Computer Science Fudan University Shanghai China Shanghai Key Laboratory of Intelligent Information Processing Shanghai China

ISBN: (数字)9798350386059

ISBN: (纸本)9798350386066

Network policy plays a crucial role in cloud-native networking, especially in multi-tenant scenarios. It provides precise control over connectivity by specifying source and destination endpoints, traffic types, and other criteria to allow or deny traffic. However, manual configuration of these policies introduces the risk of errors, leading to isolation violations or network service unavailability. therefore, network policy verification is essential for maintaining security and quality of service in cloud-native networking. Currently, a naïve approach involves individually checking each policy within the cluster, which can take over 100s for verification in a cluster size of over 100k. Existing verification frameworks, like Kano and Verikube, improve performance by leveraging pre-filtering and Satisfiability Modulo theories (SMT) solvers, achieving a 3.12x to 12.99x performance boost over the naïve baseline. However, as network policy changes rapidly within 100ms in real cloud-native networks, both frameworks need over 10s to perform verification for cluster sizes over 100k, which is far from satisfying. To overcome these issues, we propose and implement a novel network policy verification framework NPV, which utilizes the policy-label pre-filter process with bitwise compression. We further enhance the policy verification algorithm with a policy-namespace divide-and-conquer strategy to improve the data-level parallelism. We implement NPV on commodity servers and evaluate its performance using real network policy datasets. Our experiments indicate that, compared with the state-of-the-art methods, NPV can achieve up to 139.00x to 651.06x improvement in verification time compared to Kano and Verikube, with 65% less memory usage.

关键词： Memory management Clustering algorithms Quality of service Manuals parallel processing Servers Security

来源：评论

学校读者我要写书评

暂无评论

Optimizing B⁺-Tree Searches on Coupled CPU-GPU architectures 20th

Optimizing B<SUP>+</SUP>-Tree Searches on Coupled CPU-GPU Ar...

引用

20th international conference on algorithms and architectures for parallel processing (ICA3PP)

作者： Huang, Han Luan, Hua Beijing Normal Univ Beijing Peoples R China

ISBN: (纸本)9783030602451;9783030602444

the B+-tree is an important index in the fields of data warehousing and database management systems. With the development of new hardware technologies, the B+-tree needs to be revisited to fully take advantage of hardware resources. In this paper, we focus on optimization techniques to increase the searching performance of B+-trees on the coupled CPU-GPU architecture. First, we propose a hierarchical searching approach on the single coupled GPU to efficiently deal with leaf nodes of B+-trees. It adopts a flexible strategy to determine the number of work items in a work group to search one key in order to reduce irregular memory accesses and divergent branches in the work group. Second, we present a co-processing pipeline method on the coupled architecture. the CPU and the integrated GPU process the sorting and searching tasks simultaneously to hide sorting and partial searching latencies. A distribution model is designed to support the workload balance strategy based on real-time performance. Our performance study shows that the hierarchical searching scheme provides an improvement up to 36% on the GPU compared to the baseline algorithm with fixed number of work items and the co-processing pipeline method further increases the throughput by a factor of 1.8. To the best of our knowledge, this paper is the first study to consider both the CPU and the coupled GPU to optimize B+-trees searches.

关键词： B+-trees the coupled architecture Integrated GPU Co-processing

来源：评论

学校读者我要写书评

暂无评论

Voltage Island-Aware Energy-Efficient Scheduling of parallel Streaming Tasks on Many-Core CPUs 28

Voltage Island-Aware Energy-Efficient Scheduling of Parallel...

引用

28th Euromicro international conference on parallel, Distributed and Network-Based processing (PDP)

作者： Melot, Nicolas Kessler, Christoph Keller, Joerg Linkoping Univ S-58183 Linkoping Sweden Fernuniv D-58084 Hagen Germany

ISBN: (纸本)9781728165820

For multi- and many-core CPUs, dynamic voltage and frequency scaling (DVPS) for individual cores provides an effective way for energy-efficient execution of applications. However, this requires additional hardware within the chip that regulates voltage and frequency for each hardware sub-component that can be scaled separately. Because of the significant cost of this control hardware, it is often not realistic to provide such a regulator for each individual core. Instead, chip manufacturers group cores into islands consisting of multiple cores with a common regulator, and energy optimizing solutions must lake this constraint into account when assigning frequencies 10 jobs and cores. Crown Scheduling is a technique for the combined resource allocation, mapping and discrete DVFS-level selection for actor networks consisting of moldable parallel streaming tasks for energy efficient execution given a throughput constraint. We extend crown scheduling to compute correct schedules also in the presence of DVFS islands constraints. We find that, for most task sets, the crown scheduler computes almost equally good schedules for target architectures with and without island constraints.

关键词： static scheduling energy-efficient execution optimization algorithm

来源：评论

学校读者我要写书评

暂无评论

Studying the Performance of Vector-Based Quicksort Algorithm 13th

Studying the Performance of Vector-Based Quicksort Algorithm

引用

13th international conference on parallel processing and Applied Mathematics, PPAM 2019

作者： Marowka, Ami Parallel Research Lab Jerusalem Israel

ISBN: (纸本)9783030432218

the performance of parallel algorithms is often inconsistent with their preliminary theoretical analyses. Indeed, the difference is increasing between the ability to theoretically predict the performance of a parallel algorithm and the results measured in practice. this is mainly due to the accelerated development of advanced parallel architectures, whereas there is still no agreed model for parallel computation, which has implications for the design of parallel algorithms. In this study, we examined the practical performance of Cormen’s Quicksort parallel algorithm. We determined the performance of the algorithm with different parallel programming approaches and examine the capacity of theoretical performance analyses of the algorithm for predicting the actual performance. © 2020, Springer Nature Switzerland AG.

关键词： Python

来源：评论

学校读者我要写书评

暂无评论

Rise the Momentum: A Method for Reducing the Training Error on Multiple GPUs 19th

Rise the Momentum: A Method for Reducing the Training Error ...

引用

19th international conference on algorithms and architectures for parallel processing (ICA3PP)

作者： Tang, Yu Yin, Lujia Zhang, Zhaoning Li, Dongsheng Natl Univ Def Technol Sci & Technol Parallel & Distributed Lab Changsha Peoples R China

ISBN: (纸本)9783030389611;9783030389604

Deep neural network training is a common issue that is receiving increasing attention in recent years and basically performed on Stochastic Gradient Descent or its variants. Distributed training increases training speed significantly but causes precision loss at the mean time. Increasing batchsize can improve training parallelism in distributed training. However, if the batchsize is too large, it will bring difficulty to training process and introduce more training error. In this paper, we consider controlling the total batchsize and lowering batchsize on each GPU by increasing the number of GPUs in distributed training. We train Resnet50 [4] on CIFAR-10 dataset by different optimizers, such as SGD, Adam and NAG. the experimental results show that large batchsize speeds up convergence to some degree. However, if the batchsize of per GPU is too small, training process fails to converge. Large number of GPUs, which means a small batchsize on each GPU declines the training performance in distributed training. We tried several ways to reduce the training error on multiple GPUs. According to our results, increasing momentum is a well-behaved method in distributed training to improve training performance on condition of multiple GPUs of constant large batchsize.

关键词： Multiple GPUs Batchsize Distributed training Momentum

来源：评论

学校读者我要写书评

暂无评论

SCALPsim, a tool for modeling asynchronous Self-Organizing 3-D NoC architectures 27

SCALPsim, a tool for modeling asynchronous Self-Organizing 3...

引用

27th IEEE international conference on Electronics, Circuits and Systems (IEEE ICECS)

作者： Barrientos, Diego Sousa, Claudio Upegui, Andres Girau, Bernard Univ Appl Sci Western Switzerland Geneva Switzerland Univ Lorraine LORIA UMR 7503 CNRS Vandoeuvre Les Nancy France

ISBN: (纸本)9781728160443

Manycore architectures are mainly composed of a very large amount of computing nodes interconnected with a multiplicity of links usually forming a NoC-like mesh architecture. High-speed links permit to obtain a higher throughput but are much more expensive than normal links, making the interconnection of the system a cost/performance trade-off. Simulating such architectures is very important in order to characterise the optimal network topology for a given problem. In this work we introduce SCALPsim: a simulation framework permitting to evaluate routing algorithms and network properties in 1-D, 2-D and 3-D regular mesh topologies simultaneously using links of different characteristics in terms of latency and throughput. these features are particularly interesting in large scale systems with processing elements grouped into clusters, where communication properties differ largely inside and between clusters. this paper presents the framework and an application based on Cellular Self-Organizing Maps - CSOM.

关键词： Cellular Self-Organising Maps parallel computation multi-FPGA

来源：评论

学校读者我要写书评

暂无评论

APENAS: An Asynchronous parallel Evolution Based Multi-objective Neural Architecture Search 18

APENAS: An Asynchronous Parallel Evolution Based Multi-objec...

引用

18th IEEE Int Symp on parallel and Distributed Proc with Applicat (ISPA) / 10th IEEE Int Conf on Big Data and Cloud Comp (BDCloud) / IEEE Int Symp on Social Comp and Networking (SocialCom) / IEEE Int Conf on Sustainable Comp and Commun (SustainCom)

作者： Hu, Mengtao Liu, Li Wang, Wei Liu, Yao East China Normal Univ Sch Data Sci & Engn Shanghai Peoples R China

ISBN: (纸本)9781665414852

Machine learning is widely used in pattern classification, image processing and speech recognition. Neural architecture search (NAS) could reduce the dependence of human experts on machine learning effectively. Due to the high complexity of NAS, the tradeoff between time consumption and classification accuracy is vital. this paper presents APENAS, an asynchronous parallel evolution based multi-objective neural architecture search, using the classification accuracy and the number of parameters as objectives, encoding the network architectures as individuals. To make full use of computing resource, we propose a multi-generation undifferentiated fusion scheme to achieve asynchronous parallel evolution on multiple GPUs or CPUs, which speeds up the process of NAS. Accordingly, we propose an election pool and a buffer pool for two-layer filtration of individuals. the individuals are sorted in the election pool by non-dominated sorting and filtered in the buffer pool by the roulette algorithm to improve the elitism of the Pareto front. APENAS is evaluated on the CIFAR-10 and CIFAR-100 datasets [25]. the experimental results demonstrate that APENAS achieves 90.05% accuracy on CIFAR-10 with only 0.07 million parameters, which is comparable to state of the art. Especially, APENAS has high parallel scalability, achieving 92.5% parallel efficiency on 64 nodes.

关键词： automated machine learning neural architecture search multi-objective asynchronous parallel evolution

来源：评论

学校读者我要写书评

暂无评论

Reconfigurable HW-SW Co-design Platform for Lung Cancer Detection and Classification

Reconfigurable HW-SW Co-design Platform for Lung Cancer Dete...

引用

international conference on VLSI Design

作者： Ayushparth Sharma Kusum Lata The LNM Institute of Information Technology Jaipur India

High-performance electronics has fueled the rich emergence of medical imaging applications that led to the exponential growth in treatment and diagnostic solutions of various medical problems. High-throughput and Energy-efficient systems are required to enable the development of complex medical imaging applications. this article presents an energy-efficient hardware-software (HW-SW) co-design of a scalable and reconfigurable image segmentation/classification streaming-based processing platform explored at various design abstraction levels. Optimized algorithms and architectural techniques achieve significant savings in energy consumption and operational time. the proposed platform has been implemented on Xilinx Spartan-6 FPGA board and co-simulated with Xilinx system generator, enabled real-time processing of CT scans for pulmonary nodule detection. Optimized pipelining and scheduling have minimized the memory requirements to few kB. parallel architecture has been employed achieving 10× higher energy-efficiency compared to serial counterpart and reduced execution period by 70 ×. Clinical validation shows that parallel architecture introduces 5-7 % error in nodule characteristic determination in comparison to serial one.

关键词： Memory management Lung cancer Very large scale integration Streaming media Energy efficiency Real-time systems parallel architectures

来源：评论

学校读者我要写书评

暂无评论

Riemannian SOS-Polynomial Normalizing Flows 1

引用

42nd German conference on Pattern Recognition, DAGM GCPR 2020 held in parallel with 25th international Symposium on Vision, Modeling, and Visualization, VMV 2020 and 10th Eurographics Workshop on Visual Computing for Biology and Medicine, VCBM 2020

作者： Schwarz, Jonathan Draxler, Felix Köthe, Ullrich Schnörr, Christoph Heidelberg Collaboratory for Image Processing Heidelberg University Heidelberg Germany Image and Pattern Analysis Group Heidelberg University Heidelberg Germany Visual Learning Lab Heidelberg University Heidelberg Germany

ISBN: (数字)9783030712785

ISBN: (纸本)9783030712778

Sum-of-Squares polynomial normalizing flows have been proposed recently, without taking into account the convexity property and the geometry of the corresponding parameter space. We develop two gradient flows based on the geometry of the parameter space of the cone of SOS-polynomials. Few proof-of-concept experiments using non-Gaussian target distributions validate the computational approach and illustrate the expressiveness of SOS-polynomial normalizing flows. © 2021, Springer Nature Switzerland AG.

关键词： Polynomials

来源：评论

学校读者我要写书评

暂无评论

8th international conference on Scale Space and Variational Methods in Computer Vision, SSVM 2021

8th International Conference on Scale Space and Variational ...

引用

8th international conference on Scale Space and Variational Methods in Computer Vision, SSVM 2021

ISBN: (纸本)9783030755485

the proceedings contain 45 papers. the special focus in this conference is on Scale Space and Variational Methods in Computer Vision. the topics include: Multiscale Registration;challenges for Optical Flow Estimates in Elastography;an Anisotropic Selection Scheme for Variational Optical Flow Methods with Order-Adaptive Regularisation;low-Rank Registration of Images Captured Under Unknown, Varying Lighting;towards Efficient Time Stepping for Numerical Shape Correspondence;first Order Locally Orderless Registration;first-Order Geometric Multilevel Optimization for Discrete Tomography;bregman Proximal Gradient algorithms for Deep Matrix Factorization;Hessian Initialization Strategies for -BFGS Solving Non-linear Inverse Problems;inverse Scale Space Iterations for Non-convex Variational Problems Using Functional Lifting;quantisation Scale-Spaces;A Scaled and Adaptive FISTA Algorithm for Signal-Dependent Sparse Image Super-Resolution Problems;Convergence Properties of a Randomized Primal-Dual Algorithm with Applications to parallel MRI;wasserstein Generative Models for Patch-Based Texture Synthesis;sketched Learning for Image Denoising;Translating Numerical Concepts for PDEs into Neural architectures;CLIP: Cheap Lipschitz Training of Neural Networks;variational Models for Signal processing with Graph Neural Networks;synthetic Images as a Regularity Prior for Image Restoration Neural Networks;geometric Deformation on Objects: Unsupervised Image Manipulation via Conjugation;learning Local Regularization for Variational Image Restoration;Equivariant Deep Learning via Morphological and Linear Scale Space PDEs on the Space of Positions and Orientations;on the Correspondence Between Replicator Dynamics and Assignment Flows;learning Linear Assignment Flows for Image Labeling via Exponential Integration;on the Geometric Mechanics of Assignment Flows for Metric Data Labeling;a Deep Image Prior Learning Algorithm for Joint Selective Segmentation and Registration.

关键词：

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共281页 << < 28 29 30 31 32 33 34 35 36 37 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：