In this research, we tackled the complex task of aligning audio and lyrics content automatically. This task entails the precise matching of lyrics with the corresponding audio segments in songs, necessitating the coor...
In light of recent advancements in Internet of Multimedia Things (IoMT) and 5G technology, both the variety and quantity of data have been rapidly increasing. Consequently, handling zero-shot cross-modal retrieval (ZS...
ISBN (print): 9781665473156
For a stream processing system that uses checkpoints as its fault-tolerance method, selecting an appropriate checkpoint period is key to the efficient operation of streaming applications. State-of-the-art stream processing systems currently support only fixed-interval checkpoints, which makes it difficult to strike a good trade-off between fault-tolerance overhead and failure-recovery cost in dynamically changing streaming scenarios. Moreover, in a complex distributed streaming environment, dynamic environmental indicators (e.g., workloads and failure rates) do not match static model assumptions; the dynamics of Twitter's trending-event data, for instance, change quickly. In this paper, we account for the dynamic changes of these environmental indicators and adaptively optimize processing delay and fault-recovery time. We then propose DACM, a method that dynamically adjusts the checkpoint interval via reinforcement learning. DACM adaptively optimizes processing delay and fault-recovery time while avoiding having to model the entire streaming environment. Experiments conducted on the Flink platform show that DACM reduces processing delay by 10% and failure-recovery time by 37% compared with existing checkpoint-interval optimization models.
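As a rough illustration of the reinforcement-learning idea, the following Python sketch tunes a checkpoint interval with tabular Q-learning; the environment hook `measure_cost`, the action set, and all constants are hypothetical stand-ins, not DACM's actual design.

```python
import random
from collections import defaultdict

# Minimal Q-learning sketch of reinforcement-learning-driven checkpoint
# interval tuning in the spirit of DACM. The environment hook
# `measure_cost` is a hypothetical placeholder, not the paper's system.

ACTIONS = (-5.0, 0.0, 5.0)         # shrink / keep / grow the interval (s)
ALPHA, GAMMA, EPS = 0.1, 0.9, 0.1  # learning rate, discount, exploration
Q = defaultdict(float)             # (state, action) -> estimated value

def discretize(load, failure_rate):
    """Bucket continuous metrics into a small discrete state space."""
    return (round(load, 1), round(failure_rate, 2))

def tune_step(interval, load, failure_rate, measure_cost):
    """Pick an interval adjustment, observe its cost, update Q."""
    state = discretize(load, failure_rate)
    action = (random.choice(ACTIONS) if random.random() < EPS
              else max(ACTIONS, key=lambda a: Q[(state, a)]))
    interval = min(300.0, max(1.0, interval + action))
    # measure_cost returns (processing_delay, expected_recovery_time);
    # the reward trades the two objectives off, as DACM does.
    delay, recovery = measure_cost(interval)
    reward = -(delay + recovery)
    # metrics are assumed slow-varying, so the next state is approximated
    # by the current one
    best_next = max(Q[(state, a)] for a in ACTIONS)
    Q[(state, action)] += ALPHA * (reward + GAMMA * best_next
                                   - Q[(state, action)])
    return interval
```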
ISBN (print): 9781665497473
This paper describes the application of code generated by the CAMPARY software to accelerate the solving of linear systems in the least squares sense on Graphics Processing Units (GPUs), in double double, quad double, and octo double precision. The goal is to use accelerators to offset the cost overhead caused by multiple double precision arithmetic. For the blocked Householder QR and the back substitution, of interest are the dimensions at which teraflop performance is attained. The other question of interest is the cost overhead factor incurred each time the precision is doubled. Experimental results are reported on five different NVIDIA GPUs, with a particular focus on the P100 and the V100, both capable of teraflop performance. Thanks to the high Compute to Global Memory Access (CGMA) ratios of multiple double arithmetic, teraflop performance is already attained when running the double double QR on 1,024-by-1,024 matrices, both on the P100 and the V100. For the back substitution, the dimension of the upper triangular system must be as high as 17,920 to reach one teraflop on the V100 in quad double precision, and then only when counting the time spent in the kernels. The lower performance of the back substitution at small dimensions does not prevent teraflop performance of the solver at dimension 1,024, as the time for the QR decomposition dominates. In doubling the precision from double double to quad double and from quad double to octo double, the observed cost overhead factors are lower than the factors predicted by arithmetical operation counts. This observation correlates with the increased performance at increased precision, which is again explained by the high CGMA ratios.
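To make the CGMA argument concrete, here is a minimal Python sketch of the error-free transforms underlying double double addition. CAMPARY itself generates C++/CUDA code, so this only illustrates the arithmetic, assuming IEEE-754 doubles (which Python floats are).

```python
# Error-free transforms behind double double arithmetic. Python floats
# are IEEE-754 doubles, so the sketch is faithful to the arithmetic,
# though CAMPARY's generated GPU code is C++/CUDA, not Python.

def two_sum(a, b):
    """Knuth's TwoSum: returns (s, e) with s = fl(a + b) and
    s + e == a + b exactly (6 floating-point operations)."""
    s = a + b
    v = s - a
    e = (a - (s - v)) + (b - v)
    return s, e

def dd_add(x, y):
    """Add two double-doubles (hi, lo): about 14 flops to produce one
    stored pair, which is why the compute-to-global-memory-access
    (CGMA) ratio grows as the precision is doubled."""
    s, e = two_sum(x[0], y[0])
    e += x[1] + y[1]
    return two_sum(s, e)   # renormalize so hi carries the leading bits

# 1 + 2**-60 does not fit in one double, but fits exactly in two:
print(dd_add((1.0, 0.0), (2.0**-60, 0.0)))  # (1.0, 8.673617379884035e-19)
```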
This work presents a high-speed, high-resolution 1024-input voltage-mode winner-take-all (WTA) circuit suitable for selective-attention-based processing systems. The circuit uses multi-stage WTAs to improve the resolution and s...
ISBN (print): 9781665481069
Apache Spark is a widely used in-memory processing system owing to its high performance. For fast data processing, Spark manages in-memory data, such as cached or shuffle (aggregation and sorting) data, in its own managed memory pools. However, despite this sophisticated memory management scheme, we found that Spark still suffers from out-of-memory (OOM) exceptions and high garbage collection (GC) overhead when wild memory consumers, which are not tracked by Spark and execute external code, use large amounts of memory. To resolve these problems, we propose PokeMem, an enhanced Spark that brings wild memory consumers under management, preventing them from stealthily consuming excessive memory. Our main idea is to open the black box of unmanaged memory regions in external code by providing customized data collections. PokeMem enables fine-grained control of the objects created within running tasks by spilling and reloading the objects of these custom data collections based on memory pressure and access patterns. To further reduce memory pressure, PokeMem exploits pre-built memory estimation models to predict the external code's memory usage and proactively acquires memory before the external code executes; it also monitors JVM heap usage to avoid critical memory pressure. With these techniques, our evaluations show that PokeMem outperforms vanilla Spark with up to 3x faster execution and 3.9x lower GC overhead, and successfully runs workloads, without OOM exceptions, that vanilla Spark fails to run.
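A minimal sketch of the spill-on-pressure collection idea follows, in Python rather than the JVM/Scala setting of the real system; the class name `SpillableList` and the byte budget are invented for illustration.

```python
import pickle, tempfile

class SpillableList:
    """Sketch of a PokeMem-style managed collection: keeps elements in
    memory until a byte budget is hit, then spills the batch to disk and
    reloads it lazily on iteration. The real system works inside the JVM
    with Spark's memory manager; this Python analogue is illustrative."""

    def __init__(self, budget_bytes):
        self.budget = budget_bytes
        self.used = 0
        self.in_mem = []    # objects currently held in memory
        self.spills = []    # paths of spilled batches, oldest first

    def append(self, obj):
        size = len(pickle.dumps(obj))
        if self.used + size > self.budget:
            self._spill()   # react to memory pressure
        self.in_mem.append(obj)
        self.used += size

    def _spill(self):
        """Write the in-memory batch to a temp file and free the memory."""
        with tempfile.NamedTemporaryFile(delete=False) as f:
            pickle.dump(self.in_mem, f)
            self.spills.append(f.name)
        self.in_mem, self.used = [], 0

    def __iter__(self):
        for path in self.spills:          # reload spilled batches lazily
            with open(path, "rb") as f:
                yield from pickle.load(f)
        yield from self.in_mem

xs = SpillableList(budget_bytes=1 << 16)  # ~64 KB budget forces spills
for i in range(100_000):
    xs.append(i)
print(sum(xs))                            # 4999950000
```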
ISBN (print): 9798350326598; 9798350326581
Recent commercial incarnations of processing-in-memory (PIM) maintain the standard DRAM interface and employ all-bank-mode execution to maximize bank-level memory bandwidth. Such synchronized all-bank PIM control can effectively manage conventional dense matrix-vector operations on matrices distributed evenly across banks with lock-step execution. Sparse matrix processing is another critical computation that can benefit significantly from the PIM architecture, but current all-bank PIM control cannot support the diverging execution caused by random sparsity. To accelerate such sparse matrix applications, this paper proposes partially synchronous execution for sparse matrix-vector multiplication (SpMV) and sparse triangular solve (SpTRSV), filling the gap between the practical constraints of PIM and the irregular nature of sparse computation. It allows the processing unit of each bank to diverge in a limited way to handle the irregular execution paths of sparse matrix computation, and it proposes compaction and distribution policies for the input matrix and vector. Beyond SpMV, the paper identifies SpTRSV as another key kernel and proposes its acceleration with PIM technology. The experimental evaluation shows that the new sparse PIM architecture outperforms an NVIDIA GeForce RTX 3080 GPU by 4.43x for SpMV and 3.53x for SpTRSV with a similar amount of DRAM bandwidth.
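The sketch below models why lock-step all-bank control wastes cycles on sparse inputs: rows are interleaved across banks and each nonzero costs one multiply-accumulate, so per-bank work diverges with the sparsity pattern. The bank count is illustrative, not the evaluated hardware, and scipy is assumed available.

```python
import numpy as np
from scipy.sparse import csr_matrix, random as sprand

# Toy model of bank-partitioned SpMV on a PIM device. Rows are interleaved
# round-robin across banks and each nonzero costs one multiply-accumulate
# "cycle" in a bank's processing unit, so the per-bank cycle counts expose
# the irregular work that forces lock-step all-bank control to wait for
# the slowest bank. N_BANKS is illustrative, not the evaluated hardware.

N_BANKS = 16

def banked_spmv(A, x):
    y = np.zeros(A.shape[0])
    cycles = np.zeros(N_BANKS, dtype=int)
    for i in range(A.shape[0]):
        bank = i % N_BANKS                     # round-robin row placement
        lo, hi = A.indptr[i], A.indptr[i + 1]
        y[i] = A.data[lo:hi] @ x[A.indices[lo:hi]]
        cycles[bank] += hi - lo                # one MAC per nonzero
    return y, cycles

A = csr_matrix(sprand(256, 256, density=0.03, random_state=0))
x = np.ones(256)
y, cycles = banked_spmv(A, x)
assert np.allclose(y, A @ x)
# Lock-step all-bank control pays for the slowest bank every round;
# limited divergence lets faster banks proceed toward the balanced mean.
print("slowest bank:", cycles.max(), "cycles; balanced ideal:", cycles.mean())
```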
ISBN (print): 9798350364613; 9798350364606
A totally asynchronous gradient algorithm with a fixed step size is proposed for federated learning. A mathematical model is presented and a convergence result is established. The convergence result is based on the concept of a macro-iteration sequence. The interest of the contribution is in showing that the asynchronous federated learning method converges even when gradients of the loss functions are updated by workers without ordering or synchronization and with possibly unbounded delays.
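The following Python simulation sketches that setting: workers return least-squares gradients computed at stale parameter snapshots, applied in arbitrary order with a fixed step size. The delay model, problem sizes, and step size are made up for illustration and are not the paper's construction.

```python
import random
import numpy as np

# Simulation sketch of totally asynchronous federated gradient descent on
# a least-squares objective. Workers read a (possibly stale) copy of the
# parameters, and their gradients are applied out of order with a fixed
# step size; sizes, delays, and the step are invented for illustration.

random.seed(0)
rng = np.random.default_rng(0)
A = rng.normal(size=(100, 5))
b = rng.normal(size=100)
shards = np.array_split(np.arange(100), 4)   # one data shard per worker
STEP = 0.001

x = np.zeros(5)
pending = []   # (worker, stale snapshot of x): in-flight computations

for t in range(40000):
    if pending and random.random() < 0.55:
        # a gradient arrives, computed at a stale iterate, out of order
        w, x_stale = pending.pop(random.randrange(len(pending)))
        Aw, bw = A[shards[w]], b[shards[w]]
        x -= STEP * Aw.T @ (Aw @ x_stale - bw)   # fixed step, no locking
    else:
        w = random.randrange(4)
        pending.append((w, x.copy()))            # worker reads current x

x_star = np.linalg.lstsq(A, b, rcond=None)[0]
print("async residual  :", np.linalg.norm(A @ x - b))
print("optimal residual:", np.linalg.norm(A @ x_star - b))
```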
ISBN (print): 9798350341355
Soft errors caused by terrestrial neutrons pose a threat to the reliability of safety-critical systems, such as self-driving applications. These applications, often built around neural networks, rely on graphics processing units (GPUs) because of their need for massive parallel computation. While neural networks inherently include redundant computation and possess a certain level of error tolerance, detectable unrecoverable errors (DUEs) can be more detrimental than silent data corruption (SDC), as they can result in temporary service unavailability. This study focuses on illegal memory access, a primary cause of DUEs, and proposes a programming method that can detect illegal addresses. In the single instruction, multiple threads (SIMT) scheme, data addresses are computed regularly from the thread ID, and this regularity is exploited to identify illegal addresses through inter-thread communication. To evaluate the effectiveness of the proposed method, fault injection campaigns were conducted on matrix multiplication, vector addition, and transposition. The experimental results show that the proposed method reduces the DUE rate by 17.3%, 86.8%, and 87.1% for these respective operations.
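A small Python stand-in for one warp illustrates the regularity check (on a real GPU the address exchange would use warp shuffles); the fault model, a single flipped address bit, and all constants are chosen for illustration only.

```python
# One "warp" of 32 lanes computing addresses as base + tid * stride, the
# regular SIMT pattern the method relies on. Exchanging addresses between
# neighboring lanes (a warp shuffle on real hardware) exposes a corrupted
# address before it is dereferenced.

WARP, STRIDE, BASE = 32, 8, 0x10000

def lane_addresses(faulty_lane=None, flipped_bit=20):
    addrs = [BASE + t * STRIDE for t in range(WARP)]
    if faulty_lane is not None:
        addrs[faulty_lane] ^= 1 << flipped_bit   # inject a soft error
    return addrs

def detect_illegal(addrs):
    """Each lane compares its address with its right neighbor's; any
    pair whose difference is not exactly STRIDE flags a fault."""
    return [t for t in range(WARP - 1)
            if addrs[t + 1] - addrs[t] != STRIDE]

assert detect_illegal(lane_addresses()) == []          # clean execution
print(detect_illegal(lane_addresses(faulty_lane=5)))   # -> [4, 5]
```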
With the application of blockchain light nodes in embedded devices, how to alleviate the computing pressure that complex operations such as SPV verification of transactions place on the CPUs of embedded devices, and improve ...
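Although the abstract is truncated, SPV verification itself reduces to checking a Merkle branch against the block-header root; the hedged sketch below shows that core step in Python with Bitcoin-style double SHA-256, independent of whatever offloading scheme the paper proposes.

```python
import hashlib

# SPV verification, the operation light nodes must perform, boils down
# to hashing a transaction up the Merkle tree with the supplied sibling
# hashes and comparing against the root in the block header. This is a
# generic sketch, not the paper's offloading design.

def dhash(data: bytes) -> bytes:
    return hashlib.sha256(hashlib.sha256(data).digest()).digest()

def verify_spv(tx_hash, branch, index, merkle_root):
    """branch: sibling hashes bottom-up; index: tx position in block."""
    h = tx_hash
    for sibling in branch:
        if index & 1:               # we are the right child
            h = dhash(sibling + h)
        else:                       # we are the left child
            h = dhash(h + sibling)
        index >>= 1
    return h == merkle_root

# toy block with four transactions
txs = [dhash(bytes([i])) for i in range(4)]
l01, l23 = dhash(txs[0] + txs[1]), dhash(txs[2] + txs[3])
root = dhash(l01 + l23)
assert verify_spv(txs[2], [txs[3], l01], index=2, merkle_root=root)
```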