检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

2,883 篇 会议
64 册 图书
45 篇 期刊文献

馆藏范围

2,991 篇 电子文献
1 种 纸本馆藏

日期分布

学科分类号

2,089 篇 工学
- 1,867 篇 计算机科学与技术...
- 969 篇 软件工程
- 351 篇 电气工程
- 271 篇 信息与通信工程
- 267 篇 电子科学与技术（可...
- 109 篇 控制科学与工程
- 76 篇 机械工程
- 63 篇 生物工程
- 50 篇 仪器科学与技术
- 48 篇 生物医学工程（可授...
- 41 篇 动力工程及工程热...
- 37 篇 光学工程
- 33 篇 建筑学
- 30 篇 材料科学与工程（可...
- 30 篇 土木工程
- 25 篇 化学工程与技术
- 25 篇 交通运输工程
- 24 篇 网络空间安全
- 23 篇 安全科学与工程
601 篇 理学
- 397 篇 数学
- 115 篇 物理学
- 68 篇 生物学
- 62 篇 系统科学
- 41 篇 化学
- 32 篇 统计学（可授理学、...
239 篇 管理学
- 160 篇 管理科学与工程(可...
- 101 篇 图书情报与档案管...
- 72 篇 工商管理
55 篇 医学
- 48 篇 临床医学
25 篇 经济学
- 25 篇 应用经济学
21 篇 法学
15 篇 文学
14 篇 农学
4 篇 军事学
3 篇 教育学
1 篇 艺术学

主题

366 篇 parallel process...
190 篇 graphics process...
173 篇 computer archite...
135 篇 parallel archite...
121 篇 graphics process...
113 篇 hardware
106 篇 parallel algorit...
104 篇 parallel process...
85 篇 computational mo...
79 篇 instruction sets
78 篇 image processing
76 篇 signal processin...
70 篇 multicore proces...
69 篇 parallel program...
69 篇 field programmab...
64 篇 concurrent compu...
63 篇 gpu
62 篇 algorithm design...
62 篇 kernel
60 篇 optimization

机构

9 篇 natl univ def te...
6 篇 hosei univ dept ...
6 篇 school of comput...
6 篇 inria rennes
6 篇 national laborat...
5 篇 college of compu...
5 篇 univ aizu dept c...
5 篇 college of compu...
5 篇 karlsruhe instit...
5 篇 city university ...
5 篇 st francis xavie...
4 篇 queens univ belf...
4 篇 nanyang technol ...
4 篇 chinese acad sci...
4 篇 univ chinese aca...
4 篇 hainan internati...
4 篇 department of co...
4 篇 universidad carl...
4 篇 sun yat-sen univ...
4 篇 institute of com...

作者

11 篇 jack dongarra
8 篇 roman wyrzykowsk...
8 篇 quintana-orti en...
7 篇 hannig frank
7 篇 teich juergen
7 篇 nakano koji
7 篇 konrad karczewsk...
6 篇 ito yasuaki
6 篇 liu jie
6 篇 carretero jesus
6 篇 peng shietung
6 篇 li yamin
6 篇 chu wanming
6 篇 wang gang
5 篇 dongarra jack
5 篇 wanlei zhou
5 篇 qian depei
5 篇 namyst raymond
5 篇 ewa deelman
5 篇 dolz manuel f.

语言

2,903 篇 英文
78 篇 其他
16 篇 中文
2 篇 俄文

检索条件"任意字段=7th International Conference on Algorithms and Architectures for Parallel Processing"

共 2992 条记录，以下是131-140 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

AshPipe: Asynchronous Hybrid Pipeline parallel for DNN Training 24

AshPipe: Asynchronous Hybrid Pipeline Parallel for DNN Train...

引用

7th international conference on High Performance Computing in Asia-Pacific Region (HPC Asia)

作者： Hosoki, Ryubu Endo, Toshio Hirofuchi, Takahiro Ikegami, Tsutomu Tokyo Inst Technol Yokohama Kanagawa Japan Natl Inst Adv Ind Sci & Technol Tokyo Japan

ISBN: (纸本)9798400708893

Deep Neural Networks (DNNs) have become increasingly computationally intensive and have larger parameters, requiring efficient parallelization or distribution using multiple accelerators. Pipeline parallelism has been proposed as an effective way to distribute models and improve hardware utilization. However, the problem with pipeline parallelism is the trade-off between speedup and accuracy: synchronous approaches do not provide sufficient speedup, while asynchronous approaches suffer from accuracy degradation due to a different scheme from a single worker. In this paper, we propose AshPipe, a hybrid parallel framework that combines data parallelism and asynchronous pipeline parallelism to achieve efficient speedup for training. the proposed runtime uses the 1F1B schedule and data parallelism, with non-uniform numbers of workers and identical global batch sizes across stages. A Switch parallelism (SP) mechanism is also proposed as an option to mitigate accuracy degradation, which switches over from data parallelism to hybrid parallelism in the course of training. Experimental results show that AshPipe achieves 1.844x the throughput of data parallelism for ViT-H/14 whose parameter size is 632M. With the SP mechanism, AshPipe achieved a 30.2% reduction in training time with comparable accuracy compared to data parallelism when training on the CIFAR100 dataset.

关键词： Distributed Deep Learning Data parallelism Pipeline parallelism Hybrid parallelism

来源：评论

学校读者我要写书评

暂无评论

Generalising Whole Page Handwritten Text Recognition With parallel Scheduled Sampling 7

Generalising Whole Page Handwritten Text Recognition With Pa...

引用

7th international conference on Image Information processing, ICIIP 2023

作者： Vivek, H.N. Mehrotra, Kapil Shah, Ronak Mehta, Swati Pune India

ISBN: (纸本)9798350371406

the Encoder-Decoder architecture is often used for the seq-to-seq conversion problems in natural language processing. the Encoder is used for feature extraction and the Decoder to generate the sequence. this paper addresses a similar Encoder-Decoder architecture for the image-to-text conversion for offline Handwritten Text (whole paragraphs as input without prior segmentation). the resnet-based encoder and transformer-based decoder inspire the model. It was observed that the auto-regressive nature of transformers coupled with teacher forcing, produced good results in training but failed badly while inferencing on unseen data. So, we propose parallel Scheduled Sampling and teacher forcing to generalize the model for better inferencing. this makes the model more generalized as it replaces the ground truth labels used in teacher forcing with the model-predicted labels with a random probability. thus, making the model less dependent on teacher forcing as the training proceeds. We have extended the training dataset by augmenting 6000 lines & 12000 paragraphs to cover handwritten data variations. the evaluation was done on the standard RWth Aachen data splits(Test & Val) for IAM dataset. © 2023 IEEE.

关键词： Personnel training

来源：评论

学校读者我要写书评

暂无评论

Malaria Cell Detection Using Deep Learning architectures 7th

Malaria Cell Detection Using Deep Learning Architectures

引用

7th international conference on Intelligent Computing and Optimization, ICO 2023

作者： Ahmed, Mahade Chowdhury, Anindya Johora, Fatema Tuj Haque, Md. Inzamamul Jennifer, Sanjeda Sara Shamim, Mahbub Hasan Reza, Ahmed Wasif Department of Computer Science and Engineering East West University Dhaka1212 Bangladesh

ISBN: (纸本)9783031733239

Malaria is a common and life-threatening disease in our country, with high-risk areas in several villages and hill tracts. Current detection methods are time-consuming and inaccessible. Our system analyzes digital images of blood cells to identify signs of malaria infection. By utilizing image processing techniques and the Deep Learning method, the system accurately detects malaria-infected cells. the project involves collecting a labeled dataset of infected and uninfected blood cell images, applying preprocessing techniques, extracting relevant features, training the machine learning model, and evaluating its performance. Early and automated malaria diagnosis can improve healthcare outcomes. In our machine learning model, we get the best accuracy from the EfficientNetV2L model, which is 97%. © the Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

关键词： Adversarial machine learning

来源：评论

学校读者我要写书评

暂无评论

the Design of Hardware processing System Platform for Directional Sound Sources 7

The Design of Hardware Processing System Platform for Direct...

引用

7th international conference on Intelligent Computing and Signal processing, ICSP 2022

作者： Ding, Dandan Cheng, Long Xu, Yongchun Guan, Jinyu Bu, Fanliang People's Public Security University of China School of Information Network Security Beijing100045 China Hebei North University Hebei075132 China

ISBN: (纸本)9781665478571

the DSP pipeline serial structure restricts the performance of the system, while the FPGA parallel processing data high frequency sampling data to meet the design needs. Firstly, this paper introduces the FPGA processing chip and development process, designs the power amplifier circuit and does a detailed analysis. Secondly, the WM8731 audio processing chip and its workflow are introduced, and the design process of the directional sound source system is described. then, the hardware platform and software program are integrated together to build the system test platform, and the generated test files are downloaded to the board, and the test data are recorded one by one at different angles within a certain distance after power-on. the test results are plotted on the directivity line graph, and the results show that the sound has strong directivity and high sound pressure intensity. © 2022 IEEE.

关键词： Ultrasonic transducers

来源：评论

学校读者我要写书评

暂无评论

Dynamic Privacy Protection with Large Language Model in Social Networks 24th

Dynamic Privacy Protection with Large Language Model in So...

引用

24th international conference on algorithms and architectures for parallel processing, ICA3PP 2024

作者： Xie, Yizhe Zhu, Congcong Zhang, Xinyue Hu, Xiangyu Liu, Xuan School of Data Science City University of Macau Macau China Hainan International College Minzu University of China Hainan China School of Information Engineering Minzu University of China Beijing China School of Information and Software Engineering University of Electronic Science and Technology of China Sichuan China

ISBN: (纸本)9789819615445

In contemporary social networks, dynamic privacy protection remains a pivotal yet challenging endeavor due to the intricate and evolving nature of information exchange. Traditional privacy models, predominantly static, falter in effectively safeguarding privacy amidst the complex interplay of continuously changing network interactions and structures. Addressing these deficiencies, this study introduces a novel dynamic privacy protection system anchored by large language model (LLM). Leveraging the natural language processing prowess of LLM, this system excels in real-time, context-sensitive analysis and protection of textual data within vast and variable social networks. By integrating closed-loop control theory, the system adeptly balances robust privacy safeguards with the requisite fluidity of information exchange. Experimental validations on large network datasets illustrate the system’s adeptness in balancing privacy leaks and information distortion through intelligent adaptations to privacy thresholds and strategic noise injection. the outcomes highlight the system’s utility in enhancing data security and operational efficiency, promising significant implications for future applications in broader domains such as mobile computing and IoT. this study not only propels forward the capabilities of dynamic privacy protection mechanisms but also sets a foundational architecture for subsequent innovations in privacy-sensitive, data-intensive environments. © the Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： Differential privacy

来源：评论

学校读者我要写书评

暂无评论

Dynamic Offloading Control for Waste Sorting Based on Deep Q-Network 24th

Dynamic Offloading Control for Waste Sorting Based on Deep...

引用

24th international conference on algorithms and architectures for parallel processing, ICA3PP 2024

作者： Wang, Jing Wang, Xiaoyang Guo, Jianxiong Tang, Zhiqing Ding, Xingjian Wang, Tian Advanced Institute of Natural Sciences Beijing Normal University Zhuhai China Guangdong Key Lab of AI and Multi-modal Data Processing Department of Computer Science BNU-HKBU United International College Zhuhai China Faculty of Information Technology Beijing University of Technology Beijing China

ISBN: (纸本)9789819615278

With the increasing concern for environmental protection and resource optimization, efficient waste sorting has become a serious challenge today. In this paper, we propose a new offloading control problem that aims to solve waste sorting in wireless bin communication networks. Due to limited computational power, bins belonging to embedded devices rely on simple classification models with varying accuracy. In this scenario, consider a network of intelligent bins, each acting as an independent agent capable of deciding to offload an image to the edge server with a more accurate but resource-intensive model when the local classification is deemed inaccurate. thus, Our goal is to find a lightweight online offloading policy that can achieve the best possible sorting accuracy while balancing transmission traffic. the method utilizes a Deep Q-network algorithm that enables each intelligent bin to make image-processing decisions autonomously. In the experiment, we validate the effectiveness of improving the performance of waste classification compared with existing flow control methods. © the Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： Distributed Resource Allocation Intelligent Waste Management Waste Sorting Wireless Communication Network

来源：评论

学校读者我要写书评

暂无评论

Performance Evaluation of parallel processing Adder Against Basic Adders on FPGAs 5th

Performance Evaluation of Parallel Processing Adder Against ...

引用

5th international conference on Computing Science, Communication and Security

作者： Fichadia, Dhaval Purohit, Kishor Soni, Bhavesh Einfochips Ltd Ahmadabad India Ganpat Univ Mehsana India

ISBN: (纸本)9783031751691;9783031751707

Adders are essential components of modern digital circuits, and their primary design goal is to achieve high speed. However, power consumption and chip area are also important considerations in modern circuit design. Optimizing digital adder performance plays a crucial role in enhancing the speed of binary operations within complex circuits. Various architectures address the carry propagation bottleneck, each with its own strengths and weaknesses. Choosing the most appropriate architecture depends on the specific application requirements, ensuring optimal performance within the available resource constraints. this paper provides a comprehensive analysis of various adder topologies and their performance characteristics. By carefully considering the trade-offs between delay, power consumption, and area, engineers can choose the optimal architecture for their specific application requirements, leading to significant improvements in digital system performance and efficiency. the analyzed adder topologies include Ripple Carry Adder (RCA), Carry Lookahead Adder (CLA), Carry Skip Adder (CSK), Carry Select Adder (CSLA), Carry Increment Adder (CIA), Brent kung adder (BKA), Kong stone adder. the analysis is conducted using HDL on the Xilinx ISE 14.7 platform.

关键词： Finite impulse response (FIR) Ripple carry adder (RCA) Carry Look Ahead Adder (CLA) Carry Select Adder (CSLA) Carry Increment Adder (CIA) Carry Skip Adder Kogge Stone Adder (CSKa) Arithmetic-logic unit (ALU) parallel prefix adder (PPA)

来源：评论

学校读者我要写书评

暂无评论

Cross-Modal Mask and Detail Alignment for Text-Based Person Retrieval 24th

Cross-Modal Mask and Detail Alignment for Text-Based Perso...

引用

24th international conference on algorithms and architectures for parallel processing, ICA3PP 2024

作者： Guo, Ao Liu, Xuan Liu, Xianggan Hu, Bingmeng Yuan, Jie Chu, Chiawei Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance Ministry of Education Minzu University of China Beijing100081 China Hainan International College of Minzu University of China Li’an International Education Innovation pilot Zone Hainan Lingshui Li572499 China Natural Language Processing and Knowledge Graph Lab School of Computer Science and Technology Huazhong University of Science and Technology Wuhan430074 China City University of Macau China

ISBN: (纸本)9789819615278

Text-based person retrieval primarily aims to retrieve the images of target persons represented by a given text query. In this task, how to effectively align images and text globally and locally is an important challenge. At the same time, since multiple modalities are involved, reducing the differences between different modalities is also an important challenge. Existing work has focused more on reconstructing an image rather than the semantic consistency between the image and text modalities. therefore, we introduce the Cross-Modal Mask and Detail Alignment (CMDA) framework to address these challenges. Under this framework, our proposed Cross-Modal Mask Alignment module (CMA) semantically aligns features generated by supplementing randomly masked image/text with another modality text/image. Additionally, to narrow the gap between the image and text modalities, we designed the Cross-Modal Detail Alignment module (CDA), which establishes connections between images and texts and facilitates interactions between these two different modalities. Experimental results show that our model exhibits outstanding performance across multiple public datasets, i.e., CUHK-PEDES, ICFG-PEDES, and RSTPReID. © the Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

GPU parallelization and Optimization of a Combustion Simulation Application 24

GPU Parallelization and Optimization of a Combustion Simulat...

引用

24th IEEE international conference on High Performance Computing and Communications, 8th IEEE international conference on Data Science and Systems, 20th IEEE international conference on Smart City and 8th IEEE international conference on Dependability in Sensor, Cloud and Big Data Systems and Application, HPCC/DSS/SmartCity/DependSys 2022

作者： Liao, Zhixiang Liu, Yongzhou Che, Yonggang Institute for Quantum Information College of Computer National University of Defense Technology State Key Laboratory of High Performance Computing Changsha China

ISBN: (纸本)9798350319934

Graphics processing units (GPUs) are widely used in the area of scientific computing. While GPUs provide much higher peak performance, efficient implementation of real applications on the GPU architectures is still a non-trivial task. It is crucial to realize efficient solution algorithms that can better utilize GPU architectures. this paper presents our efforts in parallelizing and optimizing LESAP, a CFD application for scramjet combustion simulation, on NVIDIA GPUs. the GPU parallelization is realized based on the CUDA programming model, with a data-parallel implicit time-marching method that is efficient on the GPU architecture. Furthermore, shared memory and redundant calculation are proposed to reduce memory access overhead during GPU computation, and data transfer between CPU and GPU is optimized by packing the data to be transferred. the experimental results show that the GPU version, when runs on four V100 GPUs, achieves a speedup of 11.26 times compared to the CPU version that runs on two 24-core Intel Skylake Gold 6240R CPUs. Excellent parallel scalability across multiple GPUs is also observed. © 2022 IEEE.

关键词： Graphics processing unit

来源：评论

学校读者我要写书评

暂无评论

parallel Approaches in Deep Learning: Use parallel Computing 7

Parallel Approaches in Deep Learning: Use Parallel Computing

引用

7th international conference on Future Networks and Distributed Systems, ICFNDS 2023

作者： Rakhimov, Mekhriddin Javliev, Shakhzod Nasimov, Rashid Department Of Computer Systems Tashkent University Of Information Technologies Named After Muhammad Al-Khwarizmi Uzbekistan Department Of Artificial Intelligence Tashkent University Of Information Technologies Named After Muhammad Al-Khwarizmi Uzbekistan Department Of Artificial Intelligence Tashkent State University Of Economics Uzbekistan

ISBN: (纸本)9798400709036

In the present context, the rise of artificial intelligence (AI) has brought to light the importance of expediting processes due to the advancement in AI. this issue holds significance across various domains of machine learning. Consequently, all sectors linked to deep learning, a crucial facet of artificial intelligence, are experiencing continuous advancements. To illustrate, tasks associated with training such as the multiplication of extensive matrices or the manipulation of images to extricate vital features can lead to an increase in time consumption. It is common knowledge that dealing with substantial data quantities demands a considerable amount of time. the primary focus of this research revolves around significantly enhancing the time efficiency of deep learning procedures. While it is a recognized fact that graphics processing units (GPUs) deliver notably quicker outcomes for specific data handling tasks in comparison to a computer's central processing unit (CPU), this study delves into heterogeneous computing systems in cases where GPUs are inaccessible. Herein, we investigate strategies for attaining elevated processing speed through the utilization of advanced technologies. Ultimately, this study culminates by presenting comparative findings derived from di-verse approaches, accompanied by crucial recommendations for future endeavors. © 2023 ACM.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共300页 << < 10 11 12 13 14 15 16 17 18 19 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：