检索结果-内蒙古大学图书馆

28th International Conference on Parallel Architectures and Compilation Techniques (PACT)

作者： Hetherington, Tayler Hicklin Lubeznov, Maria Shah, Deval Aamodt, Tor M. Univ British Columbia Elect & Comp Engn Vancouver BC Canada

ISBN: (纸本)9781728136134

GPUs are known to benefit structured applications with ample parallelism, such as deep learning in a datacenter. Recently, GPUs have shown promise for irregular streaming network tasks. However, the GPU's co-processor dependence on a CPU for task management, inefficiencies with fine-grained tasks, and limited multiprogramming capabilities introduce challenges with efficiently supporting latency-sensitive streaming tasks. This paper proposes an event-driven GPU execution model, EDGE, that enables non-CPU devices to directly launch pre-configured tasks on a GPU without CPU interaction. Along with freeing up the CPU to work on other tasks, we estimate that EDGE can reduce the kernel launch latency by 4.4x compared to the baseline CPU-launched approach. This paper also proposes a warp-level preemption mechanism to further reduce the end-to-end latency of fine-grained tasks in a shared GPU environment. We evaluate multiple optimizations that reduce the average warp preemption latency by 35.9x over waiting for a preempted warp to naturally flush the pipeline. When compared to waiting for the first available resources, we find that warp-level preemption reduces the average and tail warp scheduling latencies by 2.6x and 2.9x, respectively, and improves the average normalized turnaround time by 1.4x.

关键词： GPU multiprogramming Networking

来源：评论

学校读者我要写书评

暂无评论

Hankel-based Unsupervised Anomaly Detection

Hankel-based Unsupervised Anomaly Detection

引用

American Control Conference

作者： Korkut Bekiroglu Ali Tekeoglu Bruno Andriamanalimanana Saumendra Sengupta Chen-Fu Chiang Jorge Novillo Electrical Engineering Department SUNY Polytechnic Institute Canadian Institute for Cybersecurity University of New Brunswick Computer Science Department SUNY Polytechnic Institute

ISBN: (数字)9781538682661

ISBN: (纸本)9781538682678

Embedding of data into a normed vector space or linear manifold constitutes a fundamental approach in machine learning. A generalization is embedding into a metric space, where the distance is not induced by a norm. This paper explores the embedding of a time series into a topological metric space of Hankel matrices. The rank metric, along with a windowing scheme, is used to design a score and a detection method, for the purposes of anomaly identification. Assuming that the non-anomalous behavior can be represented as a linear combination of a finite number of frequencies, the rank metric can be used to measure the number of frequency changes in realtime to detect the anomalies. Accordingly, the Hankel matrix rank is used as a metric to develop a Hankel-based unsupervised Anomaly Detection (HAD) algorithm. Extensive experiments are conducted to test the proposed method on the Numenta anomaly benchmark dataset, as well as artificially generated random time-series data. Results show that the proposed HAD method is promising with respect to anomaly detection precision and computational performance.

关键词： anomaly outlier Hankel matrix windowing anomaly score Numenta Hankel matrices anomaly detection multiprogramming outliers anomaly embedding metric space inspection methods computational performance

来源：评论

学校读者我要写书评

暂无评论

Sorted round robin algorithm 3

Sorted round robin algorithm

引用

3rd International Conference on Trends in Electronics and Informatics, ICOEI 2019

作者： Srujana, R. Mohana Roopa, Y. Datta Sai Krishna Mohan, M. Institute of Aeronautical Engineering Hyderabad India

ISBN: (纸本)9781538694398

Process scheduling is an important and necessary task of a multiprogramming operating system where the process manager handles the selection and removal of processes based on a strategy. One such strategy is the Round Robin algorithm, where each process is given a time quantum for its execution. Our algorithm is a combined product of the shortest job first (SJF) algorithm and Round Robin (RR) algorithm. It retains the advantage provided by these algorithms that may have an impact on the overall performance of the CPU and hence, is used to overcome the drawbacks in the RR algorithm by developing the strategies in use. Also, a detailed analysis is performed to compare the proposed algorithm and the existing algorithm in terms of performance and output. ©2019 IEEE.

关键词： multiprogramming

来源：评论

学校读者我要写书评

暂无评论

TLB Shootdown Mitigation for Low-Power Many-Core Servers with L1 Virtual Caches

IEEE COMPUTER ARCHITECTURE LETTERS

引用

IEEE COMPUTER ARCHITECTURE LETTERS 2018年第1期17卷 17-20页

作者： Binh Pham Hower, Derek Bhattacharjee, Abhishek Cain, Trey Rutgers State Univ Dept Comp Sci Piscataway NJ 08854 USA Qualcomm Technol Inc Piscataway NJ 08854 USA Qualcomm Datactr Technol Inc Piscataway NJ 08854 USA

Power efficiency has become one of the most important design constraints for high-performance systems. In this paper, we revisit the design of low-power virtually-addressed caches. While virtually-addressed caches enable significant power savings by obviating the need for Translation Lookaside Buffer (TLB) lookups, they suffer from several challenging design issues that curtail their widespread commercial adoption. We focus on one of these challenges-cache flushes due to virtual page remappings. We use detailed studies on an ARM many-core server to show that this problem degrades performance by up to 25 percent for a mix of multi-programmed and multi-threaded workloads. Interestingly, we observe that many of these flushes are spurious, and caused by an indiscriminate invalidation broadcast on ARM architecture. In response, we propose a low-overhead and readily implementable hardware mechanism using bloom filters to reduce spurious invalidations and mitigate their ill effects.

关键词： Virtual Cache virtual memory TLB multicores multiprogramming multithreading

来源：评论

学校读者我要写书评

暂无评论

Subsegmental level analysis of high arousal speech using the zero-time windowing method

引用

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019年第1期145卷 551-561页

作者： Gangamohan, P. Gangashetty, Suryakanth V. Yegnanarayana, B. IIIT H Speech Proc Lab Hyderabad Telangana India

Speech produced by a speaker in emotionally charged situations, such as anger, happiness, and shout corresponds to high arousal speech. Changes in the production characteristics such as increase in the subglottal air pressure, increase in the glottal closed phase in each cycle, and increase in the rate of glottal vibration are observed in the high arousal speech. Acoustic parameters such as glottal closed quotient and fundamental frequency ( F-0) are used to characterize the high arousal speech. In this paper, high arousal is characterized by features extracted using the zero-time windowing (ZTW) method. The spectrum derived from the ZTW method emphasizes the instantaneous spectral characteristics in the speech signal. In the glottal open region, changes are clearly observed in the lower frequency range of the spectrum. Distinctive spectral features are observed during the glottal open region in the case of high arousal speech, when compared to neutral speech. These features are used to develop a method for identification of high arousal speech. Simple and maybe somewhat ad hoc rules, based on these features seem to give good performance in the identification of high arousal speech, even without using neutral speech as reference. (C) 2019 Acoustical Society of America.

关键词： Arousal multiprogramming frequency ranges Speech time zero Voice Signal

来源：评论

学校读者我要写书评

暂无评论

Software Structures: A Careful Look

引用

IEEE SOFTWARE 2018年第6期35卷 68-71页

作者： Parnas, David Lorge McMaster Univ Hamilton ON Canada Univ Limerick Limerick Ireland Middle Rd Software Ottawa ON Canada

In the half century since Edsger Dijkstra published “The Structure of the ‘THE’-multiprogramming System,” it has become clear that the ability to design a software system’s structure is at least as important as the ability to design efficient algorithms or write code in a particular programming language. Although the word “structure” appeared in the paper’s title and was used seven more times, Dijkstra never defined the term. Closer examination revealed that he was discussing at least three distinct structures. His failure to define “structure,” or to clearly distinguish the structures that were important in his software, has led many to confuse those structures. This article aims to clarify what those structures are, their differences, and each one’s importance.

关键词： multiprogramming Programming Languages Software Structures Edsger Dijkstra THE multiprogramming System Software System Particular Programming Language Word Structure Closer Examination Distinct Structures Software Engineering Operating Systems Software Development Codes Module Program Component Process Software Structures Uses Part Of Gives Work To Edsger Dijkstra THE Operating System Software Engineering Software Development Reliable Code

来源：评论

学校读者我要写书评

暂无评论

Study on Natural Smoke Exhaust Characteristics of Railway Passenger Station under the Influence of Environmental Wind

Study on Natural Smoke Exhaust Characteristics of Railway Pa...

引用

Fire Science and Fire Protection Engineering (ICFSFPE), Conference on

作者： Jun Deng Weichao Geng Gaowen Liu Furu Kang Jiajia Song College of Safety Science and Engineering Xi'an University of Science and Technology Xi'an China Shanghai Fire Research Institute Ministry of Emergency Management Shanghai China

ISBN: (数字)9781728153223

ISBN: (纸本)9781728153230

In order to study the influence of environmental wind on natural smoke exhaust characteristics, a railway passenger station is selected to study the smoke exhaust effect under windowing modes M1 which to open the windward side window and M2 which to open the leeward side window with the wind speeds of 0 m/s, 3.4 m/s and 10 m/s by FDS software. The laws of smoke movement, temperature field changes and the impact on personnel safety evacuation are compared and analyzed in the station with different working conditions. The results show that when the wind speed is 0 m/s, both windowing modes M1 and M2 can effectively discharge the smoke out of the station. When the wind speed is 3.4 m/s, the windowing mode M1 exhaust time is delayed, the environment wind suppresses the natural smoke exhaust while the windowing mode M2 exhaust time is advanced, and the environment wind promotes the natural smoke exhaust. When the wind speed is 10 m/s strong wind, both of the windowing modes M1 and M2 natural smoke exhaust are invalid. The smoke layer height drops below 2 m from the ground after 600 s, and the visibility decreases to less than 10 m after 800 s, which seriously affects the personnel safety evacuation. Therefore, for the station, the windward exhaust window should be closed and the leeward exhaust window should be opened according to the wind direction to exhaust smoke in case of fire. For strong winds, due to the failure of natural smoke exhaust in both windowing modes M1 and M2, other measures should be taken to exhaust smoke.

关键词： Environmental wind Railway passenger station Natural smoke exhaust FDS natural smoke exhaust railway passenger station FDS Side windows wind speed Bed thickness multiprogramming EXHAUSTS Smoke STATIONS

来源：评论

学校读者我要写书评

暂无评论

Maximizing the GPU resource usage by reordering concurrent kernels submission

引用

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2019年第18期31卷 e4409.1-e4409.15页

作者： Cruz, Rommel A. Q. Bentes, Cristiana Breder, Bernardo Vasconcellos, Eduardo Clua, Esteban de Carvalho, Pablo M. C. Drummond, Lucia M. A. Fed Fluminense Univ Inst Comp BR-24210240 Niteroi RJ Brazil Univ Estado Rio De Janeiro Dept Syst Engn BR-20550900 Maracana RJ Brazil

The increasing amount of resources available on current GPUs sparked new interest in the problem of sharing its resources by different kernels. While new generations of GPUs support concurrent kernel execution, their scheduling decisions are taken by the hardware at runtime. The hardware decisions, however, heavily depend on the order at which the kernels are submitted to execution. In this work, we propose a novel optimization approach to reorder the kernels invocation focusing on maximizing the resources utilization, improving the average turnaround time. We model the kernel assignments to the hardware resources as a series of knapsack problems and use a dynamic programming approach to solve them. We evaluate our method using kernels with different sizes and resource requirements. Our results show significant gains in the average turnaround time and system throughput compared to the kernels submission implemented in modern GPUs.

关键词： graphics processing unit kernel scheduling multiprogramming

来源：评论

学校读者我要写书评

暂无评论

Intelligent SRTF: A New Approach to Reduce the Number of Context Switches in SRTF 1st

Intelligent SRTF: A New Approach to Reduce the Number of Con...

引用

1st International Conference on Computational Intelligence and Informatics (ICCII)

作者： Bindu, C. Shoba Reddy, A. Yugandhar Reddy, P. Dileep Kumar JNTUA Coll Engn Anantapur Andhra Pradesh India

ISBN: (纸本)9789811024719;9789811024702

Throughput of the system in multiprogramming and time sharing systems mainly depends on the careful scheduling of the CPU and other I/O devices. CPU scheduling should control the waiting time, response time, turnaround time, and number of context switches. One of the most extensively used scheduling algorithms is shortest next remaining time first (SRTF), which gives the reduced amount of average waiting time. But this algorithm suffers from some drawbacks. One such is that, every upcoming process if selected for execution, causes a context switch even though it is slightly shorter than the currently running process. As the number of such situations increases, the number of context switches increases, causing the reduction in performance of the system. In this paper, we modify the traditional SRTF to intelligent SRTF, by changing the decision of the preemption, to decrease the number of context switches. The main idea of our proposed algorithm is to make a context switch only if the next process plus context switch over head is shorter than the currently running process. By this we can reduce the number of context switches and thereby the performance of the system is improved.

关键词： Throughput Scheduling Burst Preemptive Performance Queue multiprogramming

来源：评论

学校读者我要写书评

暂无评论

NASA Technical Reports Server (Ntrs) 20060014983: Model Checking Real Time Java Using Java Pathfinder

引用

2017年

NASA Technical Reports Server (Ntrs) 20060014983: Model Checking Real Time Java Using Java Pathfinder by NASA Technical Reports Server (Ntrs); published by

关键词： (ntrs) 20060014983: applications programs (computers) autonomy checking computer programs java (programming language) lindstrom, gary mehlitz, peter c. model multiprogramming nasa technical reports server (ntrs) pathfinder priorities real time operation software engineering synchronism tasks visser, willem

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：