In this research, we develop a sensor monitoring system for vehicle dynamics aimed at developing a forest soil moisture measurement system for landslide disaster prevention. The identification of landslide hazard warn...
详细信息
The research computing ecosystem is increasingly heterogeneous and diverse. Democratizing access to these essential resources is critical for accelerating research progress. However, the gap between a high-level workl...
详细信息
Modern Machine Learning (ML) training on large-scale datasets is a very time-consuming workload. It relies on the optimization algorithm Stochastic Gradient Descent (SGD) due to its effectiveness, simplicity, and gene...
详细信息
ISBN:
(数字)9798400706318
ISBN:
(纸本)9798400706318
Modern Machine Learning (ML) training on large-scale datasets is a very time-consuming workload. It relies on the optimization algorithm Stochastic Gradient Descent (SGD) due to its effectiveness, simplicity, and generalization performance (i.e., test performance on unseen data). Processor-centric architectures (e.g., CPUs, GPUs) commonly used for modern ML training workloads based on SGD are bottlenecked by data movement between the processor and memory units due to the poor data locality in accessing large training datasets. As a result, processor-centric architectures suffer from low performance and high energy consumption while executing ML training workloads. Processing-In-Memory (PIM) is a promising solution to alleviate the data movement bottleneck by placing the computation mechanisms inside or near memory. Several prior works propose PIM techniques to accelerate ML training;however, prior works either do not consider real-world PIM systems or evaluate algorithms that are not widely used in modern ML training. Our goal is to understand the capabilities and characteristics of popular distributed SGD algorithms on real-world PIM systems to accelerate data-intensive ML training workloads. To this end, we 1) implement several representative centralized parallel SGD algorithms, i.e., based on a central node responsible for synchronization and orchestration, on the real-world general-purpose UPMEM PIM system, 2) rigorously evaluate these algorithms for ML training on large-scale datasets in terms of performance, accuracy, and scalability, 3) compare to conventional CPU and GPU baselines, and 4) discuss implications for future PIM hardware. We highlight the need for a shift to an algorithm-hardware codesign to enable decentralized parallel SGD algorithms in real-world PIM systems, which significantly reduces the communication cost and improves scalability. Our results demonstrate three major findings: 1) The general-purpose UPMEM PIM system can be a viable alternat
The next wave in the era of computing will be outside the realm of desktop computing. In the IoT paradigm, objects surrounding us will communicate and share information. Advanced sensor technologies used in IoT will g...
详细信息
Wireless sensor networks play a major role in applications that require challenging human contact, which can greatly benefit from the use of wireless sensor networks. WSN is widely used in real-time applications, incl...
详细信息
Autonomous vehicles (AVs) have garnered immense interest and investments for more than a decade and a half. Nevertheless, large-scale AV deployments do not seem viable in the near future. In this talk, the speaker wil...
详细信息
ISBN:
(纸本)9781665453783
Autonomous vehicles (AVs) have garnered immense interest and investments for more than a decade and a half. Nevertheless, large-scale AV deployments do not seem viable in the near future. In this talk, the speaker will address questions like "What went wrong?", "Is AI the answer?", "Can (and how do) we course-correct?" and "Which future contributions will matter?". Finally, challenges that must be addressed by the research and engineering communities will be discussed.
The proliferation of cloud computing is directly responsible for the current transformation phase that the information technology sector is going through. The concept of cloud computing is still in its infancy, yet it...
详细信息
The widespread diffusion of Internet of Things (IoT) devices has led to an exponential growth in the volume of data generated at the edge of the network. With the rapid spread of machine learning (ML)-based applicatio...
详细信息
Pressure transmitters, directly or indirectly, play a key role in many systems in nuclear reactors and industries worldwide. The Innovative sensor section(ISS) in Indira Gandhi Center for Atomic Research(IGCAR) develo...
详细信息
When it comes to medical image analysis, problems arise due to the scarce amount of data and computational resources in medical environments. This is because, as earlier stated, cloud settings demand efficient models ...
详细信息
暂无评论