To meet the goals of the national "Dual Carbon" strategy and reduce energy consumption in the steel industry, accurate prediction of steel composition is crucial for precise control over alloy addition in st...
详细信息
To meet the goals of the national "Dual Carbon" strategy and reduce energy consumption in the steel industry, accurate prediction of steel composition is crucial for precise control over alloy addition in steelmaking. Several models have been created to predict the composition of the converter endpoint with a high level of accuracy. However, the different shortcomings of each have prevented large-scale application in real production environments. CBR prediction model has limited scope to solve the problem. CNN model has complex data processing and no memory. RELM model has randomly given input layer weights and hidden layer deviations. In this study, correlation analysis was used to analyze the factors influencing the carbon content at the endpoint of converter steelmaking. A feasible model was established and applied to predict the carbon content at the endpoint of converter using t-distributed stochastic neighbor embedding (t-SNE), particle swarm optimization (PSO), and backpropagation (BP) neuralnetwork. The learning rate, training times, and hidden layer nodes number of the prediction model were optimized. The prediction hit ratios for the carbon content in the error ranges of +/- 0.003%, +/- 0.01%, and +/- 0.02% are 61%, 86%, and 98%, respectively. Meanwhile, apply the established model to actual production, the carbon content of the product can be stably controlled between the lower and median limits, the control effect is significantly better than traditional methods. The results demonstrate that the t-SNE-PSO-BP model performs better than the known models. The accurate prediction of the carbon content at the endpoint of converter can greatly contribute to realizing a "narrow composition control" of the molten steel. Realize accurate prediction of carbon content at the endpoint of converter smelting, and has been effectively applied to industrial *** AbstractUnder the traditional method of predicting the endpoint carbon content of the conver
Estimating the total wind power output from the meteorological information at a province level (called Provincial Regional Wind Power Conversion Model, PRWPCM) plays vital and fundamental roles in energy modeling comm...
详细信息
Estimating the total wind power output from the meteorological information at a province level (called Provincial Regional Wind Power Conversion Model, PRWPCM) plays vital and fundamental roles in energy modeling community and regional wind power forecasting. How to construct a reliable PRWPCM is a real challenge, since PRWPCM involves a large number of widely distributed wind turbines, massive meteorological data across the whole province, and complex nonlinear correlations. This paper proposes a lightweight PRWPCM by integrating Geographic Information System (GIS) analysis technology and Convolutional neuralnetwork (CNN). First, we conduct the land suitability analysis for wind turbine sites through the multi-criteria GIS layer overlay method to make the provincial wind turbine land suitability map (WTLSM) with scored divisions from the least suitable to the most suitable areas. On this basis, a new fusion mechanism for geographic and meteorological information is proposed, through which the raw meteorological data matrix can be reconstructed to filter and amplify the meteorological information that is more relevant to the total wind power output of the province, and avoid the time-consuming and labor-intensive data collection and processing, large-size model construction and validation. Second, a CNN-based regression architecture is designed to further capture the mapping relationship between the reconstructed meteorological data and total wind power output of the province;each type of meteorological factor is considered as an input channel and the attention modules are introduced to adaptively enhance useful channels and suppress less useful ones. Numerical experiments based on the wind power operation data of Yunnan Province, China, are conducted to validate the superiority of the proposed PRWPCM via benchmarking against 13 classical methods.
Large language models (LLMs) are useful in many NLP tasks and become more capable with size, with the best open-source models having over 50 billion parameters. However, using these 50B+ models requires high-end hardw...
详细信息
ISBN:
(纸本)9781713899921
Large language models (LLMs) are useful in many NLP tasks and become more capable with size, with the best open-source models having over 50 billion parameters. However, using these 50B+ models requires high-end hardware, making them inaccessible to most researchers. In this work, we investigate methods for cost-efficient inference and fine-tuning of LLMs, comparing local and distributed strategies. We observe that a large enough model (50B+) can run efficiently even on geodistributed devices in a consumer-grade network. This could allow running LLM efficiently by pooling together idle compute resources of multiple research groups and volunteers. We address two open problems: (1) how to perform inference and fine-tuning reliably if any device can disconnect abruptly and (2) how to partition LLMs between devices with uneven hardware, joining and leaving at will. In order to do that, we develop special fault-tolerant inference algorithms and load-balancing protocols that automatically assign devices to maximize the total system throughput. We showcase these algorithms in PETALS1 - a decentralized system that runs Llama 2 (70B) and BLOOM (176B) over the Internet up to 10x faster than offloading for interactive generation. We evaluate the performance of our system in simulated conditions and a real-world setup spanning two continents.
The importance of encrypted traffic classification for network management and security is self-evident. The emergence of programmable data plane (PDP) technology makes it possible to directly implement encrypted traff...
详细信息
Our brain can differentiate many objects and time-dependent changes in the object features. This functionality requires information processing in both spatial and temporal domains. Studies showed that cortical microci...
Our brain can differentiate many objects and time-dependent changes in the object features. This functionality requires information processing in both spatial and temporal domains. Studies showed that cortical microcircuits are intrinsically responsive to spatiotemporal patterns. Confined neuralnetworks in vitro obtained from the brain cortex with an optical interface were used to classify spatiotemporal information in this work. This experimental setup was designed by confining a cortical network in a polydimethylsiloxane (PDMS) well and by co-transfecting the neurons with channel-rhodopsin 2 (ChR2) and jRgeco1a. In line with other studies, our data showed that neurons in dissociated cortical cultures are very susceptible to synchronized bursts, particularly in the confined network due to synaptic scaling. As these bursts represent chaotic activation and do not carry information about the input, we first designed a stimulation protocol to suppress bursts with distributed patterns of optical activation in the network. We found that increases in the Ca2+ baseline level of all neurons were correlated with successful burst suppression and the success rate was higher when the culture medium was replaced by artificial cerebrospinal fluid (ACSF) with high [Mg2+]. In the absence of population bursts, our results showed that the state of the living network could be used to classify input spatial and temporal patterns. Our experimental model shows a paradigm of reservoir computing with a living neuralnetwork that could run learning applications with small sets of training data and computational resources. This model can further be used to study toxicity or influences of commercial drugs on spatiotemporal information in neuralnetworks.
Revealing the organizing principles of developing neuralnetworks is a difficult but significant task in neuroscience. As a creature with a rather compact and well-studied neuralnetwork, C. elegans is an ideal subjec...
详细信息
Revealing the organizing principles of developing neuralnetworks is a difficult but significant task in neuroscience. As a creature with a rather compact and well-studied neuralnetwork, C. elegans is an ideal subject for neuroscience study. However, the researches on its developing neuralnetwork remain challenging. The changes in specific properties of neuralnetwork across development may uncover part of its principles. Motif is a typical structure property that can be well applied to various complex networks. Here, we study the motif changes in C. elegans neuralnetwork across development. By counting the occurrence number of all three-node subgraph motif structures in its neuralnetwork at different stages of C. elegans development, along with those in corresponding random networks, we determine which of these structures are motifs for C. elegans, finding out the regular changes of motifs during its development. Combined with the potential function of these subgraph motifs and synaptic information, we gain insight into the organizing principle of neuralnetwork during development, which may increase our understanding of neuroscience and inspire the construction of artificial neuralnetwork. (c) 2022 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://***/licenses/by/4.0/).
The power systems are transitioning to renewables. The power system infrastructure digitalization is the next stage in this evolution. A digitalized future distribution network (DN) has an opportunity to evolve into r...
详细信息
The power systems are transitioning to renewables. The power system infrastructure digitalization is the next stage in this evolution. A digitalized future distribution network (DN) has an opportunity to evolve into real-time analysis based on a huge volume of data. Such real-time analysis will be feasible through the integration of the theoretical background of fault analysis and machine learning techniques. Fault location is one of the major issues to improve reliability indices. The voltage sags characterization is used to locate faults in the DN. This article presents a methodology to characterize voltage sags using fault analysis and deep convolutional neuralnetworks. The voltage divider model allows the characterization and the discrete wavelet transform is used in signal processing. The machine learning and deep learning models used allow estimating the sag magnitude, fault location, phases involved, duration, and impact of distributed generation (DG) in each event. The IEEE 13-node test feeder including DG was used to validate the effectiveness of the proposed methodology. This paper provides a way to handle a Big Data stream in the DN control center and to efficiently locate faults in several operational scenarios.
The Hopfield neuron is an artificial neuron model used for pattern memorization and recognition. It exhibits a complex dynamic with stable states corresponding to memorized patterns. In order to grasp a more complete ...
详细信息
The Hopfield neuron is an artificial neuron model used for pattern memorization and recognition. It exhibits a complex dynamic with stable states corresponding to memorized patterns. In order to grasp a more complete representation of the information exchange between two neurons, emphasizing the importance of neuronal connections in brain processing, we propose in this work the coupling of two fractional-order Hopfield neurons via a fractional-order flux-controlled multistable memristor. Each of these two neurons incorporates a selfcoupling memristive component, called an autapse memristor. Additionally, the second neuron is subjected to an external electromagnetic radiation, simulated by an additional memristor. The I-V characteristics of the memristors integrated in this model are analyzed through numerical simulations. The simulations of model dynamics versus its diverse parameters have revealed rich and complex dynamical behaviors. These simulations demonstrate that the proposed model generates a variety of homogeneous and heterogeneous chaotic attractors, distributed at diverse locations. The elaborated memristor coupled fractional-order bi-Hopfield neuron MCFBHN model is implemented on an Arduino-Due platform. A comparison of the results of the two approaches shows good consistency.
processingneuralnetwork inferences on edge devices, such as smartphones, IoT devices and smart sensors can provide substantial advantages compared to traditional cloud-based computation. These include technical aspe...
详细信息
Road network representation learning aims to learn compressed and effective vectorized representations for road segments that are applicable to numerous tasks. In this paper, we identify the limitations of existing me...
暂无评论