Mental stress poses significant health risks, manifesting in various psychological and physical issues such as depression, anxiety, and cardiovascular complications. Establishing a reliable method for swiftly and accu...
详细信息
Learning behavior in legged robots presents a significant challenge due to its inherent instability and complex constraints. Recent research has proposed the use of a large language model (LLM) to generate reward func...
详细信息
The learnware paradigm was recently proposed by Zhou (2016) with the wish of developing the learnware market to help users build models more efficiently by reusing existing well-performed models rather than starting f...
详细信息
The learnware paradigm was recently proposed by Zhou (2016) with the wish of developing the learnware market to help users build models more efficiently by reusing existing well-performed models rather than starting from scratch. Specifically, a learnware in the learnware market is a well-performed pre-trained model with a specification describing its specialty and utility, and the market identifies helpful learnware(s) for the user's task based on the specification. Recent studies have attempted to realize a homogeneous prototype learnware market initially through Reduced Kernel Mean Embedding (RKME) specification, which requires all models in the market to share the same feature space. However, this limits the application scope of the learnware paradigm because various pre-trained models are often obtained from different feature spaces in real-world scenarios. In this paper, we make the first attempt to enable the learnware to handle heterogeneous feature spaces. We propose a more powerful specification to manage heterogeneous learnwares by integrating subspace learning in the specification design, along with a practical approach for identifying and reusing helpful learnwares for the user's task. Empirical studies on both synthetic data and real-world tasks validate the efficacy of our approach.
Various approaches are proposed to help under-resourced security researchers to detect and analyze software vulnerabilities. It is still incredibly time-consuming and labor-intensive for security researchers to fix su...
详细信息
Various approaches are proposed to help under-resourced security researchers to detect and analyze software vulnerabilities. It is still incredibly time-consuming and labor-intensive for security researchers to fix such reported vulnerabilities due to the increasing size and complexity of modern software systems. The time lag between the reporting and fixing of a security vulnerability causes software systems to suffer from significant exposure to possible attacks. Very recently, some techniques propose to apply pre-trained models to fix security vulnerabilities and have proved their success in improving repair accuracy. However, the effectiveness of existing pre-trained models has not been systematically compared and little is known about their advantages and disadvantages. To bridge this gap, we perform the first extensive study on applying various pre-trained models to automated vulnerability repair. The experimental results on two vulnerability datasets show that all studied pre-trained models consistently outperform the state-of-the-art technique VRepair with a prediction accuracy of 32.94%$\sim$similar to 44.96%. We also investigate the impact of three major phases (i.e., data pre-processing, model training and repair inference) in the vulnerability repair workflow. Inspired by the findings, we construct a simplistic vulnerability repair approach that adopts the transfer learning from bug fixing. Surprisingly, such a simplistic approach can further improve the prediction accuracy of pre-trained models by 9.40% on average. Besides, we provide additional discussion from different aspects (e.g., code representation and a preliminary study with ChatGPT) to illustrate the capacity and limitation of pre-trained model-based techniques. Finally, we further pinpoint various practical guidelines (e.g., the improvement of fine-tuning) for advanced pre-trained model-based vulnerability repair in the near future. Our study highlights the promising future of adopting pre-tr
Designing new molecules is essential for drug discovery and material science. Recently, deep generative models that aim to model molecule distribution have made promising progress in narrowing down the chemical resear...
详细信息
Designing new molecules is essential for drug discovery and material science. Recently, deep generative models that aim to model molecule distribution have made promising progress in narrowing down the chemical research space and generating high-fidelity molecules. However, current generative models only focus on modeling 2-D bonding graphs or 3-D geometries, which are two complementary descriptors for molecules. The lack of ability to jointly model them limits the improvement of generation quality and further downstream applications. In this article, we propose a joint 2-D and 3-D graph diffusion model (JODO) that generates geometric graphs representing complete molecules with atom types, formal charges, bond information, and 3-D coordinates. To capture the correlation between 2-D molecular graphs and 3-D geometries in the diffusion process, we develop a diffusion graph transformer (DGT) to parameterize the data prediction model that recovers the original data from noisy data. The DGT uses a relational attention mechanism that enhances the interaction between node and edge representations. This mechanism operates concurrently with the propagation and update of scalar attributes and geometric vectors. Our model can also be extended for inverse molecular design targeting single or multiple quantum properties. In our comprehensive evaluation pipeline for unconditional joint generation, the experimental results show that JODO remarkably outperforms the baselines on the QM9 and GEOM-Drugs datasets. Furthermore, our model excels in few-step fast sampling, as well as in inverse molecule design and molecular graph generation. Our code is provided in https://***/GRAPH-0/JODO.
Multimodal large language models (MLLMs), initiated with a trained LLM, first align images with text and then fine-tune on multimodal mixed inputs. However, during the continued training, the MLLM catastrophically for...
Code representation is important to machine learning models in the code-related applications. Existing code summarization approaches primarily leverage Abstract Syntax Trees (ASTs) and sequential information from sour...
详细信息
A wireless sensor network (WSN) represents a promising approach for establishing self-organizing wireless networks comprising a substantial number of wireless sensors, with the objective of facilitating communication ...
详细信息
Introducing the unique advantage of additive manufacturing technology into copper-based shape memory alloys(SMAs)to fabricate high-performance alloys has garnered great attention in recent years,but the intrinsic rela...
详细信息
Introducing the unique advantage of additive manufacturing technology into copper-based shape memory alloys(SMAs)to fabricate high-performance alloys has garnered great attention in recent years,but the intrinsic relationships between microstructure and mechanical properties need to be further *** this paper,the microstructural evolution of ternary CuAlNi SMAs fabricated by laser powder bed fusion(LPBF)under the tensile-compressive loading was investigated to determine the underlying mechanism of tension-compression asymmetry,that is,excellent compressive but poor tensile *** characterization of the different deformation stages revealed the numerous activated deformation mech-anism on the 18R martensite matrix.A twin-related transformation dominated the main plastic defor-mation process due to lower stacking faults energy and high-density pre-existing planer defects in the CuAlNi *** twinning nucleated at prior austenite boundaries and developed into parallel and network structures inside the parent grain of different *** addition,the preferred orientation in different stages,the stress-inducedγphase transformation,and the interaction between dislocations and stacking faults are *** results not only provide significant insights to understand the detwinning and deformation twinning process of SMAs but also establish the essential framework of mi-crostructure and mechanical properties of Cu-based SMAs fabricated by LPBF.
Parkinson’s disease (PD) is a debilitating neurodegenerative disease that has severe impacts on an individual’s quality of life. Compared with structural and functional MRI-based biomarkers for the disease, electroe...
详细信息
暂无评论