Since OpenAI opened access to ChatGPT,large language models(LLMs)become an increasingly popular topic attracting researchers’attention from abundant ***,public researchers meet some problems when developing LLMs give...
详细信息
Since OpenAI opened access to ChatGPT,large language models(LLMs)become an increasingly popular topic attracting researchers’attention from abundant ***,public researchers meet some problems when developing LLMs given that most of the LLMs are produced by industries and the training details are typically *** datasets are an important setup of LLMs,this paper does a holistic survey on the training datasets used in both the pre-train and fine-tune *** paper first summarizes 16 pre-train datasets and 16 fine-tune datasets used in the state-of-the-art ***,based on the properties of the pre-train and fine-tune processes,it comments on pre-train datasets from quality,quantity,and relation with models,and comments on fine-tune datasets from quality,quantity,and *** study then critically figures out the problems and research trends that exist in current LLM *** study helps public researchers train and investigate LLMs by visual cases and provides useful comments to the research community regarding data *** the best of our knowledge,this paper is the first to summarize and discuss datasets used in both autoregressive and chat *** survey offers insights and suggestions to researchers and LLM developers as they build their models,and contributes to the LLM study by pointing out the existing problems of LLM studies from the perspective of data.
In the era of bigdata, there are more and more outdoor camera acquisition equipment. Due to the influence of extreme weather, such as fog, camera acquisition equipment is easy to lead to the decline of image quality ...
详细信息
Based on flight operation data, this paper constructs a diversion path planning method for busy waypoints by analyzing the relationship of flight traffic conduction between waypoints. Taking busy waypoint KHN as an ex...
详细信息
Online job advertisements on various job portals or websites have become the most popular way for people to find potential career opportunities ***,the majority of these job sites are limited to offering fundamental f...
详细信息
Online job advertisements on various job portals or websites have become the most popular way for people to find potential career opportunities ***,the majority of these job sites are limited to offering fundamental filters such as job titles,keywords,and compensation *** often poses a challenge for job seekers in efficiently identifying relevant job advertisements that align with their unique skill sets amidst a vast sea of ***,we propose well-coordinated visualizations to provide job seekers with three levels of details of job information:a skill-job overview visualizes skill sets,employment posts as well as relationships between them with a hierarchical visualization design;a post exploration view leverages an augmented radar-chart glyph to represent job posts and further facilitates users’swift comprehension of the pertinent skills necessitated by respective positions;a post detail view lists the specifics of selected job posts for profound analysis and *** using a real-world recruitment advertisement dataset collected from 51Job,one of the largest job websites in China,we conducted two case studies and user interviews to evaluate *** results demonstrated the usefulness and effectiveness of our approach.
Graph processing has been widely used in many scenarios,from scientific computing to artificial *** processing exhibits irregular computational parallelism and random memory accesses,unlike traditional ***,running gra...
详细信息
Graph processing has been widely used in many scenarios,from scientific computing to artificial *** processing exhibits irregular computational parallelism and random memory accesses,unlike traditional ***,running graph processing workloads on conventional architectures(e.g.,CPUs and GPUs)often shows a significantly low compute-memory ratio with few performance benefits,which can be,in many cases,even slower than a specialized single-thread graph *** domain-specific hardware designs are essential for graph processing,it is still challenging to transform the hardware capability to performance boost without coupled software *** article presents a graph processing ecosystem from hardware to *** start by introducing a series of hardware accelerators as the foundation of this ***,the codesigned parallel graph systems and their distributed techniques are presented to support graph ***,we introduce our efforts on novel graph applications and hardware *** results show that various graph applications can be efficiently accelerated in this graph processing ecosystem.
Edge learning (EL) is an end-to-edge collaborative learning paradigm enabling devices to participate in model training and data analysis, opening countless opportunities for edge intelligence. As a promising EL framew...
详细信息
Quasi-Affine Transformation Evolutionary (QUATRE) algorithm is a kind of swarm-based collaborative optimization algorithm that solves the problem of a position deviation in a DE search by using the co-evolution matrix...
详细信息
This paper addresses the finite-time anti-synchronization issue for a type of delayed memristive neural networks. By designing a novel memoryless state-feedback controller, novel criteria on finite-time anti-synchroni...
详细信息
作者:
Zhong, WenjieSun, TaoZhou, Jian-TaoWang, ZhuoweiSong, XiaoyuInner Mongolia University
College of Computer Science the Engineering Research Center of Ecological Big Data Ministry of Education the Inner Mongolia Engineering Laboratory for Cloud Computing and Service Software the Inner Mongolia Engineering Laboratory for Big Data Analysis Technology Hohhot010000 China Guangdong University of Technology
School of Computer Science and Technology Guangzhou510006 China Portland State University
Department of Electrical and Computer Engineering PortlandOR97207 United States
Colored Petri nets (CPNs) provide descriptions of the concurrent behaviors for software and hardware. Model checking based on CPNs is an effective method to simulate and verify the concurrent behavior in system design...
详细信息
Session-based recommendation(SBR)and multibehavior recommendation(MBR)are both important problems and have attracted the attention of many researchers and *** from SBR that solely uses one single type of behavior sequ...
详细信息
Session-based recommendation(SBR)and multibehavior recommendation(MBR)are both important problems and have attracted the attention of many researchers and *** from SBR that solely uses one single type of behavior sequences and MBR that neglects sequential dynamics,heterogeneous SBR(HSBR)that exploits different types of behavioral information(e.g.,examinations like clicks or browses,purchases,adds-to-carts and adds-to-favorites)in sequences is more consistent with real-world recommendation scenarios,but it is rarely *** efforts towards HSBR focus on distinguishing different types of behaviors or exploiting homogeneous behavior transitions in a sequence with the same type of ***,all the existing solutions for HSBR do not exploit the rich heterogeneous behavior transitions in an explicit way and thus may fail to capture the semantic relations between different types of ***,all the existing solutions for HSBR do not model the rich heterogeneous behavior transitions in the form of graphs and thus may fail to capture the semantic relations between different types of *** limitation hinders the development of HSBR and results in unsatisfactory *** a response,we propose a novel behavior-aware graph neural network(BGNN)for *** BGNN adopts a dual-channel learning strategy for differentiated modeling of two different types of behavior sequences in a ***,our BGNN integrates the information of both homogeneous behavior transitions and heterogeneous behavior transitions in a unified *** then conduct extensive empirical studies on three real-world datasets,and find that our BGNN outperforms the best baseline by 21.87%,18.49%,and 37.16%on average correspondingly.A series of further experiments and visualization studies demonstrate the rationality and effectiveness of our *** exploratory study on extending our BGNN to handle more than two types of behaviors show that our BGNN can e
暂无评论