检索结果-内蒙古大学图书馆

arXiv 2021年

作者： Hildebrandt, Niclas Boenninghoff, Benedikt Orth, Dennis Schymura, Christopher Data Science Kitchen Germany

This paper presents the contribution1 of the data science Kitchen at GermEval 2021 shared task on the identification of toxic, engaging, and fact-claiming comments. The task aims at extending the identification of offensive language, by including additional subtasks that identify comments which should be prioritized for fact-checking by moderators and community managers. Our contribution focuses on a feature-engineering approach with a conventional classification backend. We combine semantic and writing style embeddings derived from pre-trained deep neural networks with additional numerical features, specifically designed for this task. Classifier ensembles are used to derive predictions for each subtask via a majority voting scheme. Our best submission achieved macro-averaged F1-scores of 66.8%, 69.9% and 72.5% for the identification of toxic, engaging, and fact-claiming comments. © 2021, CC BY.

关键词： data science

来源：评论

学校读者我要写书评

暂无评论

On the Pathwise Uniqueness of Solutions of One-dimensional Reflected Stochastic Differential Equations with Jumps

引用

Acta Mathematicae Applicatae Sinica 2024年第1期40卷 149-163页

作者： Hua Zhang School of Statistics and Data Science&Key Laboratory of Data Science in Finance and Economics Jiangxi University of Finance and EconomicsNanchang330013China

In this paper, we are concerned with the problem of the pathwise uniqueness of one-dimensional reflected stochastic differential equations with jumps under the assumption of non-Lipschitz continuous coefficients whose... 详细信息

关键词： reflected diffusion processes with jumps pathwise uniqueness local time Meyer It?'s formula

来源：评论

学校读者我要写书评

暂无评论

Skewed Distributions in data science

引用

CHANCE 2022年第1期35卷 51-55页

作者： Nairanjana Dasgupta Boeing Distinguished Professor in the Department of Mathematics and Statistics and a Data Science Fellow at Washington State University Nicole Lazar

This column is about raising questions, rather than providing answers. These days “data based decision making” is the rage among administrators in both industry and academia. The desire for this dependence on algorithms stems from the general idea that “humans are biased but machines are not”. More and more social decisions, like qualifying for welfare are, are made using algorithms. With this, the data scientists, who are behind the algorithms, are given a lot of power (and responsibility.) In this column, I discuss demographic characteristics of data scientists with conjectures on why this group is non-diverse. Should we allow a small group of non-representative people to make decisions that affect affect larger society?

关键词：

来源：评论

学校读者我要写书评

暂无评论

Statistics, Machine Learning, and data science: A Historical Review and a Look to the Future

Statistics, Machine Learning, and Data Science: A Historical...

引用

IEEE International Conference on Research, Innovation and Vision for the Future

作者： Tru Cao Department of Biostatistics and Data Science The University of Texas Health Science Center at Houston School of Public Health Chair: Cuong Pham (Posts & Telecommunications Institute of Technology Vietnam

We are witnessing the beginning of a data driven era with the explosion of data, impact of data on our everyday lives, and advances of data processing methodology and technology. At this juncture, data science has emerged as an interdisciplinary field to deal with data for which statistics and machine learning are two key enablers. Originally, statistics and machine learning appear to have been developed in the different contexts of mathematics and computer science, respectively. data science has brought them together in which statistics focuses on mathematical foundations and methods, while machine learning is more on algorithms and automated data processing. First, this talk looks back on a timeline of the emergence of statistics, machine learning, data science, and related fields. Second, it reviews a chronology of the invention and context of some important statistical and machine learning methods. Third, it discusses relationships between statistics, machine learning, and data science. Finally, it addresses some challenges and overcoming ways of learning from big data.

关键词：

来源：评论

学校读者我要写书评

暂无评论

False Negative Sample Detection for Graph Contrastive Learning

引用

Tsinghua science and Technology 2024年第2期29卷 529-542页

作者： Binbin Zhang Li Wang College of Data Science Taiyuan University of TechnologyJinzhong 030600China

Recently,self-supervised learning has shown great potential in Graph Neural Networks (GNNs) through contrastive learning,which aims to learn discriminative features for each node without label information. The key to graph contrastive learning is data augmentation. The anchor node regards its augmented samples as positive samples,and the rest of the samples are regarded as negative samples,some of which may be positive samples. We call these mislabeled samples as “false negative” samples,which will seriously affect the final learning effect. Since such semantically similar samples are ubiquitous in the graph,the problem of false negative samples is very significant. To address this issue,the paper proposes a novel model,False negative sample Detection for Graph Contrastive Learning (FD4GCL),which uses attribute and structure-aware to detect false negative samples. Experimental results on seven datasets show that FD4GCL outperforms the state-of-the-art baselines and even exceeds several supervised methods.

关键词： graph representation learning contrastive learning false negative sample detection

来源：评论

学校读者我要写书评

暂无评论

Editorial: Advances in Network data science

引用

Journal of data science 2023年第3期21卷 443-445页

作者： Chen, Yuguo Sewell, Daniel Zhang, Panpan Zhu, Xuening Department of Statistics University of Illinois at Urbana-Champaign Champaign 61820 IL United States Department of Biostatistics University of Iowa Iowa City 52246 IA United States Department of Biostatistics Vanderbilt University Medical Center Nashville 37203 TN United States School of Data Science Fudan University Shanghai China

来源：评论

学校读者我要写书评

暂无评论

A generalized integrated framework for urban public transport operations evaluation based on interval neutrosophic TODIM and EDAS technique

引用

Soft Computing 2024年第17-18期28卷 10331-10344页

作者： Cai, Yunpeng Department of Data Science and Big Data Technology Shenyang Normal University Liaoning Shenyang110034 China

With the acceleration of urbanization construction, the contradiction between supply and demand of urban public transportation resources is becoming increasingly prominent, resulting in increasingly serious problems such as traffic congestion and environmental pollution, which has a great impact on the sustainable development of cities. The experience of mature markets both domestically and internationally indicates that priority must be given to the development of public transportation networks with rail transit as the backbone. Due to the fact that the operation of the rail transit system should not only focus on market benefits but also meet social benefits as much as possible, evaluating efficiency has become a prerequisite for managing rail transit and an important topic in the performance evaluation of urban public transportation. The urban public transport operations evaluation is a multiple-attribute group decision-making (MAGDM). Then, the TODIM and EDAS method has been constructed to deal with MAGDM issues. The interval neutrosophic sets (INSs) are constructed as an effective tool for representing uncertain information during the urban public transport operations evaluation. In this manuscript, the interval neutrosophic number TODIM-EDAS (INN-TODIM-EDAS) method is constructed to solve the MAGDM under INSs. Finally, a numerical example study for urban public transport operations evaluation is constructed to validate the INN-TODIM-EDAS method. The main research contribution of this paper is constructed: (1) the INN-TODIM-EDAS method is put up for MAGDM with INSs;(2) the INN-TODIM-EDAS method is put up for urban public transport operations evaluation and were compared with existing methods;(3) Through the detailed comparison, it is evident that INN-TODIM-EDAS method for urban public transport operations evaluation proposed in this paper are effective. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.

关键词： Traffic congestion

来源：评论

学校读者我要写书评

暂无评论

THE STABILITY OF AF-RELATIONS

引用

Acta Mathematica Scientia 2024年第6期44卷 2443-2464页

作者： Jiajie HUA College of Data Science Jiacing UniversityJiacing 314001China

For given l,s∈N,Λ={ρ_(j)}_(j=i),…,s,ρj∈T,the C^(*)-algebra B:=ε({r_(j)}_(j=1),…,s,Λ,l)is defined to be the universal C^(*)-algebra generated by l unitaries u_(1),…,u_(l) subject to the relations r_(j)(u_(1),…,u_(l))-ρ_(j)=0 for all j=1,…,s,where the r_(j) is monomial in u_(1),…,u_(l) and their inverses for j=1,2,…,*** B is a unital AF-algebra with a unique tracial state,and K_(0)(B)is a finitely generated group,we say that the relations({r_(j)}_(j=1),…,s,Λ,l)are *** the relations({r_(j)}_(j=1),…,s,Λ,l)are AF-relations,we prove that,for any ε>0,there exists a δ>0 satisfying the following:for any unital C^(*)-algebra A with the cancellation property,strict comparison,nonempty tracial state space,and any l unitaries u_(1),u_(2),…,u_(l)∈A satisfying‖r_(j)(u_(1),u_(2),…,u_(l))-ρ_(j)‖<δ,j=1,2,…,s,and certain trace conditions,there exist l unitaries u_(1),u_(2),…,u_(l)∈A such that r_(j)(u_(1),u_(2),…,u_(l))=ρ_(j) for j=1,2,…,s,and‖ui-ui‖<ε for i=1,2,…,***,we give several applications of the above result.

关键词： C^(*)-algebras stability AF-relations unitary

来源：评论

学校读者我要写书评

暂无评论

Activities of the Polar Environment data science Center of ROIS-DS, Japan

引用

data science Journal 2022年第1期21卷

作者： Kadokura, Akira Kanao, Masaki Yabuki, Hironori Tanaka, Yoshimasa Nishimura, Koji Polar Environment Data Science Center Joint Support-Center for Data Science Research Research Organization of Information and Systems Tokyo Japan National Institute of Polar Research Research Organization of Information and Systems Tokyo Japan Research Institute for Sustainable Humanosphere Kyoto University Kyoto Japan

The Polar Environment data science Center (PEDSC) is one of the centers of the Joint Support-Center for data science Research (DS) of the Research Organization of Information and Systems (ROIS), which was established in 2017. The purpose of the PEDSC is to promote the opening and sharing of the scientific data obtained by research activities in the polar region led by the National Institute of Polar Research (NIPR). Activities of the PEDSC have been carried out along a five year plan with the following seven specific tasks since 2017: (1) construction of an integrated database;(2) upgrade and interoperable use of the three existing database systems (NIPR science database, Arctic data archive System (ADS), and Inter-university Upper atmosphere Global Observation NETwork system (IUGONET));(3) processing of the time-series digital data;(4) processing of the sample data;(5) data publication in the Polar data Journal;(6) collaboration with external communities;and (7) promoting data science using the database and database system. © 2022 The Author(s).

关键词： Open data

来源：评论

学校读者我要写书评

暂无评论

D^(2)-GCN:a graph convolutional network with dynamic disentanglement for node classification

引用

Frontiers of Computer science 2025年第1期19卷 145-161页

作者： Shangwei WU Yingtong XIONG Hui LIANG Chuliang WENG School of Data Science and Engineering East China Normal UniversityShanghai 200062China

Classic Graph Convolutional Networks (GCNs) often learn node representation holistically, which ignores the distinct impacts from different neighbors when aggregating their features to update a node’s representation. Disentangled GCNs have been proposed to divide each node’s representation into several feature units. However, current disentangling methods do not try to figure out how many inherent factors the model should assign to help extract the best representation of each node. This paper then proposes D^(2)-GCN to provide dynamic disentanglement in GCNs and present the most appropriate factorization of each node’s mixed features. The convergence of the proposed method is proved both theoretically and experimentally. Experiments on real-world datasets show that D^(2)-GCN outperforms the baseline models concerning node classification results in both single- and multi-label tasks.

关键词： graph convolutional networks dynamic disentanglement label entropy node classification

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：