检索结果-内蒙古大学图书馆

science 2025年第6740期387卷 1261页

作者： Dov Greenbaum Mark Gerstein The reviewer is at the Zvi Meitar Institute for Legal Implications of Emerging Technologies The reviewer is at the Harry Radzyner Law School The reviewer is at the Dina Recanati School of Medicine Reichman University Herzliya Israel The reviewer is at the Department of Biomedical Informatics and Data Science The reviewer is at the Program in Computational Biology and Bioinformatics The reviewer is at the Department of Computer Science Yale University New Haven CT USA.

来源：评论

学校读者我要写书评

暂无评论

Theory of dual-sparse regularized randomized reduction 32

Theory of dual-sparse regularized randomized reduction

引用

32nd International Conference on Machine Learning, ICML 2015

作者： Yang, Tianbao Zhang, Lijun Jin, Rong Zhu, Shenghuo Department of Computer Science University of Iowa Iowa City United States National Key Laboratory for Novel Software Technology Nanjing University Nanjing China Department of Computer Science and Engineering Michigan State University East Lansing Institute of Data Science and Technologies at Alibaba Group Seattle United States Institute of Data Science and Technologies at Alibaba Group Seattle United States

ISBN: (纸本)9781510810587

In this paper, we study randomized reduction methods, which reduce high-dimensional features into low-dimensional space by randomized methods (e.g., random projection, random hashing), for large-scale high-dimensional classification. Previous theoretical results on randomized reduction methods hinge on strong assumptions about the data, e.g., low rank of the data matrix or a large separable margin of classification, which hinder their applications in broad domains. To address these limitations, we propose dual-sparse regularized randomized reduction methods that introduce a sparse regularizer into the reduced dual problem. Under a mild condition that the original dual solution is a (nearly) sparse vector, we show that the resulting dual solution is close to the original dual solution and concentrates on its support set. In numerical experiments, we present an empirical study to support the analysis and we also present a novel application of the dual-sparse regularized randomized reduction methods to reducing the communication cost of distributed learning from large-scale high-dimensional data.

关键词： Clustering algorithms

来源：评论

学校读者我要写书评

暂无评论

Leveraging Deep Learning Models for Machine-Generated Text Detection Using Transformer-Based Models

Leveraging Deep Learning Models for Machine-Generated Text D...

引用

2024 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems, ICSES 2024

作者： Suthanthiradevi, P. Revathy, G. Rejini, K. Muthu Lakshmi, V. School of Computing Srm institute of Science and Technology Department of Data Science and Business Systems Kattankulathur Chennai India Vels Institute of science Technology and Advanced studies Department of Computer science and engineering Chennai117 India Amrita college of Engineering and Technology Department of Computer Science Engineering Nagercoil India St. Joseph's College of Engineering Department of Computer Science and Engineering OMR Chennai India

ISBN: (纸本)9798331543617

GPT is a large language model (LLM) derived from natural language processing that can generate a human-like text using machine learning. However, these models raise questions about authenticity and reliability of material, particularly in fields such as journalism, social media, and academia, despite their usefulness for automating text-based tasks. Detecting machine-generated text is thus an important difficulty in ensuring content integrity. This study investigates the use of huge language models as a technique for recognizing machine-generated material. The author proposes a comprehensive detection model by evaluating the language patterns, syntactic structures, and stylistic traits that separate AI-generated literature from human writing. In addition, this research investigate the possibilities of fine-tuning models designed expressly for text identification tasks and evaluate their performance using LLM - Detect AI Generated Text datasets. In digital ecosystems, LLMs are effective at detecting AI-generated text, providing a novel approach for content moderation, academic integrity checks, and synthetic media detection. An increasingly AI-powered future will require a model that can discriminate between human and machine-generated writing in real-time. According to experimental findings, the CNN architecture's design combined with the use of DistilBERT embeddings allows for the effective and efficient classification of AI generated text data, achieving an exceptional 98% accuracy rate. © 2024 IEEE.

关键词： Adversarial machine learning

来源：评论

学校读者我要写书评

暂无评论

database Design for Pre-Eclampsia Case Administration 8

Database Design for Pre-Eclampsia Case Administration

引用

8th International Conference on Information Technology and Digital Applications, ICITDA 2023

作者： Asadi, Faisal Elwirehardja, Gregorius Natanael Trinugroho, Joko Pebrianto Rahutomo, Reza Pardamean, Bens School of Computer Science Bina Nusantara University Computer Science Department Jakarta11480 Indonesia Bina Nusantara University Bioinformatics and Data Science Research Center Jakarta11480 Indonesia School of Information Systems Bina Nusantara University Information System Department Jakarta11480 Indonesia Bina Nusantara University Binus Graduate Program Master of Computer Science Computer Science Department Jakarta11480 Indonesia

ISBN: (纸本)9798350344691

The Ministry of Health of Indonesia has referred to pre-eclampsia as one of the most severe diseases affecting women. As an urgency, it is crucial to administrate pre-eclampsia cases for disease prevention as a long-term national healthcare strategy. Regarding health science, case data was significant in developing research and innovation. However, the main problem regarding pre-eclampsia case administration is data handling, recording, and management incompetence. Hence, this research proposed a conceptual design of a database for pre-eclampsia case administration. The proposed design covered conceptual, logical, and physical design. We elaborate the concept into three concepts of pre-eclampsia disease: pre-treatment, treatment, and post-treatment. This study proposed a solution to gain more data and study pre-eclampsia disease in Indonesia. © 2023 IEEE.

关键词： Medical informatics

来源：评论

学校读者我要写书评

暂无评论

Ecological Show Cave and Wild Cave: Negative Binomial Gllvm's Arthropod Community Modelling 3

Ecological Show Cave and Wild Cave: Negative Binomial Gllvm'...

引用

3rd International Conference on computer science and Computational Intelligence, ICCSCI 2018

作者： Caraka, Rezzy Eko Shohaimi, Shamarina Kurniawan, Isma Dwi Herliansyah, Riki Budiarto, Arif Sari, Shinta Purnama Pardamean, Bens Bioinformatics and Data Science Research Center Bina Nusantara University Indonesia Department of Statistics Padjadjaran University Indonesia Department of Biology UIN Sunan Gunung Djati Bandung Indonesia Kalimantan Indonesia Computer Science Department School of Computer Science Bina Nusantara University Jakarta11480 Indonesia Computer Science Department BINUS Graduate Program Master of Computer Science Bina Nusantara University Jakarta11480 Indonesia

Ecology is a branch of biology that studies the interaction and relationship between organisms and their environment. Abundance, distribution of organisms and patterns of biodiversity are great interests for many ecologists. One of interesting ecosystems to be studied is a cave. A cave has a typical environment character with a vulnerable ecosystem. Many caves in Indonesia, particularly in Gunungsewu karst area have been developed into tourist objects (show caves) and managed imprudently. Such cave management has potential to harm the environment and leads to ecosystem destruction. Arthropods are the most abundance fauna in cave that play critical roles in maintaining cave ecosystems equilibrium. In the heart of statistical ecology, we need to analyze the differences on Arthropods community and abiotic (climatic-edaphic) parameters among show caves and wild caves. Statistical techniques are needed for the extraction of such information. GLLVM is one method that is able to explain spatial-based information and is particularly suitable for ecology. In this paper, we use negative binomial models to see the differences on spatial patterns of predator and decomposer Arthropods, also characteristic of edaphic and climatic in each cave. © 2018 The Authors. Published by Elsevier Ltd.

关键词： Caves

来源：评论

学校读者我要写书评

暂无评论

Comments on "A Lightweight Privacy Preserving Authentication Protocol for VANETs" 9

Comments on "A Lightweight Privacy Preserving Authentication...

引用

9th International Conference on computer and Communications, ICCC 2023

作者： Awais, Syed Muhammad Yucheng, Wu Mahmood, Khalid Kharel, Rupak Chongqing University School of Microelectronics and Communication Engineering Chongqing China University of Central Lancashire School of Psychology and Computer Science Graduate School of Intelligent Data Science Preston United Kingdom National Yunlin University of Science and Technology School of Psychology and Computer Science Graduate School of Intelligent Data Science Yunlin64002 Taiwan University of Central Lancashire School of Psychology and Computer Science Faculty of Electrical and Electronics Engineering Preston United Kingdom Ton Duc Thang University School of Psychology and Computer Science Faculty of Electrical and Electronics Engineering Ho Chi Minh City Viet Nam

ISBN: (纸本)9798350317251

Vehicle use and the concept of a "smart city"are developing quickly. As a result of this progression, the Vehicular Ad-Hoc Network (VANET) is a popular network for inter-vehicular communication. The data gathering about the state of the road, the whereabouts of vehicles, their speeds, and the description of traffic congestion are obtained using the VANET. The public nature of information flow on the VANET creates serious security risks. data security is typically one of the most important duties of VANET. As a result, it becomes one of the researchers' top priorities. Li et al. [1] introduced a Lightweight Privacy Preserving Authentication Protocol for VANETs. They stated that several attacks could be resisted using their secure authentication devised protocol. However, after analyzing we discovered that their authentication protocol is susceptible to roadside impersonation attack and it does not provide vehicle anonymity. In the end, we have provided a few suggestions to cope with Li et al.'s devised protocol current shortcomings. © 2023 IEEE.

关键词： Vehicular ad hoc networks

来源：评论

学校读者我要写书评

暂无评论

The NYU system for the CoNLL–SIGMORPHON 2018 shared task on universal morphological reinflection

The NYU system for the CoNLL–SIGMORPHON 2018 shared task on...

引用

2018 CoNLL-SIGMORPHON Shared Task: Universal Morphological Reinflection, CoNLL 2018

作者： Kann, Katharina Lauly, Stanislas Cho, Kyunghyun Center for Data Science New York University New York United States Dept. of Computer Science New York University New York United States

ISBN: (纸本)9781948087834

This paper describes the NYU submission to the CoNLL–SIGMORPHON 2018 shared task on universal morphological reinflection. Our system participates in the low-resource setting of Task 2, track 2, i.e., it predicts morphologically inflected forms in context: given a lemma and a context sentence, it produces a form of the lemma which might be used at an indicated position in the sentence. It is based on the standard attention-based LSTM encoder-decoder model, but makes use of multiple encoders to process all parts of the context as well as the lemma. In the official shared task evaluation, our system obtains the second best results out of 5 submissions for the competition it entered and strongly outperforms the official baseline. © 2018 Association for Computational Linguistics.

关键词： Signal encoding

来源：评论

学校读者我要写书评

暂无评论

Machine Learning Implementations in Childhood Stunting Research: A Systematic Literature Review 8

Machine Learning Implementations in Childhood Stunting Resea...

引用

8th International Conference on Information Management and Technology, ICIMTech 2023

作者： Rahutomo, Reza Elwirehardja, Gregorius Natanael Isnan, Mahmud Asadi, Faisal Pardamean, Bens School of Information Systems Bina Nusantara University Information Systems Department Jakarta11480 Indonesia School of Computer Science Bina Nusantara University Computer Science Department Jakarta11480 Indonesia Bioinformatics and Data Science Research Center Bina Nusantara University Jakarta11480 Indonesia Binus Graduate Program - Master of Computer Science Bina Nusantara University Computer Science Department Jakarta11480 Indonesia

ISBN: (纸本)9798350326093

Childhood stunting is a condition anticipated to affect the growth potential of children under the age of five. With numerous stunting researches that have been conducted, stunting datasets are now widely available to facilitate stunting research. This provides an opportunity to implement machine learning (ML) principles to produce a broader insight or a novel technique in stunting prediction. A systematic literature review is necessary to discover the landscape of machine learning implementation in the application domain as a preliminary study for creating an effective research roadmap. This paper presents a systematic literature review (SLR) of 22 curated manuscripts that focuses on identifying the ML models applied in stunting research, as well as the datasets used in such studies that were published during 2017-2022. The SLR process found that ML principles have been applied in stunting research since 2017, and the diversity of ML implementation has become more varied in 2021-2022. In terms of ML models, XGBoost and Random Forest are recognized as the two most utilized models, and stunting prediction is the most common ML implementation. The majority of stunting research utilizing ML has been conducted in Indonesia. Although national survey data has been the most commonly utilized dataset in stunting research, researchers in Indonesia have shown a preference for utilizing data from regional or independent surveys. This study will be followed by developing a classifier model for stunted children using XGBoost and Random Forest algorithms. The model will be trained on a dataset generated from StuntingDB. © 2023 IEEE.

关键词： Machine learning

来源：评论

学校读者我要写书评

暂无评论

MMCNet: deep learning–based multimodal classification model using dynamic knowledge

引用

Personal and Ubiquitous Computing 2022年第2期26卷 355-364页

作者： Park, Sung-Soo Chung, Kyungyong Data Mining Lab. Department of Computer Science Kyonggi University 154-42 Gwanggyosan-ro Yeongtong-gu Gyeonggi-do Suwon-si16227 Korea Republic of Division of Computer Science and Engineering Kyonggi University 154-42 Gwanggyosan-ro Yeongtong-gu Gyeonggi-do Suwon-si16227 Korea Republic of

Because of the growth of the business sector dealing in the distribution of movies, software, music, and other contents, a very large amount of contents has accumulated. Accordingly, recommendation systems for inducing user requests for contents are more important. In distribution businesses, accurate content recommendations are required to secure and retain users. To establish a highly accurate recommendation system, the recommended contents must be accurately classified. As classification methods, mainly techniques such as naive Bayes, SGD (stochastic gradient descent), and SVM (support vector machine), are utilized. If all of the information on recommended subjects is applied in the classification process, high-level accuracy can be expected, but heavy calculation, a long service time, and low scalability are incurred. Given this inefficiency, effective classification in which the metadata of contents are used is required. Metadata are expressed in the forms of the domain concept, relation, type, and attribute to allow the complicated relations between multimodal data (text, images, and video) to be processed efficiently. Most classification systems use single modal data to express one piece of knowledge for an item in a domain. Single modal data are limited in terms of improving classification accuracy, because they do not include the useful information provided by different knowledge types. Therefore, in this paper, we propose MMCNet, a deep learning–based multimodal classification model that uses dynamic knowledge. The proposed method consists of a classification model that applies the human learning principle-based CNN (convolution neural network) to multimodal data in combination with text and image knowledge. By using a Web robot agent, multimodal data are collected from the TMDb (The Movie database) data set, which includes a variety of single modal data. In the preprocessing procedures, knowledge integration, knowledge conversion, and knowledge reduction

关键词： Classification (of information)

来源：评论

学校读者我要写书评

暂无评论

HUSS:A Heuristic Method for Understanding the Semantic Structure of Spreadsheets

引用

data Intelligence 2023年第3期5卷 537-559页

作者： Xindong Wu Hao Chen Chenyang Bu Shengwei Ji Zan Zhang Victor S.Sheng Key Laboratory of Knowledge Engineering with Big Data(the Ministry of Education of China) Hefei University of TechnologyChinaSchool of Computer Science and Information EngineeringHefei University of TechnologyHefeiChina Research Institute of Artificial Intelligence Zhejiang LabHangzhouChina Department of Computer Science Texas Tech UniversityLubbockTX 79409USA

Spreadsheets contain a lot of valuable data and have many practical *** key technology of these practical applications is how to make machines understand the semantic structure of spreadsheets,e.g.,identifying cell function types and discovering relationships between cell *** existing methods for understanding the semantic structure of spreadsheets do not make use of the semantic information of cells.A few studies do,but they ignore the layout structure information of spreadsheets,which affects the performance of cell function classification and the discovery of different relationship types of cell *** this paper,we propose a Heuristic algorithm for Understanding the Semantic Structure of spreadsheets(HUSS).Specifically,for improving the cell function classification,we propose an error correction mechanism(ECM)based on an existing cell function classification model[11]and the layout features of *** improving the table structure analysis,we propose five types of heuristic rules to extract four different types of cell pairs,based on the cell style and spatial location *** experimental results on five real-world datasets demonstrate that HUSS can effectively understand the semantic structure of spreadsheets and outperforms corresponding baselines.

关键词： Spreadsheet semantic structure Information extraction Heuristics Cell function analysis Table structure analysis

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：