Search Results - Inner Mongolia University Library

Evolution and Prospects of Foundation Models: From Large Language Models to Large Multimodal Models

Computers, Materials & Continua, 2024, Vol. 80, No. 8, pp. 1753-1808

Authors: Zheyi Chen, Liuchang Xu, Hongting Zheng, Luyao Chen, Amr Tolba, Liang Zhao, Keping Yu, Hailin Feng. Affiliations: College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou 311300, China; Computer Science Department, Community College, King Saud University, Riyadh 11437, Saudi Arabia; Mathematics and Computer Science Department, Faculty of Science, Menofia University, Shebin El Kom, Menoufia Governorate 32511, Egypt; School of Computer Science, Shenyang Aerospace University, Shenyang 110136, China; Graduate School of Science and Engineering, Hosei University, Tokyo 184-8584, Japan

Since the 1950s, when the Turing Test was introduced, there has been notable progress in machine language *** modeling, crucial for AI development, has evolved from statistical to neural models over the last two ***, transformer-based Pre-trained Language Models (PLM) have excelled in Natural Language Processing (NLP) tasks by leveraging large-scale training *** the scale of these models enhances performance significantly, introducing abilities like context learning that smaller models *** advancement in Large Language Models, exemplified by the development of ChatGPT, has made significant impacts both academically and industrially, capturing widespread societal *** survey provides an overview of the development and prospects from Large Language Models (LLM) to Large Multimodal Models (LMM). It first discusses the contributions and technological advancements of LLMs in the field of natural language processing, especially in text generation and language ***, it turns to the discussion of LMMs, which integrate various data modalities such as text, images, and sound, demonstrating advanced capabilities in understanding and generating cross-modal content, paving new pathways for the adaptability and flexibility of AI ***, the survey highlights the prospects of LMMs in terms of technological development and application potential, while also pointing out challenges in data integration and cross-modal understanding accuracy, providing a comprehensive perspective on the latest developments in this field.

Keywords: Artificial intelligence; large language models; large multimodal models; foundation models

Empirical evaluation of machine learning models for analysis of CoVID related diseases on different body organs

Multimedia Tools and Applications, 2024, Vol. 83, No. 38, pp. 86079-86090

Authors: Thombre, Supriya S.; Malik, Latesh; Kumar, Sanjay. Affiliations: Yeshwantrao Chavan College of Engineering, Maharashtra, Nagpur, India; Department of Computer Science and Engineering, Government Engineering College, Maharashtra, Nagpur, India; Department of Computer Science and Engineering, Kalinga University, Chhattisgarh, Raipur, India

CoVID-19 has been linked to long-term consequences on several human body organs, including lung ailments, kidney malfunctions, heart dysrhythmia, alterations in brain nutrient levels, psychological difficulties, abrupt changes in blood pressure, and more. Because of the considerable variety in the impacts on different body parts, researchers find it challenging to create models that can incorporate these effects for treatment recommendations and future disease prevention scenarios. Thus, this article examines some of the most recently proposed models for identifying the impacts of CoVID-19 on various human organs. This review examines the underlying theories in terms of clinical nuances, functional advantages, contextual limits, and potential empirical applications. Based on this discussion, researchers will be able to find the best models for detecting particular diseases on specific body parts. It was discovered that hybrid bioinspired models, when paired with deep learning-based classification algorithms, can effectively detect these impacts. This text also parametrically analyses these models in terms of accuracy, precision, and recall, allowing readers to select the best models for their performance-specific use cases. To expand on this discussion, this article evaluates a unique CoVID-19 Classification Rank Metric (CCRM) that integrates these factors for thorough model identification. Based on these criteria, researchers will be able to develop appropriate models for clinical scenarios that have high accuracy, low delay, and scalability while costing less. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

Keywords: Blood pressure

Deepfake Audio Detection for Urdu Language Using Deep Neural Networks

IEEE Access, 2025, Vol. 13, pp. 97765-97778

Authors: Ahmad, Omair; Khan, Muhammad Sohail; Jan, Salman; Khan, Inayat. Affiliations: University of Engineering and Technology, Department of Computer Software Engineering, Mardan, Pakistan; Arab Open University, Faculty of Computer Studies, A’Ali 732, Bahrain; University of Engineering and Technology, Department of Computer Science, Mardan, Pakistan

Audio Deepfakes are highly realistic fake audio recordings produced by AI tools that clone human voices. Advancements in Text-to-Speech (TTS) and Voice Conversion (VC) technologies have made it easier to create realistic synthetic and imitative speech, making audio Deepfakes a common and potentially dangerous form of deception. Well-known people, like politicians and celebrities, are often targeted: fake recordings make them appear to say controversial things, causing trouble on social media. Even children's voices are cloned to scam parents into ransom payments. Therefore, developing effective algorithms to distinguish Deepfake audio from real audio is critical to preventing such frauds. Various Machine Learning (ML) and Deep Learning (DL) techniques have been created to identify audio Deepfakes. However, most of these solutions are trained on datasets in English, Portuguese, French, and Spanish, raising concerns about their accuracy for other languages. The main goal of the research presented in this paper is to evaluate the effectiveness of deep neural networks in detecting audio Deepfakes in the Urdu language. Since no suitable dataset of Urdu audio is available for this purpose, we created our own dataset (URFV) using both genuine and fake audio recordings. The original (real) Urdu audio recordings were gathered from random YouTube podcasts, and the Deepfake audio was generated from them using the RVC model. Our dataset has three versions, with clips of 5, 10, and 15 seconds. We built various deep neural networks (RNN+LSTM, CNN+attention, TCN, CNN+RNN) to detect Deepfake audio made through imitation or synthetic techniques. The proposed approach extracts Mel-Frequency Cepstral Coefficient (MFCC) features from the audio in the dataset. When tested and evaluated, our models’ accuracy across datasets was noteworthy: 97.78% (5 s), 98.89% (10 s), and 98.33% (15 s) were remarkable results for the RNN+LSTM

Keywords: Deep neural networks

Enhanced Diabetes Detection with Deep Learning: An Iris Image Analysis Approach

5th International Conference on Sustainable Communication Networks and Application, ICSCNA 2024

Authors: Rajarajeswari, T.S.; Snehanjali, Beeram; Rizwana, Rapthadu; Kasi Vardhan, Bethi Phanindra. Affiliations: Velagapudi Ramakrishna Siddhartha Engineering College, Department of Computer Science and Engineering, Vijayawada, India; Velagapudi Ramakrishna Siddhartha Engineering College, Department of Artificial Intelligence and Data Science, Vijayawada, India

ISBN: (print) 9798331530013

Diabetes is a chronic and progressive condition that, if not diagnosed early, can lead to serious health complications. Patients often learn they have diabetes only years after its onset, which underscores the significance of precise and early identification. Thus, diabetes is a global health issue that requires timely detection. Traditional methods for diagnosing diabetes often involve expensive procedures, which can lead to delays in diagnosis. Using iris images for diabetes detection through deep learning algorithms offers a more accurate and cost-effective alternative. In general, a person with diabetes may experience changes in the iris, such as alterations in pattern, texture, or pigmentation. To identify these altered features, high-resolution observation is required. For this purpose, we use advanced imaging techniques and deep learning algorithms to analyze and interpret the changes in iris characteristics. CNNs, known for their powerful feature extraction capabilities, effectively captured and analyzed complex details in iris patterns. This approach uses several deep learning algorithms: Convolutional Neural Networks (CNNs), MobileNetV2, InceptionV3, ResNet50, and CNNs combined with transformers. All these algorithms are widely known for their efficiency in image analysis applications. Thus, we chose to apply the same procedure to each of the algorithms. Finally, we identified the algorithm that gives the highest accuracy on the same dataset compared with the other algorithms. The results showed that CNNs are effective in identifying features and patterns in images, achieving an accuracy of about 93.91%. Thus, the ultimate aim of this approach is to provide an accurate, non-invasive and cost-effective way of diagnosing diabetes. © 2024 IEEE.

Keywords: Convolutional Neural Networks; Disease diagnosis; InceptionV3; Iris; MobileNetV2; ResNet50; Transformer Networks

A Comprehensive Data Description for LoRaWAN Path Loss Measurements in an Indoor Office Setting: Effects of Environmental Factors

IEEE Access, 2025, Vol. 13, pp. 83148-83170

Authors: Obiri, Nahshon; van Laerhoven, Kristof. Affiliation: University of Siegen, Department of Electrical Engineering and Computer Science, Siegen 57076, Germany

This paper presents a comprehensive dataset of LoRaWAN technology path loss measurements collected in an indoor office environment, focusing on quantifying the effects of environmental factors on signal propagation. Utilizing a network of six strategically placed LoRaWAN end devices (EDs) and a single indoor gateway (GW) at the University of Siegen’s Hölderlinstraße Campus in the City of Siegen, Germany, we systematically measured signal strength indicators such as the Received Signal Strength Indicator (RSSI) and the Signal-to-Noise Ratio (SNR) under various environmental conditions, including temperature, relative humidity, carbon dioxide (CO2) concentration, barometric pressure, and particulate matter levels (PM2.5). Our empirical analysis confirms that transient phenomena such as reflections, scattering, interference, occupancy patterns (induced by environmental parameter variations), and furniture rearrangements can alter signal attenuation by as much as 10.58 dB, highlighting the dynamic nature of indoor propagation. As an example of how this dataset can be utilized, we tested and evaluated a refined Log-Distance Path Loss and Shadowing Model that integrates both structural obstructions (Multiple Walls) and Environmental Parameters (LDPLSM-MW-EP). Compared to a baseline model that considers only Multiple Walls (LDPLSM-MW), the enhanced approach reduced the root mean square error (RMSE) from 10.58 dB to 8.04 dB and increased the coefficient of determination (R2) from 0.6917 to 0.8222. By capturing the extra effects of environmental conditions and occupancy dynamics, this improved model provides valuable insights for optimizing power usage and prolonging device battery life, enhancing network reliability in indoor Internet of Things (IoT) deployments, among other applications. This dataset offers a solid foundation for future research and development in indoor wireless communication. © 2013 IEEE.
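The baseline named in the abstract, a log-distance path loss model with a multi-wall term, together with the two reported fit metrics (RMSE and R2), can be sketched roughly as follows. This is a generic textbook form, not the paper's implementation: the function names, the reference distance `d0_m`, and every numeric value in the usage lines are hypothetical, and the shadowing term and the environmental-parameter extension are omitted.

```python
import math

def path_loss_db(d_m, pl0_db, n, wall_losses_db=(), d0_m=1.0):
    """Log-distance path loss in dB with an additive multi-wall term.

    pl0_db is the loss at the reference distance d0_m, n is the path loss
    exponent, and wall_losses_db lists the per-wall attenuation in dB.
    """
    return pl0_db + 10.0 * n * math.log10(d_m / d0_m) + sum(wall_losses_db)

def rmse(measured, predicted):
    """Root mean square error between measured and predicted values."""
    squared = [(m - p) ** 2 for m, p in zip(measured, predicted)]
    return math.sqrt(sum(squared) / len(squared))

def r_squared(measured, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(measured) / len(measured)
    ss_res = sum((m - p) ** 2 for m, p in zip(measured, predicted))
    ss_tot = sum((m - mean) ** 2 for m in measured)
    return 1.0 - ss_res / ss_tot

# Hypothetical usage: three links with made-up distances, walls and measurements.
links = [(5.0, ()), (10.0, (3.0,)), (20.0, (3.0, 5.0))]
predicted = [path_loss_db(d, pl0_db=40.0, n=2.0, wall_losses_db=w) for d, w in links]
measured = [55.0, 62.0, 75.0]
print(rmse(measured, predicted), r_squared(measured, predicted))
```

A lower RMSE and a higher R2 on held-out measurements would indicate a better fit, which is the sense in which the abstract compares the two model variants.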

Keywords: Green computing

An Optimized Transfer Learning Model Based Kidney Stone Classification

Computer Systems Science & Engineering, 2023, Vol. 44, No. 2, pp. 1387-1395

Author: S. Devi Mahalakshmi. Affiliation: Department of Computer Science and Engineering, Mepco Schlenk Engineering College, Sivakasi 626005, Tamilnadu, India

The kidney is an important organ of humans to purify the *** healthy function of the kidney is always essential to balance the salt, potassium and pH levels in the ***, the failure of kidneys happens easily to human beings due to their lifestyle, eating habits and diabetes *** prediction of kidney stones is compulsory for timely *** processing-based diagnosis approaches provide a greater success rate than other detection *** this work, proposed a kidney stone classification method based on optimized Transfer Learning (TL). The Deep Convolutional Neural Network (DCNN) models of DenseNet169, MobileNetv2 and GoogleNet applied for classifi*** combined classification results are processed by ensemble learning to increase classification *** hyperparameters of the DCNN model are adjusted by the metaheuristic algorithm of Gorilla Troops Optimizer (GTO). The proposed TL model outperforms in terms of all the parameters compared to other DCNN models.

Keywords: DCNN; GTO; kidney stone; transfer learning

Fuzzy-GEC an Energy-Aware Hybrid Task Scheduling on the Cloud

2nd International Conference on Advances in Data-driven Computing and Intelligent Systems, ADCIS 2023

Authors: Lalitha Devi, K.; Deepa Thilak, K.; Shanmuganathan, C.; Kalaiselvi, K. Affiliations: Department of Computer Science and Engineering, Sathyabama Institute of Science and Technology, Chennai, India; Department of Computer Science and Engineering, SRM Institute of Science and Technology, Kattankulathur, Chennai, India; Department of Computer Science and Engineering, SRM Institute of Science and Technology, Ramapuram, Chennai, India

ISBN: (print) 9789819995172

In an infrastructure cloud environment, task scheduling should focus on optimizing execution time and saving energy. The data center consumes a large amount of energy during the execution of tasks. Energy-saving techniques reduce the amount of energy consumed through proper task scheduling approaches. The existing fuzzy-based hybrid genetic algorithm considers only job length to optimize the makespan. In contrast, this fuzzy genetic encoded chromosome (fuzzy-GEC) algorithm considers both makespan and energy consumption when calculating the fitness value for assigning a task to a virtual machine (VM), taking into account the characteristics of the task and the VM. According to these characteristics, a fuzzy genetic rule-based encoding scheme is developed to schedule the tasks onto the virtual machines so as to minimize makespan and energy consumption. The effectiveness of the new technique is evaluated against FCFS and a conventional genetic scheduling algorithm using the Google Cloud trace workload. The results show the efficiency of the developed approach for makespan and energy consumption. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

Keywords: Virtual machine

Research on Elliptic Curve Crypto System with Bitcoin Curves - SECP256k1, NIST256p, NIST521p and LLL

Journal of Cyber Security and Mobility, 2023, Vol. 12, No. 1, pp. 103-128

Authors: Ulla, Mohammed Mujeer; Sakkari, Deepak S. Affiliation: Department of Computer Science and Engineering, Presidency University, Bangalore, India

Very recent attacks such as LadderLeak have demonstrated the feasibility of recovering a private key through side-channel attacks using just one bit of the secret nonce. ECDSA nonce bias can be exploited in many ways. Some attacks on ECDSA involve complicated Fourier analysis and lattice mathematics. This paper aims to help cryptographers identify efficient ways in which ECDSA can be cracked on the curves NIST256p, SECP256k1 and NIST521p with weak nonces, the kinds of attacks that can crack ECDSA, and how to protect against them. We begin by using an ECDSA signature to sign a message with the private key and validating the generated signature with the shared public key. We then use a nonce, or random value, to randomize the generated signature. Every time we sign, a new verifiable random nonce value is created, and we show the way in which an intruder can discover the private key if the signer leaks any one of the nonce values. Then, using the Lenstra-Lenstra-Lovasz (LLL) method as a black box, we try to attack signatures generated from bad nonces or a bad random number generator (RNG) on the NIST256p and SECP256k1 curves. The analysis is performed by considering all three curves for the implementation of the Elliptic Curve Digital Signature Algorithm (ECDSA). A comparative analysis of computational time for each of the selected curves is carried out for the nonce-leak attack and for the Lenstra-Lenstra-Lovasz method of cracking ECDSA. The average computational costs to break ECDSA with curves NIST256p, NIST521p and SECP256k1 are 0.016, 0.34 and 0.46 respectively, which is almost zero and depicts the strength of the algorithm. The average computational costs to break ECDSA with curves SECP256K1 and NIST256p using LLL are 2.9 and 3.4 respectively. © 2023 River Publishers.
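The leaked-nonce case mentioned in the abstract follows directly from the ECDSA signing equation s = k^(-1)(z + r·d) mod n: once the nonce k of a single signature (r, s) on message hash z is known, the private key is d = (s·k - z)·r^(-1) mod n. The sketch below illustrates this on SECP256k1 using textbook affine arithmetic. It is not the paper's code, it is not constant-time, and it is unsuitable for production; the helper names and the private key, nonce and message are made up for illustration. It needs Python 3.8+ for `pow(x, -1, m)`.

```python
import hashlib

# SECP256k1 domain parameters (curve y^2 = x^3 + 7 over the prime field P).
P = 2**256 - 2**32 - 977
N = 0xFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFEBAAEDCE6AF48A03BBFD25E8CD0364141
G = (0x79BE667EF9DCBBAC55A06295CE870B07029BFCDB2DCE28D959F2815B16F81798,
     0x483ADA7726A3C4655DA4FBFC0E1108A8FD17B448A68554199C47D08FFB10D4B8)

def point_add(a, b):
    """Add two affine points; None represents the point at infinity."""
    if a is None:
        return b
    if b is None:
        return a
    (x1, y1), (x2, y2) = a, b
    if x1 == x2 and (y1 + y2) % P == 0:
        return None
    if a == b:
        lam = 3 * x1 * x1 * pow(2 * y1 % P, -1, P) % P
    else:
        lam = (y2 - y1) * pow((x2 - x1) % P, -1, P) % P
    x3 = (lam * lam - x1 - x2) % P
    return (x3, (lam * (x1 - x3) - y1) % P)

def scalar_mult(k, point):
    """Double-and-add scalar multiplication."""
    result = None
    while k:
        if k & 1:
            result = point_add(result, point)
        point = point_add(point, point)
        k >>= 1
    return result

def sign(z, d, k):
    """ECDSA signature on hash z with private key d and nonce k."""
    r = scalar_mult(k, G)[0] % N
    s = pow(k, -1, N) * (z + r * d) % N
    return r, s

def verify(z, r, s, public_key):
    """Standard ECDSA verification against the public key."""
    w = pow(s, -1, N)
    pt = point_add(scalar_mult(z * w % N, G), scalar_mult(r * w % N, public_key))
    return pt is not None and pt[0] % N == r

def recover_private_key(z, r, s, k):
    """Solve the signing equation for d once the nonce k has leaked."""
    return (s * k - z) * pow(r, -1, N) % N

# Hypothetical values, chosen only for the demonstration.
d = 0x1234567890ABCDEF1234567890ABCDEF
k = 0x0F0E0D0C0B0A09080706050403020100
z = int.from_bytes(hashlib.sha256(b"example message").digest(), "big") % N
r, s = sign(z, d, k)
assert verify(z, r, s, scalar_mult(d, G))
assert recover_private_key(z, r, s, k) == d
```

The lattice (LLL) attack in the abstract addresses the harder setting where only part of each nonce is known or biased; that is not shown here.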

Keywords: Internet of things

From Perception to Action: Building a Robust AI System for Safe and Adaptive Autonomous Driving

15th International Conference on Computing Communication and Networking Technologies, ICCCNT 2024

Authors: Taha, Adnan Sabith; Hossain, Md Rafat; Esha, Sheikh Aysha Khatun; Pantha, Iffat Mahmud; Riya, Aparna Sarker; Uddin, Md. Jamil; Shakur, Md. Abdus. Affiliations: Computer Science and Engineering, Bangladesh; Pace University, Data Science, United States; North South University, Computer Science and Engineering, Bangladesh; Brac University, Computer Science and Engineering, Bangladesh; Eastern University, Electrical and Electronics Engineering, Bangladesh; Electrical and Electronics Engineering, Bangladesh

ISBN: (print) 9798350370249

This study explores the development of a self-driving car using a combination of deep learning (DL), machine learning (ML), computer vision (CV), and convolutional neural networks (CNN). The proposed system aims to simulate human-like decision making in response to external conditions encountered during autonomous driving. The approach involves real-time on-road testing and a self-training mechanism to enable the car to continuously learn and adapt. Furthermore, the text suggests an investigation into the fundamental principles of artificial intelligence (AI) and their role in the autonomous car's functionality. © 2024 IEEE.

Keywords: Control; Perception; Planning; Self-driving car

Role of Artificial Intelligence in Bioinformatics and Information Extraction Systems

Decision Making: Applications in Management and Engineering, 2025, Vol. 8, No. 1, pp. 401-418

Authors: Al-Safarini, Maram Y.; Baashirah, Rania A. Affiliations: Computer Science Department, Zarqa University, Jordan; Software Engineering Department, University of Business & Technology, Saudi Arabia

Artificial Intelligence (AI) is transforming numerous domains, including bioinformatics and information extraction systems, by advancing data processing capabilities, enhancing precision, and facilitating automation. The primary aim of this research is to examine the function of AI within the realms of bioinformatics and information extraction through a combination of quantitative and qualitative approaches. The quantitative component involved a sample of 152 participants, recognised as experts in bioinformatics and data science. The survey findings underscore the prevalent integration of AI in bioinformatics, particularly through the utilisation of reinforcement learning, neural networks, and natural language processing (NLP). Furthermore, AI substantially improves the analysis of biological data; however, it encounters challenges such as limited model interpretability and the lack of data standardisation. Notwithstanding these obstacles, AI-driven innovations in disease prognosis, personalised healthcare, and pharmaceutical development are anticipated to shape the future trajectory of bioinformatics. The qualitative findings, derived through thematic analysis encompassing core themes and sub-themes, reveal that AI significantly contributes to accelerating and refining information extraction processes. Technologies such as machine learning, NLP, and neural networks are instrumental in enhancing data processing efficiency. Nonetheless, issues such as inadequate data quality, elevated computational expenses, and the intricacy of AI models remain persistent. Looking ahead, AI is projected to integrate more effectively with big data infrastructures, enable real-time information extraction, and deliver increasingly tailored solutions. The study concludes with several policy recommendations to guide future implementation strategies. © 2025 Regional Association for Security and Crisis Management. All rights reserved.

Keywords: Data integration
