检索结果-内蒙古大学图书馆

MMInstruct: a high-quality multi-modal instruction tuning dataset with extensive diversity

science China(Information sciences) 2024年第12期67卷 36-51页

作者： Yangzhou LIU Yue CAO Zhangwei GAO Weiyun WANG Zhe CHEN Wenhai WANG Hao TIAN Lewei LU Xizhou ZHU Tong LU Yu QIAO Jifeng DAI School of Computer Science Nanjing University School of Electronic Information and Electrical Engineering Shanghai Jiao Tong University Shanghai AI Laboratory School of Computer Science Fudan University Department of Information Engineering The Chinese University of Hong Kong SenseTime Research Department of Electronic Engineering Tsinghua University

Despite the effectiveness of vision-language supervised fine-tuning in enhancing the performance of vision large language models(VLLMs), existing visual instruction tuning datasets include the following limitations.(1) Instruction annotation quality: despite existing VLLMs exhibiting strong performance,instructions generated by those advanced VLLMs may still suffer from inaccuracies, such as hallucinations.(2) Instructions and image diversity: the limited range of instruction types and the lack of diversity in image data may impact the model's ability to generate diversified and closer to real-world scenarios outputs. To address these challenges, we construct a high-quality, diverse visual instruction tuning dataset MMInstruct,which consists of 973k instructions from 24 domains. There are four instruction types: judgment, multiplechoice, long visual question answering, and short visual question answering. To construct MMInstruct, we propose an instruction generation data engine that leverages GPT-4V, GPT-3.5, and manual correction. Our instruction generation engine enables semi-automatic, low-cost, and multi-domain instruction generation at 1/6 the cost of manual construction. Through extensive experiment validation and ablation experiments,we demonstrate that MMInstruct could significantly improve the performance of VLLMs, e.g., the model fine-tuning on MMInstruct achieves new state-of-the-art performance on 10 out of 12 benchmarks. The code and data shall be available at https://***/yuecao0119/MMInstruct.

关键词： instruction tuning multi-modal multi-domain dataset vision large language model

来源：评论

学校读者我要写书评

暂无评论

Community detection of trajectory data for location-based facility recommendation system

引用

International Journal of Information and Communication Technology 2024年第2期25卷 101-117页

作者： Sabarish, B.A. Karthi, R. Kumar, T. Gireesh Department of Computer Science and Engineering Amrita School of Engineering Amrita Vishwa Vidyapeetham Coimbatore India

Trajectory contains spatial-data generated from traces of moving objects like people, animals, etc. Community generated from trajectories portrays common behaviour. Trajectory clustering based on community-detection involves region-graph generation and community-detection. In region-graph generation, trajectories are projected to spatial grid to transform GPS representation into string representation. Sequential graph is generated from string representation. Edge-based similarity is calculated between trajectories to create an adjacency matrix representing relationship and represent entire region. In community-detection phase, region-graph is divided into communities using various algorithms and validated using modularity values. Based on analysis, Louvain, fast-greedy, leading-eigenvector, and edge-betweenness algorithms provide the optimum modularity value for better community detection. Analysing the community can be used as a pre-processing step in identifying location for location-based services (LBS), including hotspots, delay-tolerant-networks, and mobile antenna placements for better coverage. Design and capacity planning of the network based on the size and pattern of the community improves quality of LBS. Copyright © 2024 Inderscience Enterprises Ltd.

关键词： Quality of service

来源：评论

学校读者我要写书评

暂无评论

Hybrid CGAN-based plant leaf disease classification using OTSU and surf feature extraction

引用

Neural Computing and Applications 2024年第23期36卷 14395-14407页

作者： Saraswathi, E. Banu, J. Faritha Department of Computer Science and Engineering SRM Institute of Science and Technology Ramapuram Chennai600089 India

Agriculture encompasses a way of life and a profession for the general population. Most global traditions and cultures revolve around agriculture. With the help of advanced farming, agriculture may become more profitable, dependable, and able to use resources and time more effectively. The proposed hybrid CGAN using OTSU and SURF (CGAN-OF) model provides a novel framework for plant leaf disease classification. In proposed CGAN-OF model, contrast-limited adaptive histogram equalization is used for image preprocessing and enhancement. The proposed model utilizes OTSU algorithm to speed up the image segmentation without prior knowledge of the images and SURF algorithm to extract the local features using scale-invariant feature transformation. CGAN increases the input plant village dataset using image generation method and identifies the various plant leaf diseases and classifies them. There are three classifications of leaf diseases: fungi, viruses, and bacteria. Furthermore, these classifications contain approximately 300 diseases. The proposed work identified a minimum of 200 diseases in 18,161 major and minor crop species. The investigational investigations are carried out using the Python Jupyter app with the Kaggle Plant Village Dataset and also leaf samples collected from farmers. The proposed framework achieves 99.2% accuracy. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

关键词： Image enhancement

来源：评论

学校读者我要写书评

暂无评论

Computation of Graph Fourier Transform Centrality Using Graph Filter

IEEE Open Journal of Circuits and Systems

引用

IEEE Open Journal of Circuits and Systems 2024年 5卷 69-80页

作者： Tseng, Chien-Cheng Lee, Su-Ling National Kaohsiung University of Science and Technology Department of Computer and Communication Engineering Kaohsiung824005 Taiwan Chang Jung Christian University Department of Computer Science and Information Engineering Tainan711301 Taiwan

In this paper, the computation of graph Fourier transform centrality (GFTC) of complex network using graph filter is presented. For conventional computation method, it needs to use the non-sparse transform matrix of graph Fourier transform (GFT) to compute GFTC scores. To reduce the computational complexity of GFTC, a linear algebra method based on Frobenius norm of error matrix is applied to convert the spectral-domain GFTC computation task to vertex-domain one such that GFTC can be computed by using polynomial graph filtering method. There are two kinds of designs of graph filters to be studied. One is the graph-aware method;the other is the graph-unaware method. The computational complexity comparison and experimental results show that the proposed graph filter method is more computationally efficient than conventional GFT method because the sparsity of Laplacian matrix is used in the implementation structure. Finally, the centrality computations of social network, metro network and sensor network are used to demonstrate the effectiveness of the proposed GFTC computation method using graph filter. © 2020 IEEE.

关键词： Complex networks

来源：评论

学校读者我要写书评

暂无评论

Detecting Low-Yield Machines in Batch Production Systems Based on Observed Defective Pieces

引用

IEEE Transactions on Systems, Man, and Cybernetics: Systems 2024年第7期54卷 3972-3983页

作者： Adipraja, Philip F. E. Chang, Chin-Chun Yang, Hua-Sheng Wang, Wei-Jen Liang, Deron National Central University Department of Computer Science and Information Engineering Taoyuan City32001 Taiwan National Taiwan Ocean University Department of Computer Science and Engineering Keelung City20224 Taiwan

In batch production systems, detecting low-yield machines is essential for minimizing the production of defective pieces, which is a complex problem that currently requires multiple experts, considerable capital, or a combination of both to overcome. To solve this problem, we proposed a cost-efficient and straightforward method that involves using maximum likelihood estimation and bootstrap confidence intervals to estimate per-machine yield;this method enables identification of low-yield machines and generation of a list of these machines. Manufacturing engineers can use the list to perform necessary verification and maintenance processes. Before implementing this method, a manufacturer with 50-500 machines should build a dataset containing approximately 6-20 times as many batches as there are production machines. When this condition is met, the proposed method can be used effectively to detect up to five low-yield machines. © 2013 IEEE.

关键词： Maximum likelihood estimation

来源：评论

学校读者我要写书评

暂无评论

Deep reinforcement learning for online scheduling of photovoltaic systems with battery energy storage systems

引用

Intelligent and Converged Networks 2024年第1期5卷 28-41页

作者： Yaze Li Jingxian Wu Yanjun Pan the Department of Electrical Engineering University of ArkansasFayettevilleAR 72701USA the Department of Computer Science and Computer Engineering University of ArkansasFayettevilleAR 72701USA

A new online scheduling algorithm is proposed for photovoltaic(PV)systems with battery-assisted energy storage systems(BESS).The stochastic nature of renewable energy sources necessitates the employment of BESS to balance energy supplies and demands under uncertain weather *** proposed online scheduling algorithm aims at minimizing the overall energy cost by performing actions such as load shifting and peak shaving through carefully scheduled BESS charging/discharging *** scheduling algorithm is developed by using deep deterministic policy gradient(DDPG),a deep reinforcement learning(DRL)algorithm that can deal with continuous state and action *** of the main contributions of this work is a new DDPG reward function,which is designed based on the unique behaviors of energy *** new reward function can guide the scheduler to learn the appropriate behaviors of load shifting and peak shaving through a balanced process of exploration and *** new scheduling algorithm is tested through case studies using real world data,and the results indicate that it outperforms existing algorithms such as Deep *** online algorithm can efficiently learn the behaviors of optimum non-casual off-line algorithms.

关键词： photovoltaic(PV) battery energy storage system(BESS) Markov decision process(MDP) deep deterministic policy gradient(DDPG)

来源：评论

学校读者我要写书评

暂无评论

Automatic Transportation Mode Classification Using a Deep Reinforcement Learning Approach With Smartphone Sensors

引用

IEEE Access 2024年 12卷 514-533页

作者： Taherinavid, Siavash Moravvej, Seyed Vahid Chen, Yen-Lin Yang, Jing Ku, Chin Soon Yee, Por Lip Iran University of Science and Technology School of Civil Engineering Tehran13114-16846 Iran Isfahan University of Technology Department of Electrical and Computer Engineering Isfahan84156-83111 Iran National Taipei University of Technology Department of Computer Science and Information Engineering Taipei106344 Taiwan Universiti Malaya Faculty of Computer Science and Information Technology Department of Computer System and Technology Kuala Lumpur50603 Malaysia Universiti Tunku Abdul Rahman Department of Computer Science Kampar31900 Malaysia

The increasing dependence on smartphones with advanced sensors has highlighted the imperative of precise transportation mode classification, pivotal for domains like health monitoring and urban planning. This research is motivated by the pressing demand to enhance transportation mode classification, leveraging the potential of smartphone sensors, notably the accelerometer, magnetometer, and gyroscope. In response to this challenge, we present a novel automated classification model rooted in deep reinforcement learning. Our model stands out for its innovative approach of harnessing enhanced features through artificial neural networks (ANNs) and visualizing the classification task as a structured series of decision-making events. Our model adopts an improved differential evolution (DE) algorithm for initializing weights, coupled with a specialized agent-environment relationship. Every correct classification earns the agent a reward, with additional emphasis on the accurate categorization of less frequent modes through a distinct reward strategy. The Upper Confidence Bound (UCB) technique is used for action selection, promoting deep-seated knowledge, and minimizing reliance on chance. A notable innovation in our work is the introduction of a cluster-centric mutation operation within the DE algorithm. This operation strategically identifies optimal clusters in the current DE population and forges potential solutions using a pioneering update mechanism. When assessed on the extensive HTC dataset, which includes 8311 hours of data gathered from 224 participants over two years. Noteworthy results spotlight an accuracy of 0.88±0.03 and an F-measure of 0.87±0.02, underscoring the efficacy of our approach for large-scale transportation mode classification tasks. This work introduces an innovative strategy in the realm of transportation mode classification, emphasizing both precision and reliability, addressing the pressing need for enhanced classification mechanisms in an eve

关键词： Smartphones

来源：评论

学校读者我要写书评

暂无评论

Pentago SnW:An Improved Spray and Wait Protocol for Delay Tolerant Wireless Sensor Networks

引用

China Communications 2025年第3期22卷 104-114页

作者： Idris Afzal Shah Mushtaq Ahmed Raghavendra Singh Department of Computer Science and Engineering Malaviya National Institute of TechnologyJaipur302017India

Delay tolerant wireless sensor networks(DTWSN)is a class of wireless network that finds its deployment in those application scenarios which demand for high packet delivery ratio while maintaining minimal overhead in order to prolong network lifetime;owing to resource-constrained nature of *** fundamental requirement of any network is routing a packet from its source to *** of a routing algorithm depends on the number of network parameters utilized by that routing *** the recent years,various routing protocol has been developed for the delay tolerant networks(DTN).A routing protocol known as spray and wait(SnW)is one of the most widely used routing algorithms for *** this paper,we study the SnW routing protocol and propose a modified version of it referred to as Pentago SnW which is based on pentagonal number *** to binary SnW shows promising results through simulation using real-life scenarios of cars and pedestrians randomly moving on a map.

关键词： binary SnW delay tolerant network Pentago SnW spray and wait routing

来源：评论

学校读者我要写书评

暂无评论

Proactive Crowdsourced Monitoring and Sensing With Expansible Activity Recognition Based on Internet of Things Localization

引用

IEEE Internet of Things Journal 2025年第11期12卷 17674-17686页

作者： Chen, Lien-Wu Liao, Chun-Wei Liu, Jun-Xian Feng Chia University Department of Information Engineering and Computer Science Taichung407 Taiwan

This article proposes a proactive crowdsourced monitoring and sensing (PCMS) framework with the designed Smart iBeacon device to accurately recognize the activities of an equipped target, exclusively customize the recognition model of a specific target, and actively trigger cooperative tracking of nearby smartphones for an abnormal target based on Internet of Things (IoT) localization. According to our review of relevant research, PCMS is the first framework that provides the following features: 1) coarse-grained and fine-grained features can be extracted to accurately recognize target activities through densely connected convolutional networks with improvement design;2) crowdsourced monitoring and sensing can be actively triggered for a target as the abnormal activity of the target is detected;and 3) deep learning model of target activity recognition can be exclusively customized for a specific target to improve the recognition accuracy based on the dedicated activity data of the target. An Android-based prototype with stationary iBeacon nodes and the Smart iBeacon is implemented to verify the feasibility and superiority of our PCMS framework. Experimental results show that our framework outperforms existing methods and can accurately recognize target activities for abnormal event detection and proactive crowdsourced tracking in a real-time manner. © 2014 IEEE.

关键词： Smartphones

来源：评论

学校读者我要写书评

暂无评论

A Secure Framework for WSN-IoT Using Deep Learning for Enhanced Intrusion Detection

引用

computers, Materials & Continua 2024年第10期81卷 471-501页

作者： Chandraumakantham Om Kumar Sudhakaran Gajendran Suguna Marappan Mohammed Zakariah Abdulaziz S.Almazyad School of Computer Science&Engineering Vellore Institute of TechnologyChennai CampusChennai600127India School of Electronics&Engineering Vellore Institute of TechnologyChennai CampusChennai600127India Department of Computer Sciences and Engineering College of Applied ScienceKing Saud UniversityRiyadh11543Saudi Arabia Department of Computer Engineering College of Computer and Information SciencesKing Saud UniversityRiyadh11543Saudi Arabia

The security of the wireless sensor network-Internet of Things(WSN-IoT)network is more challenging due to its randomness and self-organized *** detection is one of the key methodologies utilized to ensure the security of the *** intrusion detection mechanisms have issues such as higher misclassification rates,increased model complexity,insignificant feature extraction,increased training time,increased run time complexity,computation overhead,failure to identify new attacks,increased energy consumption,and a variety of other factors that limit the performance of the intrusion system *** this research a security framework for WSN-IoT,through a deep learning technique is introduced using Modified Fuzzy-Adaptive DenseNet(MF_AdaDenseNet)and is benchmarked with datasets like NSL-KDD,UNSWNB15,CIDDS-001,Edge IIoT,Bot *** this,the optimal feature selection using Capturing Dingo Optimization(CDO)is devised to acquire relevant features by removing redundant *** proposed MF_AdaDenseNet intrusion detection model offers significant benefits by utilizing optimal feature selection with the CDO *** results in enhanced Detection Capacity with minimal computation complexity,as well as a reduction in False Alarm Rate(FAR)due to the consideration of classification error in the fitness *** a result,the combined CDO-based feature selection and MF_AdaDenseNet intrusion detection mechanism outperform other state-of-the-art techniques,achieving maximal Detection Capacity,precision,recall,and F-Measure of 99.46%,99.54%,99.91%,and 99.68%,respectively,along with minimal FAR and Mean Absolute Error(MAE)of 0.9%and 0.11.

关键词： Deep learning intrusion detection fuzzy rules feature selection false alarm rate accuracy wireless sensor networks

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：