检索结果-内蒙古大学图书馆

TechRxiv 2023年

作者： Du, Hongyang Li, Zonghang Niyato, Dusit Kang, Jiawen Xiong, Zehui Huang, Huawei Mao, Shiwen School of Computer Science and Engineering The Energy Research Institute @ NTU Interdisciplinary Graduate Program Nanyang Technological University Singapore School of Information and Communication Engineering University of Electronic Sciences and Technology of China Chengdu China School of Automation Guangdong University of Technology China Pillar of Information Systems Technology and Design Singapore University of Technology and Design Singapore School of Software Engineering Sun Yat-Sen University Zhuhai China Department of Electrical and Computer Engineering Auburn University Auburn United States

As Metaverse emerges as the next-generation Internet paradigm, the ability to efficiently generate content is paramount. AI-Generated Content (AIGC) emerges as a key solution, yet the resource-intensive nature of large Generative AI (GAI) models presents challenges. To address this issue, we introduce an AIGC-as-a-Service (AaaS) architecture, which deploys AIGC models in wireless edge networks to ensure broad AIGC services accessibility for Metaverse users. Nonetheless, an important aspect of providing personalized user experiences requires carefully selecting AIGC Service Providers (ASPs) capable of effectively executing user tasks, which is complicated by environmental uncertainty and variability. Addressing this gap in current research, we introduce the AI-Generated Optimal Decision (AGOD) algorithm, a diffusion model-based approach for generating the optimal ASP selection decisions. Integrating AGOD with Deep Reinforcement Learning (DRL), we develop the Deep Diffusion Soft Actor-Critic (D2SAC) algorithm, enhancing the efficiency and effectiveness of ASP selection. Our comprehensive experiments demonstrate that D2SAC outperforms seven leading DRL algorithms. Furthermore, the proposed AGOD algorithm has the potential for extension to various optimization problems in wireless networks, positioning it as a promising approach for future research on AIGC-driven services. The implementation of our proposed method is available at: https://***/Lizonghang/AGOD. © 2023, CC BY.

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

AgriField3D: A Curated 3D Point Cloud and Procedural Model Dataset of Field-Grown Maize from a Diversity Panel

arXiv

引用

arXiv 2025年

作者： Kimara, Elvis Hadadi, Mozhgan Godbersen, Jackson Balu, Aditya Jubery, Talukder Li, Yawei Krishnamurthy, Adarsh Schnable, Patrick S. Ganapathysubramanian, Baskar Department of Computer Science Iowa State University Ames United States Translational AI Research and Education Center Ames United States Department of Mechanical Engineering Iowa State University Ames United States Plant Science Institute Iowa State University Ames United States Interdepartmental Genetics and Genomics Graduate Program Iowa State University Ames United States Department of Agronomy Iowa State University Ames United States

The application of artificial intelligence (AI) in three-dimensional (3D) agricultural research, particularly for maize, has been limited by the scarcity of large-scale, diverse datasets. While 2D image datasets are abundant, they fail to capture essential structural details such as leaf architecture, plant volume, and spatial arrangements that 3D data provide. To address this limitation, we present AgriField3D (website), a curated dataset of 3D point clouds of field-grown maize plants from a diverse genetic panel, designed to be AI-ready for advancing agricultural research. Our dataset comprises over 1,000 high-quality point clouds collected using a Terrestrial Laser Scanner, complemented by procedural models that provide structured, parametric representations of maize plants. These procedural models, generated using Non-Uniform Rational B-Splines (NURBS) and optimized via a two-step process combining Particle Swarm Optimization (PSO) and differentiable programming, enable precise, scalable reconstructions of leaf surfaces and plant architectures. To enhance usability, we performed graph-based segmentation to isolate individual leaves and stalks, ensuring consistent labeling across all samples. We also conducted rigorous manual quality control on all datasets, correcting errors in segmentation, ensuring accurate leaf ordering, and validating metadata annotations. The dataset further includes metadata detailing plant morphology and quality, alongside multi-resolution subsampled versions (100k, 50k, 10k points) optimized for various computational needs. By integrating point cloud data of field grown plants with high-fidelity procedural models and ensuring meticulous manual validation, AgriField3D provides a comprehensive foundation for AI-driven phenotyping, plant structural analysis, and 3D applications in agricultural research. © 2025, CC BY-NC-SA.

关键词：

来源：评论

学校读者我要写书评

暂无评论

A BCLC Staging System for Hepatocellular Carcinoma using Swin Transformer and CT Imaging

A BCLC Staging System for Hepatocellular Carcinoma using Swi...

引用

Annual International Conference of the IEEE engineering in Medicine and Biology Society (EMBC)

作者： Shun-Cheng Chang Pochuang Wang Weichung Wang Tung-Hung Su Jia-Horng Kao Che Lin Graduate Institute of Communication Engineering National Taiwan University (NTU) Taipei Taiwan Department of Computer Science and Information Engineering NTU Taipei Taiwan NTU Institute of Applied Mathematical Sciences Taipei Taiwan Department of Internal Medicine Division of Gastroenterology and Hepatology National Taiwan University Hospital Taipei Taiwan Department of Electrical Engineering NTU Taipei Taiwan NTU Center for Advanced Computing and Imaging in Biomedicine Taipei Taiwan NTU Smart Medicine and Health Informatics Program Taipei Taiwan

ISBN: (数字)9798350371499

ISBN: (纸本)9798350371505

The Barcelona Clinic Liver Cancer (BCLC) staging system plays a crucial role in clinical planning, offering valuable insights for effectively managing hepatocellular carcinoma. Accurate prediction of BCLC stages can significantly ease the workload on radiologists. However, few datasets are explicitly designed for discerning BCLC stages. Despite the common practice of appending BCLC labels to clinical data within datasets, the inherent imbalance in BCLC distribution is further amplified by the diverse purposes for which datasets are curated. In this study, we aim to develop a BCLC staging system using the advanced Swin Transformer model. Additionally, we explore the integration of two datasets, each originally intended for separate objectives, highlighting the critical challenge of preserving class distribution in practical study designs. This exploration is pivotal for ensuring the applicability of our developed staging system in the designed clinical settings. Our resulting BCLC staging system demonstrates an accuracy of 55.81% (±7.8%), contributing to advancing medical image-based research for predicting BCLC stages.

关键词： Liver cancer Accuracy Computed tomography Biological system modeling Transformers Planning Biomedical imaging

来源：评论

学校读者我要写书评

暂无评论

ReViT: A Hybrid Approach for BCLC Staging of Hepatocellular Carcinoma Using 3D CT with Multiple Instance Learning

ReViT: A Hybrid Approach for BCLC Staging of Hepatocellular ...

引用

IEEE EMBS International Conference on Information Technology Applications in Biomedicine (ITAB)

作者： Shun-Cheng Chang Hsin-Pei Yu Yi-Hsien Hsieh Pochuang Wang Weichung Wang Tung-Hung Su Jia-Horng Kao Che Lin Graduate Institute of Communication Engineering National Taiwan University (NTU) Taipei Taiwan Department of Computer Science and Information Engineering NTU Taipei Taiwan Institute of Applied Mathematical Sciences NTU Taipei Taiwan Department of Internal Medicine Division of Gastroenterology and Hepatology National Taiwan University Hospital Taipei Taiwan Department of Electrical Engineering NTU Taipei Taiwan Center for Advanced Computing and Imaging in Biomedicine NTU Taipei Taiwan Smart Medicine and Health Informatics Program NTU Taipei Taiwan

ISBN: (数字)9798350351552

ISBN: (纸本)9798350351569

Deep learning has revolutionized medical imaging, offering advanced methods for accurate diagnosis and treatment planning. The BCLC staging system is crucial for staging Hepatocellular Carcinoma (HCC), a high-mortality cancer. An automated BCLC staging system could significantly enhance diagnosis and treatment planning efficiency. However, we found that BCLC staging, which is directly related to the size and number of liver tumors, aligns well with the principles of the Multiple Instance Learning (MIL) framework. To effectively achieve this, we proposed a new preprocessing technique called Masked Cropping and Padding(MCP), which addresses the variability in liver volumes and ensures consistent input sizes. This technique preserves the structural integrity of the liver, facilitating more effective learning. Furthermore, we introduced Re ViT, a novel hybrid model that integrates the local feature extraction capabilities of Convolutional Neural Networks (CNNs) with the global context modeling of Vision Transformers (ViTs). Re ViT leverages the strengths of both architectures within the MIL framework, enabling a robust and accurate approach for BCLC staging. We will further explore the trade-off between performance and interpretability by employing TopK Pooling strategies, as our model focuses on the most informative instances within each bag.

关键词： Visualization Accuracy Three-dimensional displays Liver Transformers Planning Convolutional neural networks Context modeling Biomedical imaging Tumors

来源：评论

学校读者我要写书评

暂无评论

Diffusion-based Reinforcement Learning for Edge-enabled AI-Generated Content Services

arXiv

引用

arXiv 2023年

作者： Du, Hongyang Li, Zonghang Niyato, Dusit Kang, Jiawen Xiong, Zehui Huang, Huawei Mao, Shiwen The School of Computer Science and Engineering The Energy Research Institute @ NTU Interdisciplinary Graduate Program Nanyang Technological University Singapore The School of Information and Communication Engineering University of Electronic Sciences and Technology of China Chengdu China The School of Automation Guangdong University of Technology China The Pillar of Information Systems Technology and Design Singapore University of Technology and Design Singapore The School of Software Engineering Sun Yat-Sen University Zhuhai China The Department of Electrical and Computer Engineering Auburn University Auburn United States

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

Video-Text as Game Players: Hierarchical Banzhaf Interaction...

引用

Conference on computer Vision and Pattern Recognition (CVPR)

作者： Peng Jin Jinfa Huang Pengfei Xiong Shangxuan Tian Chang Liu Xiangyang Ji Li Yuan Jie Chen School of Electronic and Computer Engineering Peking University Shenzhen China AI for Science (AI4S)-Preferred Program Peking University Shenzhen Graduate School Shenzhen China Shopee Shenzhen China Department of Automation and BNRist Tsinghua University Beijing China Peng Cheng Laboratory Shenzhen China

Contrastive learning-based video-language representation learning approaches, e.g., CLIP, have achieved outstanding performance, which pursue semantic interaction upon pre-defined video-text pairs. To clarify this coarse-grained global interaction and move a step further, we have to encounter challenging shell-breaking interactions for fine-grained cross-modal learning. In this paper, we creatively model video-text as game players with multivariate cooperative game theory to wisely handle the uncertainty during fine-grained semantic interaction with diverse granularity, flexible combination, and vague intensity. Concretely, we propose Hierarchical Banzhaf Interaction (HBI) to value possible correspondence between video frames and text words for sensitive and explainable cross-modal contrast. To efficiently realize the cooperative game of multiple video frames and multiple text words, the proposed method clusters the original video frames (text words) and computes the Banzhaf Interaction between the merged tokens. By stacking token merge modules, we achieve cooperative games at different semantic levels. Extensive experiments on commonly used text-video retrieval and video-question answering bench-marks with superior performances justify the efficacy of our HBI. More encouragingly, it can also serve as a visualization tool to promote the understanding of cross-modal interaction, which have a far-reaching impact on the community. Project page is available at https://***/HBI/.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Immersive Modeling Framework for Training Applications

Immersive Modeling Framework for Training Applications

引用

International Conference on Advanced Learning Technologies (ICALT)

作者： Vinicius Chrisosthemos Teixeira Carlo Smaniotto Mantovani Alexandre Cardoso Carlos Alexandre dos Santos Márcio Sarroglia Pinho Computer Science Graduate Program School of Technology Pontifical Catholic University of Rio Grande do Sul Porto Alegre Brazil Faculty of Electrical Engineering Federal University of Uberlândia Uberlândia Brazil School of Technology Pontifical Catholic University of Rio Grande do Sul Porto Alegre Brazil

This article describes a framework for modeling and executing training in augmented and virtual reality environments. The framework was designed based on characteristics observed in existing training applications for complex tasks, such as the use of tools, control panels and the need for step-by-step instructions. Unlike other frameworks, in the proposed system, it is possible to create the training entirely within the virtual/augmented environment, avoiding constant switching between 2D and 3D environments. The framework allows for the definition of steps in a training program, each of which includes textual instructions, videos, and 3D objects, static or animated, anchored in the real world. To demonstrate the capabilities of the framework, a training program for operating a Universal Testing Machine was created as a case study. Overall, the proposed framework allows for the creation of effective and efficient AR training programs for a variety of tasks and industries.

关键词：

来源：评论

学校读者我要写书评

暂无评论

ACSeg: Adaptive Conceptualization for Unsupervised Semantic Segmentation

ACSeg: Adaptive Conceptualization for Unsupervised Semantic ...

引用

Conference on computer Vision and Pattern Recognition (CVPR)

作者： Kehan Li Zhennan Wang Zesen Cheng Runyi Yu Yian Zhao Guoli Song Chang Liu Li Yuan Jie Chen School of Electronic and Computer Engineering Peking University Shenzhen China AI for Science (AI4S)-Preferred Program Peking University Shenzhen Graduate School Shenzhen China Peng Cheng Laboratory Shenzhen China Dalian University of Technology Department of Automation and BNRist Tsinghua University Beijing China

Recently, self-supervised large-scale visual pre-training models have shown great promise in representing pixel-level semantic relationships, significantly promoting the development of unsupervised dense prediction tasks, e.g., unsupervised semantic segmentation (USS). The extracted relationship among pixel-level representations typically contains rich class-aware information that semantically identical pixel embeddings in the representation space gather together to form sophisticated concepts. However, leveraging the learned models to ascertain semantically consistent pixel groups or regions in the image is non-trivial since over/ under-clustering overwhelms the conceptualization procedure under various semantic distributions of different images. In this work, we investigate the pixel-level semantic aggregation in self-supervised ViT pre-trained models as image Segmentation and propose the Adaptive Conceptualization approach for USS, termed ACSeg. Concretely, we explicitly encode concepts into learnable prototypes and design the Adaptive Concept Generator (ACG), which adaptively maps these prototypes to informative concepts for each image. Meanwhile, considering the scene complexity of different images, we propose the modularity loss to optimize ACG independent of the concept number based on estimating the intensity of pixel pairs belonging to the same concept. Finally, we turn the USS task into classifying the discovered concepts in an unsupervised manner. Extensive experiments with state-of-the-art results demonstrate the effectiveness of the proposed ACSeg.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Explainable machine learning models for estimating daily dissolved oxygen concentration of the Tualatin River

引用

engineering Applications of Computational Fluid Mechanics 2024年第1期18卷

作者： Li, Shuguang Qasem, Sultan Noman Band, Shahab S. Ameri, Rasoul Pai, Hao-Ting Mehdizadeh, Saeid School of Computer Science and Technology Shandong Technology and Business University Yantai China Computer Science Department College of Computer and Information Sciences Imam Mohammad Ibn Saud Islamic University (IMSIU) Riyadh Saudi Arabia Computer Science Department Faculty of Applied Science Taiz University Taiz Yemen Future Technology Research Center National Yunlin University of Science and Technology Douliu Taiwan Department of Information Management International Graduate School of Artificial Intelligence National Yunlin University of Science and Technology Douliu Taiwan Department of Information Management National Yunlin University of Science and Technology Douliu Taiwan Bachelor Program of Big Data Applications in Business National Pingtung University Pingtung Taiwan Water Engineering Department Urmia University Urmia Iran

Monitoring the quality of river water is of fundamental importance and needs to be taken into consideration when it comes to the research into the hydrological field. In this context, the concentration of the dissolved oxygen (DO) is one of the most significant indicators of the quality of river water. The current study aimed to estimate the minimum, maximum, and mean DO concentrations (DO min, DO max, DO mean) at a gauging station located on Tualatin River, United States. To that end, four machine learning models, such as support vector regression (SVR), multi-layer perceptron (MLP), random forest (RF), and gradient boosting (GB) were established. Root mean square error (RMSE), mean absolute error (MAE), coefficient of correlation (R), and Nash-Sutcliffe efficiency (NSE) metrics were employed to better assess the accuracies of these models. The modeling results demonstrated that the SVR and MLP surpassed the RF and GB models. Despite this, the SVR was concluded to be the best-performing method when used to estimate the DO min, DO max, and DO mean. The best error statistics in the testing phase were related to the SVR model with full (four) inputs to estimate DO mean concentration (RMSE = 0.663 mg/l, MAE = 0.508 mg/l, R = 0.945, NSE = 0.875). Finally, the explainability of the superior models (i.e. SVR models) was conducted using SHapley Additive exPlanations (SHAP) for the first time to estimate DO concentration. In fact, evaluating the explainability of machine learning models can provide useful information about the impact of each of the input estimators used in the procedure of models development. It was concluded that the specific conductance (SC) and followed by water temperature (WT) could provide the most contributions for estimating the DO min, DO max, and DO mean concentrations. © 2024 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.

关键词： dissolved oxygen concentration estimation Explainable machine learning SHapley additive explanations

来源：评论

学校读者我要写书评

暂无评论

E-payment for Jakarta Smart Public Transportation, Using the Point System for E-Commerce 2

E-payment for Jakarta Smart Public Transportation, Using the...

引用

2nd International Conference on computer, science, engineering, and Technology, ICComSET 2019

作者： Anwar, Nizirwan Rasjidin, Roesfiansjah Najoan, Daniel Stephanus Rolando, Christopher Tamimmanar Warnars, Harco Leslie Hendric Spits Faculty of Computer Science Esa Unggul University Jakarta11510 Indonesia Faculty of Engineering Esa Unggul University Jakarta11510 Indonesia Computer Science Department School of Computer Science Bina Nusantara University Jakarta11480 Indonesia Computer Science Department BINUS Graduate Program - Doctor of Computer Science Bina Nusantara University Jakarta11480 Indonesia

The ease of using transportation is one of the most critical things in the city with a significant population like Jakarta. The growth of the population in Jakarta is increased rapidly. The wage that Many transportations are causing a traffic jam in Jakarta. The government suggests that people use public transportation for their mobility. However, people choose to use their vehicles rather than public transportation. The main reason is that public transportation cannot guarantee the arriving time, whether it is on time or not. Many people move on to online transportation services. However, the massive growth of online transportation is still a contradiction as public transportation. People in Jakarta need faster mobility to go. This research is trying to make a system for public transport without any delay and hard to use it. This problem can be solved by building a new system. This system required e-wallet for payment in public transportation. This E-Wallet will search the possible route from the nearest location to the destination - only the public transport where there is a station that can use it. By using QR-Code generated by e-wallet on the mobile phone, people can scan it directly to a machine located in the station. This method will make a transaction faster than ever. Moreover, people can enjoy another feature like cashback and redeem the prize by a point system. In this era, e-commerce is dominating the market for purchasing something. This is also the attraction of the system. This system is also required excellent facilities from the government so that people can enjoy it. © Published under licence by IOP Publishing Ltd.

关键词： Traffic congestion

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：