Search results - Inner Mongolia University Library

CLIP-Flow: Decoding images encoded in CLIP space

Computational Visual Media, 2024, Vol. 10, No. 6, pp. 1157-1168

Authors: Hao Ma, Ming Li, Jingyuan Yang, Or Patashnik, Dani Lischinski, Daniel Cohen-Or, Hui Huang. Affiliations: Visual Computing Research Center, College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China; Department of Computer Science, Tel Aviv University, Tel Aviv 6997801, Israel; School of Computer Science and Engineering, the Hebrew University of Jerusalem, Jerusalem 91904, Israel

This study introduces CLIP-Flow, a novel network for generating images from a given image or *** effectively utilize the rich semantics contained in both modalities, we designed a semantics-guided methodology for image- and text-to-image *** particular, we adopted Contrastive Language-Image Pretraining (CLIP) as an encoder to extract semantics and StyleGAN as a decoder to generate images from such ***, to bridge the embedding space of CLIP and latent space of StyleGAN, real NVP is employed and modified with activation normalization and invertible *** the images and text in CLIP share the same representation space, text prompts can be fed directly into CLIP-Flow to achieve text-to-image *** conducted extensive experiments on several datasets to validate the effectiveness of the proposed image-to-image synthesis *** addition, we tested on the public dataset Multi-Modal CelebA-HQ, for text-to-image *** validated that our approach can generate high-quality text-matching images, and is comparable with state-of-the-art methods, both qualitatively and quantitatively.

Keywords: image-to-image; text-to-image; contrastive language-image pretraining (CLIP); flow; StyleGAN

An infrastructure software perspective toward computation offloading between executable specifications and foundation models

Science China (Information Sciences), 2025, Vol. 68, No. 4, pp. 380-382

Authors: Dezhi RAN, Mengzhou WU, Yuan CAO, Assaf MARRON, David HAREL, Tao XIE. Affiliations: Key Laboratory of High Confidence Software Technologies (PKU), Ministry of Education; School of Computer Science, Peking University; School of Electronics Engineering and Computer Science, Peking University; Department of Computer Science and Applied Mathematics, Weizmann Institute of Science

Foundation models (FMs) [1] have revolutionized software development and become the core components of large software systems. This paradigm shift, however, demands a fundamental re-imagining of software engineering theories and methodologies [2]. Instead of replacing existing software modules implemented by symbolic logic, incorporating FMs' capabilities to build software systems requires entirely new modules that leverage the unique capabilities of ***, while FMs excel at handling uncertainty, recognizing patterns, and processing unstructured data, we need new engineering theories that support the paradigm shift from explicitly programming and maintaining user-defined symbolic logic to creating rich, expressive requirements that FMs can accurately perceive and implement.

A Common Declarative Language for UML State Machine Representation, Model Transformation, and Interoperability of Visualization Tools

1st International Symposium on Software Fault Prevention, Verification, and Validation, SFPVV 2024

Authors: Jannatpour, Ali; Constantinides, Constantinos. Affiliation: Department of Computer Science and Software Engineering, Concordia University, Montreal, Canada

ISBN: (print) 9789819616206

Originally presented in previous work to capture the set of fundamental elements of the UML state machine specification, Common Declarative Language (CDL) provides a model that can aid in the validation and verification of requirements. In this paper we target two objectives: First, we extend CDL by addressing one of the advanced concepts of the UML state machine specification, namely the notion of orthogonality which allows complex machine behavior through parallel state configurations. Second, we complement previous work by focusing on how CDL can serve as a platform for the representation of a state machine, how the language can be deployed for a model transformation where the initial machine (containing composite and/or orthogonal states) can be flattened into a model whose formal definition we provide, and finally how the CDL can be deployed to support interoperability among text-to-UML drawing tools [11]. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords: Specification languages

Caching Strategies in NDN Based Wireless Ad Hoc Network: A Survey

Computers, Materials & Continua, 2024, Vol. 80, No. 7, pp. 61-103

Authors: Ahmed Khalid, Rana Asif Rehman, Byung-Seo Kim. Affiliations: Department of Computer Science, FAST School of Computing, National University of Computer and Emerging Sciences, Lahore 54000, Pakistan; Department of Software and Communications Engineering, Hongik University, Sejong City 30016, Republic of Korea

Wireless Ad Hoc Networks consist of devices that are wirelessly *** Ad Hoc Networks (MANETs), Internet of Things (IoT), and Vehicular Ad Hoc Networks (VANETs) are the main domains of wireless ad hoc *** is used in wireless ad hoc *** is based on Transmission Control Protocol (TCP)/Internet Protocol (IP) network where clients and servers interact with each other with the help of IP in a pre-defined *** fetches data from a fixed *** redundancy, mobility, and location dependency are the main issues of the IP network *** these factors result in poor performance of wireless ad hoc *** main disadvantage of IP is that it does not provide in-network ***, there is a need to move towards a new network that overcomes these *** Data Network (NDN) is a network that overcomes these *** is a project of Information-centric Network (ICN). NDN provides in-network caching which helps in fast response to user *** NDN in wireless ad hoc network provides many benefits such as caching, mobility, scalability, security, and *** considering the certainty, in this survey paper, we present a comprehensive survey on Caching Strategies in NDN-based Wireless *** caching mechanism-based results are also *** the last, we also shed light on the challenges and future directions of this promising field to provide a clear understanding of what caching-related problems exist in NDN-based wireless ad hoc networks.

Keywords: Content centric network; Internet of Things; mobile ad hoc network; named data network; vehicular ad hoc network

Effects of Fast Charging of EV Batteries at Low Temperatures Based on Temporary Lithium Plating and Temperature Gradients

2024 IEEE Energy Conversion Congress and Exposition, ECCE 2024

Authors: Chetri, Chandan; Williamson, Sheldon. Affiliation: Group Department of Electrical, Computer and Software Engineering, Oshawa, Canada

ISBN: (print) 9798350376067

The study investigates battery degradation under high C-rates and subzero temperatures, analyzing temperature gradients (ΔT/Δt) and differential temperature rises (ΔT) on a 21700 lithium nickel cobalt aluminum oxide (NCA) battery. The findings identify maximum ΔT at charging rates of 2.5C and 3C, reaching 24.6°C and 29.96°C, respectively, at an ambient temperature of -10°C. At -15°C, a charging rate of 2C resulted in a ΔT of 20.9°C. Moreover, the voltage dip during fast charging at subzero temperatures increases with lower ambient temperatures and higher C-rates. Comparative analysis underscores the anode as the most heated part, exhibiting steeper ΔT/Δt curves. Although the battery management system (BMS) may regulate the total temperature rise within safe operating limits, ΔT/Δt may often cause accelerated battery degradation and thermal runaway, especially during dynamic charging/discharging conditions. Therefore, it is imperative to monitor battery ΔT/Δt, rather than only ΔT, to reduce accelerated degradation and ensure thermal safety. © 2024 IEEE.

Keywords: State of charge

On learning the right attention point for feature enhancement

Science China (Information Sciences), 2023, Vol. 66, No. 1, pp. 131-143

Authors: Liqiang LIN, Pengdi HUANG, Chi-Wing FU, Kai XU, Hao ZHANG, Hui HUANG. Affiliations: College of Computer Science and Software Engineering, Shenzhen University; Department of Computer Science and Engineering, The Chinese University of Hong Kong; School of Computer Science, National University of Defense Technology; School of Computing Science, Simon Fraser University

We present a novel attention-based mechanism to learn enhanced point features for point cloud processing tasks, e.g., classification and segmentation. Unlike prior studies, which were trained to optimize the weights of a pre-selected set of attention points, our approach learns to locate the best attention points to maximize the performance of a specific task, e.g., point cloud classification. Importantly, we advocate the use of a single attention point to facilitate semantic understanding in point feature learning. Specifically, we formulate a new and simple convolution, which combines convolutional features from an input point and its corresponding learned attention point (LAP). Our attention mechanism can be easily incorporated into state-of-the-art point cloud classification and segmentation networks. Extensive experiments on common benchmarks, such as ModelNet40, ShapeNetPart, and S3DIS, all demonstrate that our LAP-enabled networks consistently outperform the respective original networks, as well as other competitive alternatives, which employ multiple attention points, either pre-selected or learned under our LAP framework.

Keywords: point convolution; feature enhancement; attention point; deep neural network

Identifying patients in need of psychological treatment with language representation models

Multimedia Tools and Applications, 2025, Vol. 84, No. 1, pp. 397-418

Authors: Aygün, İrfan; Kaya, Buket; Kaya, Mehmet. Affiliations: Department of Software Engineering, Celal Bayar University, Manisa, Turkey; Department of Electronics and Automation, Fırat University, Elazig, Turkey; Department of Computer Engineering, Fırat University, Elazig, Turkey

Early diagnosis of psychological disorders is very important for patients to regain their health. Research shows that many patients do not realize that they have a psychological disorder or apply to different departments for treatment. The detection of hidden psychological disorders in patients will both increase the quality of life of patients and reduce the traffic of patients who apply to the wrong department. This study aimed to determine whether patients who consult a physician for any reason need psychological treatment. For this purpose, the relationships and similarities between the sentences of previous psychiatric patients and the sentences of newly arrived patients were analyzed. An ELECTRA language model trained on domain data was used to detect semantic similarities between sentences. In the study, the dialogues of patients with physicians in 92 different specialties were analyzed using the MedDialog dataset, which consists of online physician applications, and the DAIC-WOZ dataset. As a result of the experiments, 90.49% success was achieved for the MedDialog dataset and 89.36% for the DAIC-WOZ dataset. With the proposed model, patients in need of psychological treatment were identified and the medical departments where psychological problems were revealed the most were determined. These divisions are Neurology, Sexology, Cardiology, and Plastic Surgery, respectively. With the findings obtained, complications caused by psychological problems and types of diseases that are precursors to psychological disorders were determined. To the best of our knowledge, this article is the first study that aims to analyze all psychological illnesses instead of focusing on any one of the psychological problems (depression, OCD, schizophrenia, etc.) and that is validated by electronic health records. © The Author(s) 2024.

Keywords: Diseases

A Motion Capture Quality Comparison Between Rokoko and Kinect

6th IEEE International Conference on Artificial Intelligence in Engineering and Technology, IICAIET 2024

Author: Li, Monica M.Q. Affiliation: Polytechnique Montreal, Department of Computer and Software Engineering, Montreal, Canada

ISBN: (print) 9798350389692

In human-machine interaction tasks, the quality of motion capture plays a critical role. Rokoko Motion Capture System (Rokoko) is a relatively economical motion capture device and has been utilized in various areas of motion-related research. In this study, we test the representative ability of the products captured by Rokoko and Microsoft Kinect v2 (Kinect). Three non-professional actors wore the Rokoko to perform three kinds of activities, each displaying different body functions: walking with normal velocity, jumping with vertical acceleration, and rotation. Motion data were recorded using these two devices simultaneously. We compared the pros and cons of the motions captured by these two devices and provide suggestions and precautions for researchers using the Rokoko Smartsuit Pro for motion capture activities. © 2024 IEEE.

Keywords: Human robot interaction

TES-CVIDS: A Transmission Efficient Sub-Map Based Collaborative Dense VI-SLAM Framework

IEEE Transactions on Intelligent Vehicles, 2024, pp. 1-14

Authors: Zhang, Tianjun; Zhang, Lin; Zhang, Fengyi; Zhao, Shengjie; Zhou, Yicong. Affiliations: School of Software Engineering, Tongji University, Shanghai, China; Department of Computer and Information Science, University of Macau, Macau, China

In recent years, how to achieve stable localization and construct high-quality dense maps in large-scale scenes has become a research highlight. In large-scale scenes, for the sake of mapping accuracy and efficiency, multi-agent systems rather than single-agent ones are usually employed. Currently, as far as we know, collaborative VI-SLAM (Visual-Inertial Simultaneous Localization And Mapping) systems applicable to multi-agent systems are still scarce, and systems that achieve a good balance among localization accuracy, mapping density, and transmission efficiency are still lacking. In this paper, we propose a novel centralized collaborative VI-SLAM framework, namely TES-CVIDS (Transmission Efficient Sub-map based Collaborative Visual-Inertial Dense SLAM). In TES-CVIDS, instead of the original RGBD images, the compact sub-maps are transmitted, effectively reducing the transmission data redundancy. After that, the server completes key-frame processing, hierarchical pose-graph optimization, and global dense map construction in three separate threads. Besides, thanks to our depth search mechanism, the geometry information of all key-frames can be recovered on the server end. Thus, sub-maps can be regenerated after the global pose-graph optimization to maintain the consistency between the localization and the mapping. Both the qualitative and the quantitative experimental results corroborate the superior performance of our TES-CVIDS. To make our results reproducible, the source code has been released at https://***/TES-CVIDS-MainPage/. IEEE

Keywords: Mapping

Consumer grade VR gloves based on hall effect sensors in an angular flexion arrangement

IEEE AITU: 1st International Student Conference: "Digital Generation", AITU 2024

Authors: Yessentayev, Timur; Mukanova, Zhanna. Affiliation: Department of Computer and Software Engineering, Turan University, Almaty, Kazakhstan

ISBN: (print) 9798350364378

VR gloves can greatly enhance the realism of the VR experience by allowing users to not only see and hear the virtual environment, but also touch it without having to press buttons. This could make VR more appealing to a wide range of users, as well as increase the effectiveness of VR applications in various fields such as education, entertainment and others. VR gloves are currently aimed at the enterprise market and are out of the price range of the regular consumer. They offer benefits for social interaction in VR experiences and improve immersion. This article covers findings during the design process of an affordable VR glove prototype, based on hall effect sensors in an angular flexion arrangement, such as: user input surfaces (buttons, joysticks etc.), power supply and management of the glove, sensor accuracy, noise suppression, form factor, wiring, weight distribution, tracking solutions and material selection. ©2024 IEEE.

Keywords: Virtual reality
