Search Results - Inner Mongolia University Library

6th International Conference on Intelligent Computing and Signal Processing (ICSP)

Authors: Xiaobo Chang; Linlin Zhao; Jinxiang Wang. Affiliations: College of Engineering, Yanbian University, Yanji, China; Intelligent Information Processing Lab., Dept. of Computer Science & Technology, Yanbian University, Yanji, China; Computer Science and Technology, College of Engineering, Yanbian University, Yanji, China

Image recognition has become a necessary component of computer vision systems and is widely used to detect objects for downstream tasks in real-world applications. However, existing methods concentrate on using the clustering information of image features to recognize subjects; they cannot handle several highly correlated subjects and incur substantial computation time. In this paper, we apply the convolution operation to images and extract separated features. After acquiring these features, a deep neural network is built and trained for a sufficient number of iterations to recognize the objects in the input images. Subsequently, the trained model is evaluated on the test dataset to measure the real performance of the proposed method. From our extensive experimental results, we conclude that the proposed model can automatically recognize input images with reasonable accuracy and acceptable computation cost. Additionally, our experimental results indicate that the convolutional operation is better suited to processing image datasets than traditional machine learning methods.

A Novel Miniaturized Wideband High-Gain Palm-Leaf Vivaldi Array Antenna

2023 IEEE International Workshop on Electromagnetics: Applications and Student Innovation Competition, iWEM 2023

Authors: Wang, Min; Li, Xuan; Xiang, Ceng; Chen, Zhengchuan. Affiliations: Chongqing University of Posts and Telecommunications, Postdoctoral Res. Ctr. of Chongqing Key Lab. of Optoelectron. Info. Sensing and Transmiss. Technol., Chongqing 400065, China; Guilin University of Electronic Technology, Guangxi Wireless Broadband Communication and Signal Processing Key Laboratory, Guilin 541004, China; Chongqing University, School of Microelectronics and Communication Engineering, Chongqing 400044, China

ISBN: (print) 9798350336740

A novel miniaturized wideband high-gain palm-leaf Vivaldi array antenna is presented in this work. First, a novel Vivaldi antenna element is designed. To miniaturize the element, three groups of arc-shaped slots of varying lengths are etched on the radiator, and a transition structure from a fan-shaped microstrip line to a slotted line is introduced in the feeding structure. In addition, two rectangular slots are introduced on both sides of the slotted line to suppress the sidelobe level. Then, to facilitate the cascading of elements, a 1×16 microstrip power division feed network with equal amplitude and equal phase is designed. Finally, a wideband high-gain 4 × 4 Vivaldi array antenna is designed and simulated. Simulated results indicate that the relative bandwidth of the presented Vivaldi array antenna reaches 42.2% (3.6 GHz-5.6 GHz), and the gain reaches 15.8 dBi at 4.5 GHz. © 2023 IEEE.

Keywords: Microwave antennas

Automated Clinical Summary Generation via Integrating Structured and Unstructured Data

12th CCF Conference on BigData, BigData 2024

Authors: Fu, Jiaojiao; Yang, Bowen; Guo, Yi; Zhou, Yangfan; Wang, Xin. Affiliations: School of Information Science and Engineering, East China University of Science and Technology, Shanghai, China; Shanghai Key Lab. of Intelligent Information Processing, Fudan University, Shanghai, China; School of Computer Science, Fudan University, Shanghai, China

ISBN: (print) 9789819610235

Automatically generating clinical texts can significantly reduce the time physicians spend on clinical data recording, which is particularly important for developing countries where physicians are extremely busy due to a severe shortage. This work automatically generates discharge summaries as a case to explore the methods and feasibilities of automatic clinical text summarization. Existing work typically uses either structured or unstructured data alone to generate discharge summaries. However, the content generated often has issues such as being overly verbose, lacking focus, or omitting significant information, especially key indicators and medications. This work innovatively proposes a data integration-based clinical text generation approach, using content generated from unstructured clinical data as the basis and supplementing it with text generated from structured clinical data. This study utilizes advanced natural language processing algorithms and models to create clinical texts. It addresses the challenges of lacking datasets suitable for fine-tuning pre-trained models and combining the advantages rather than the disadvantages of both types to produce discharge summaries. Experimental results show that the structured supplementation approach can effectively improve the generation of clinical texts. This work demonstrates that clinical texts generated using existing natural language processing technologies still do not meet the demands of medical practice, pointing out the need to develop further text generation technologies tailored to the characteristics of clinical data. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords: Data assimilation

Neural Signal Compression System with Spike Detection Using Compressed Sensing

2023 Cross Strait Radio Science and Wireless Technology Conference, CSRSWTC 2023

Authors: Zheng, Ruihan; Xia, Yu; Li, Dongming; Wang, Liyang; Li, Hung Chun; Mak, Peng Un; Vai, Mang I.; Pun, Sio Hang. Affiliations: State Key Laboratory of Analog and Mixed-Signal VLSI, University of Macau, Macau, China; Institution of Microelectronics, University of Macau, Macau, China; Jt. Lab. of Zhuhai UM Sci. and Technol. Research Institution - Lingyange Semiconductor Incorporated, Zhuhai, China; University of Macau, Faculty of Science and Technology, Department of Electrical and Computer Engineering, Macau, China

ISBN: (print) 9798350358971

This article demonstrates a signal compression method for wireless invasive neural recording systems. A compression system with spike detection for neural signals is proposed. The input signal first passes through the spike detection part, and the intercepted spike segments are then sent to the compression part. A compressed sensing technique is applied in the compression part, and the Minimum Euclidean or Manhattan Distance Cluster-based (MDC) matrix is adopted for compressing neural spike segments. In simulation, the compression rate can surpass 99% and the signal-to-noise distortion ratio is around 37 dB. The proposed method is also compared with direct compression of the input neural signals: when the input neural signal is compressed directly, the compression rate is 98% and the signal-to-noise distortion ratio is about 28 dB. By employing spike detection and the MDC matrix, it becomes possible to compress non-sparse spike segments in the time domain, resulting in a higher compression rate and improved reconstruction performance. © 2023 IEEE.

Keywords: compressed sensing; neural signal compression; spike detection

Data-consistent Unsupervised Diffusion Model for Metal Artifact Reduction

2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023

Authors: Tong, Zhan; Wu, Zhan; Yang, Yang; Mao, Weilong; Wang, Shijie; Li, Yinsheng; Chen, Yang. Affiliations: Southeast University, Laboratory of Image Science and Technology, Nanjing 210096, China; Southeast University, Ministry of Education, Key Laboratory of Computer Network and Information Integration, Nanjing 210096, China; Chinese Academy of Sciences, Research Center for Medical Artificial Intelligence, Shenzhen Institutes of Advanced Technology, Shenzhen 518055, China; School of Computer Science and Engineering, Key Lab. of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications, Jiangsu Provincial Joint International Research Laboratory of Medical Information Processing, The Laboratory of Image Science and Technology, Nanjing 210096, China

ISBN: (print) 9798350337488

Computed Tomography (CT) is an imaging technique widely used in clinical diagnosis. However, high-attenuation metallic implants result in the obstruction of low-energy X-rays and further lead to metal artifacts in the reconstructed CT images. Deep supervised model-based metal artifact reduction (MAR) approaches are limited in clinical applications due to the difficulty in obtaining paired artifact-affected and artifact-free data. Furthermore, these model-based methods lack the consideration of data consistency in the sinogram domain to perform exact metal trace inpainting. To address these challenges, we propose a Data-consistent unsupErVised diffusiOn model for meTal artifact rEDuction, called DEVOTED-Net. First, DEVOTED-Net leverages prior knowledge to guide the conditional diffusion model for fine-grained metal trace inpainting. Second, an unsupervised MAR framework is designed in the reverse process for the restoration of unknown metal traces in the sinogram domain. Third, to further enhance the sinogram-domain data consistency, a physics-based consistency constraint loss including a conjugate-ray consistency loss and an accumulation-ray consistency loss is designed. Extensive experiments are carried out to verify the performance of our algorithm on the publicly available dataset and a clinical experimental dataset. This efficient, accurate, and reliable MAR approach holds great potential in clinics. © 2023 IEEE.

Keywords: computed tomography; deep unsupervised learning; denoising diffusion probabilistic model; metal artifact reduction; physics-based consistency constraint

Instruction Tuning-free Visual Token Complement for Multimodal LLMs

arXiv, 2024

Authors: Wang, Dongsheng; Cui, Jiequan; Li, Miaoge; Lin, Wang; Chen, Bo; Zhang, Hanwang. Affiliations: College of Computer Science and Software Engineering, Shenzhen University, China; School of Computer Science and Engineering, Nanyang Technological University, Singapore; Department of Computing, Hong Kong Polytechnic University, Hong Kong; College of Computer Science and Technology, Zhejiang University, China; National Key Lab of Radar Signal Processing, Xidian University, China

As the open community of large language models (LLMs) matures, multimodal LLMs (MLLMs) have promised an elegant bridge between vision and language. However, current research is inherently constrained by challenges such as the need for high-quality instruction pairs and the loss of visual information in image-to-text training objectives. To this end, we propose a Visual Token Complement framework (VTC) that helps MLLMs regain the missing visual features and thus improve response accuracy. Specifically, our VTC integrates text-to-image generation as a guide to identifying the text-irrelevant features, and a visual selector is then developed to generate complementary visual tokens to enrich the original visual input. Moreover, an iterative strategy is further designed to extract more visual information by iteratively using the visual selector without any additional training. Notably, the training pipeline requires no additional image-text pairs, resulting in a desired instruction tuning-free property. Both qualitative and quantitative experiments demonstrate the superiority and efficiency of our VTC. Copyright © 2024, The Authors. All rights reserved.

Keywords: Visual BASIC

A Robust Interference Suppression Method Based on FDA-MIMO Radar

IEEE International Conference on Information Communication and Signal Processing (ICICSP)

Authors: Zhixia Wu; Shengqi Zhu; Jingwei Xu; Lan Lan; Mengdi Zhang; Ximin Li. Affiliations: National Lab. of Radar Signal Processing, Xidian University, Xi'an, China; School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China

In interference suppression, the performance of traditional adaptive methods degrades when the mainlobe and sidelobe interference have angle errors. To this end, a robust adaptive beamforming technique based on frequency diversity array (FDA) multiple-input multiple-output (MIMO) radar is proposed in this work. First, preprocessing in the data domain is adopted for mainlobe interference cancellation. Then, the sidelobe interference is suppressed in the receiving dimension. Finally, a robust adaptive beamforming method is applied to suppress sidelobe interference. Simulation results show the effectiveness of the proposed algorithm.

Keywords: Interference suppression; Interference cancellation; Array signal processing; Simulation; Signal processing algorithms; Interference; Radar

Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023

Authors: Zhu, Zijian; Zhang, Yichi; Chen, Hai; Dong, Yinpeng; Zhao, Shu; Ding, Wenbo; Zhong, Jiachen; Zheng, Shibao. Affiliations: Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, China; Institute for AI, Tsinghua University, BNRist Center, THBI Lab, Dept. of Comp. Sci. and Tech., China; School of Computer Science and Technology, Anhui University, Key Laboratory of Intelligent Computing and Signal Processing, Ministry of Education, Information Materials and Intelligent Sensing Laboratory of Anhui Province, China; SAIC Motor AI Lab, Zhongguancun Laboratory, China

ISBN: (print) 9798350301298

3D object detection is an essential perception task in autonomous driving for understanding the environment. Bird's-Eye-View (BEV) representations have significantly improved the performance of 3D detectors with camera inputs on popular benchmarks. However, a systematic understanding of the robustness of these vision-dependent BEV models, which is closely related to the safety of autonomous driving systems, is still lacking. In this paper, we evaluate the natural and adversarial robustness of various representative models under extensive settings, to fully understand their behaviors influenced by explicit BEV features compared with those without BEV. In addition to the classic settings, we propose a 3D consistent patch attack by applying adversarial patches in the 3D space to guarantee spatiotemporal consistency, which is more realistic for the scenario of autonomous driving. With substantial experiments, we draw several findings: 1) BEV models tend to be more stable than previous methods under different natural conditions and common corruptions due to the expressive spatial representations; 2) BEV models are more vulnerable to adversarial noises, mainly caused by the redundant BEV features; 3) Camera-LiDAR fusion models have superior performance under different settings with multi-modal inputs, but the BEV fusion model is still vulnerable to adversarial noises of both point cloud and image. These findings highlight safety issues in the applications of BEV detectors and could facilitate the development of more robust models. © 2023 IEEE.

Keywords: Autonomous driving

EFFICIENT ONLINE LABEL CONSISTENT HASHING FOR LARGE-SCALE CROSS-MODAL RETRIEVAL

2021 IEEE International Conference on Multimedia and Expo, ICME 2021

Authors: Yi, Jinhan; Liu, Xin; Cheung, Yiu-Ming; Xu, Xing; Fan, Wentao; He, Yi. Affiliations: Department of Computer Science and Technology, Huaqiao University, Xiamen 361021, China; Xiamen Key Lab. of Computer Vision and Pattern Recognition, Fujian Key Lab. of Big Data Intelligence and Security, China; Department of Computer Science, Hong Kong Baptist University, Kowloon, Hong Kong; School of Computer Science and Engineering, University of Electronic Science and Technology of China, China; Provincial Key Laboratory for Computer Information Processing Technology, Soochow University, China

ISBN: (print) 9781665438643

Existing cross-modal hashing still faces three challenges: (1) Most batch-based methods are unsuitable for processing large-scale and streaming data. (2) Current online methods often suffer from insufficient semantic association, while lacking flexibility to learn the hash functions for varying streaming data. (3) Existing supervised methods always require much computation time or accumulate large quantization loss to learn hash codes. To address the above challenges, we present an efficient Online Label Consistent Hashing (OLCH) for cross-modal retrieval, which aims to incrementally learn hash codes for the currently arriving data, while updating the hash functions in a streaming manner. To be specific, an online semantic representation learning framework is designed to adaptively preserve the semantic similarity across different modalities, and a mini-batch online gradient descent approach associated with forward-backward splitting is developed to optimize the hash functions. Accordingly, the hash codes are adaptively learned online with high discriminative capability, while avoiding high computational complexity in processing the streaming data. Experimental results show its outstanding performance in comparison with the state of the art. © 2021 IEEE Computer Society. All rights reserved.

Keywords: Hash functions

Design of a Linearly Polarized Dual-Band Metal-Only Transmitarray Antenna Element

11th IEEE Asia-Pacific Conference on Antennas and Propagation, APCAP 2023 - Proceedings

Authors: Wang, Min; Hu, Yang; Li, Xuan; Hao, Honggang; Chen, Zhengchuan. Affiliations: Chongqing University of Posts and Telecommunications, Postdoctoral Res. Ctr. of Chongqing Key Lab. of Optoelectron. Info. Sensing and Transmiss. Technol., Chongqing 400065, China; Guilin University of Electronic Technology, Guangxi Wireless Broadband Communication and Signal Processing Key Laboratory, Guilin 541004, China; Southeast University, State Key Laboratory of Millimeter Waves, Nanjing 210096, China; Chongqing University, Chongqing 400044, China

ISBN: (print) 9798350326277

A linearly polarized dual-band metal-only transmitarray antenna (TA) element is proposed. The TA element consists of four identical metallic layers without dielectric substrates, with an air gap between each pair of adjacent layers. The layers are positioned equidistantly and symmetrically along the z-axis. Each metallic layer consists of an interleaved orthogonal H-typed slot and a Jerusalem cross slot, which work at two different frequency bands. By adjusting the length between the two rectangular slots along the y-axis of the H-typed slot and the width of the Jerusalem cross slot, the lower frequency band and the upper frequency band can be controlled, respectively. The proposed TA element has a 3-dB transmission bandwidth of 18.0% and 14.0% in the lower and upper frequency bands, respectively. Furthermore, phase coverage of 330° and 360° with low loss is achieved at 10.0 GHz and 15.0 GHz, respectively. © 2023 IEEE.
