检索结果-内蒙古大学图书馆

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial intelligence and Lecture Notes in Bioinformatics) 2014年 8886卷 644-655页

作者： Zhan, Zhi-Hui Zhang, Ge-Yi Ying-Lin Gong, Yue-Jiao Zhang, Jun Department of Computer Science Sun Yat-sen University Guangzhou510006 China Key Lab. Machine Intelligence and Advanced Computing Ministry of Education China Engineering Research Center of Supercomputing Engineering Software MOE China Key Lab. Software Technology Education Department Guangdong Province China School of Sofware Engineering Sun Yat-sen University Guangzhou510006 China Department of Psychology Sun Yat-sen university 510275 China

This paper proposes to solve the task scheduling problem in cloud computing by using a load balance aware genetic algorithm (LAGA) with Minmin and Max-min methods. Task scheduling problems are of great importance in cloud computing, and become especially challenging when taking load balance into account. Our proposed LAGA algorithm has several advantages when solving this kind of problems. Firstly, by introducing the time load balance (TLB) model to help establish the fitness function with makespan, the algorithm benefits from the ability to find the solution that performs best on load balance among a set of solutions with the same makespan. More importantly, the interaction between makespan and TLB helps the algorithm to minimize makespan in the same time. Secondly, Min-min and Max-min methods are used to produce promising individuals at the beginning of evolution, leading to noticeable improvement of evolution efficiency. We evaluated LAGA on several task scheduling problems and compared with a Min-min, Max-min improved version of genetic algorithm (MMGA), which does not use the TLB strategy. The results show that LAGA can obtain very competitive results with good load balancing properties, and outperform MMGA in both makespan and TLB objectives. © Springer International Publishing Switzerland 2014.

关键词： Genetic algorithms

来源：评论

学校读者我要写书评

暂无评论

SIOD: Single Instance Annotated Per Category Per Image for Object Detection

arXiv

引用

arXiv 2022年

作者： Li, Hanjun Pan, Xingjia Yan, Ke Tang, Fan Zheng, Wei-Shi School of Computer Science and Engineering Sun Yat-Sen University China Youtu Lab Tencent China Jilin University China Peng Cheng Laboratory China Key Laboratory of Machine Intelligence and Advanced Computing Ministry of Education

Object detection under imperfect data receives great attention recently. Weakly supervised object detection (WSOD) suffers from severe localization issues due to the lack of instance-level annotation, while semi-supervised object detection (SSOD) remains challenging led by the inter-image discrepancy between lab.led and unlab.led data. In this study, we propose the Single Instance annotated Object Detection (SIOD), requiring only one instance annotation for each existing category in an image. Degraded from inter-task (WSOD) or inter-image (SSOD) discrepancies to the intra-image discrepancy, SIOD provides more reliable and rich prior knowledge for mining the rest of unlab.led instances and trades off the annotation cost and performance. Under the SIOD setting, we propose a simple yet effective framework, termed Dual-Mining (DMiner), which consists of a Similarity-based Pseudo lab.l Generating module (SPLG) and a Pixel-level Group Contrastive Learning module (PGCL). SPLG firstly mines latent instances from feature representation space to alleviate the annotation missing problem. To avoid being misled by inaccurate pseudo lab.ls, we propose PGCL to boost the tolerance to false pseudo lab.ls. Extensive experiments on MS COCO verify the feasibility of the SIOD setting and the superiority of the proposed method, which obtains consistent and significant improvements compared to baseline methods and achieves comparable results with fully supervised object detection (FSOD) methods with only 40% instances annotated. Code is availab.e at https://***/solicucu/SIOD. Copyright © 2022, The Authors. All rights reserved.

关键词： Object recognition

来源：评论

学校读者我要写书评

暂无评论

Combined depth space based architecture search for person re-identification

arXiv

引用

arXiv 2021年

作者： Li, Hanjun Wu, Gaojie Zheng, Wei-Shi School of Computer Science and Engineering Sun Yat-sen University China Peng Cheng Laboratory Shenzhen518005 China Key Laboratory of Machine Intelligence and Advanced Computing Ministry of Education China Pazhou Lab Guangzhou China

Most works on person re-identification (ReID) take advantage of large backbone networks such as ResNet, which are designed for image classification instead of ReID, for feature extraction. However, these backbones may not be computationally efficient or the most suitable architectures for ReID. In this work, we aim to design a lightweight and suitable network for ReID. We propose a novel search space called Combined Depth Space (CDS), based on which we search for an efficient network architecture, which we call CDNet, via a differentiable architecture search algorithm. Through the use of the combined basic building blocks in CDS, CDNet tends to focus on combined pattern information that is typically found in images of pedestrians. We then propose a low-cost search strategy named the Top-k Sample Search strategy to make full use of the search space and avoid trapping in local optimal result. Furthermore, an effective Fine-grained Balance Neck (FBLNeck), which is removable at the inference time, is presented to balance the effects of triplet loss and softmax loss during the training process. Extensive experiments show that our CDNet (∼1.8 M parameters) has comparable performance with state-of-the-art lightweight networks. Copyright © 2021, The Authors. All rights reserved.

关键词： Network architecture

来源：评论

学校读者我要写书评

暂无评论

Syntax-enhanced Pre-trained model

arXiv

引用

arXiv 2020年

作者： Xu, Zenan Guo, Daya Tang, Duyu Su, Qinliang Shou, Linjun Gong, Ming Zhong, Wanjun Quan, Xiaojun Jiang, Daxin Duan, Nan School of Computer Science and Engineering Sun Yat-sen University Guangzhou China Microsoft Research Asia Beijing China Microsoft Search Technology Center Asia Beijing China Guangdong Key Laboratory Big Data Analysis and Processing Guangzhou China Key Lab. of Machine Intelligence and Advanced Computing Ministry of Education China

We study the problem of leveraging the syntactic structure of text to enhance pre-trained models such as BERT and RoBERTa. Existing methods utilize syntax of text either in the pre-training stage or in the fine-tuning stage, so that they suffer from discrepancy between the two stages. Such a problem would lead to the necessity of having human-annotated syntactic information, which limits the application of existing methods to broader scenarios. To address this, we present a model that utilizes the syntax of text in both pre-training and fine-tuning stages. Our model is based on Transformer with a syntax-aware attention layer that considers the dependency tree of the text. We further introduce a new pre-training task of predicting the syntactic distance among tokens in the dependency tree. We evaluate the model on three downstream tasks, including relation classification, entity typing, and question answering. Results show that our model achieves state-of-the-art performance on six public benchmark datasets. We have two major findings. First, we demonstrate that infusing automatically produced syntax of text improves pre-trained models. Second, global syntactic distances among tokens bring larger performance gains compared to local head relations between contiguous tokens. Copyright © 2020, The Authors. All rights reserved.

关键词： Syntactics

来源：评论

学校读者我要写书评

暂无评论

Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis

Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image...

引用

Conference on Computer Vision and Pattern Recognition (CVPR)

作者： Yanzuo Lu Manlin Zhang Andy J Ma Xiaohua Xie Jianhuang Lai School of Computer Science and Engineering Sun Yat-sen University Guangzhou China Guangdong Province Key Laboratory of Information Security Technology China Key Laboratory of Machine Intelligence and Advanced Computing Ministry of Education China Pazhou Lab (HuangPu) Guangzhou China

ISBN: (数字)9798350353006

ISBN: (纸本)9798350353013

Diffusion model is a promising approach to image generation and has been employed for Pose-Guided Person Image Synthesis (PGPIS) with competitive performance. While existing methods simply align the person appearance to the target pose, they are prone to overfitting due to the lack of a high-level semantic understanding on the source person image. In this paper, we propose a novel Coarse-to-Fine Latent Diffusion (CFLD) method for PGPIS. In the absence of image-caption pairs and textual prompts, we de-velop a novel training paradigm purely based on images to control the generation process of a pre-trained text-to-image diffusion model. A perception-refined decoder is designed to progressively refine a set of learnable queries and extract semantic understanding of person images as a coarse-grained prompt. This allows for the decoupling of fine-grained appearance and pose information controls at different stages, and thus circumventing the potential over-fitting problem. To generate more realistic texture details, a hybrid- granularity attention module is proposed to encode multi-scale fine-grained appearance features as bias terms to augment the coarse-grained prompt. Both quantitative and qualitative experimental results on the DeepFashion benchmark demonstrate the superiority of our method over the state of the arts for PGPIS. Code is availab.e at https://***/YanzuoLu/CFLD.

关键词： Training Image synthesis Semantics Text to image Process control Diffusion models Generators

来源：评论

学校读者我要写书评

暂无评论

Cross-camera feature prediction for intra-camera supervised person re-identification across distant scenes

arXiv

引用

arXiv 2021年

作者： Ge, Wenhang Pan, Chunyan Wu, Ancong Zheng, Hongwei Zheng, Wei-Shi School of Computer Science and Engineering Sun Yat-sen University Guangzhou China Pazhou Lab Guangzhou China Key Laboratory of Machine Intelligence and Advanced Computing Ministry of Education China Universtiy of Chinese Academy of Sciences Xinjiang China

Person re-identification (Re-ID) aims to match person images across non-overlapping camera views. The majority of Re-ID methods focus on small-scale surveillance systems in which each pedestrian is captured in different camera views of adjacent scenes. However, in large-scale surveillance systems that cover larger areas, it is required to track a pedestrian of interest across distant scenes (e.g., a criminal suspect escapes from one city to another). Since most pedestrians appear in limited local areas, it is difficult to collect training data with cross-camera pairs of the same person. In this work, we study intra-camera supervised person re-identification across distant scenes (ICS-DS Re-ID), which uses cross-camera unpaired data with intra-camera identity lab.ls for training. It is challenging as cross-camera paired data plays a crucial role for learning camera-invariant features in most existing Re-ID methods. To learn camera-invariant representation from cross-camera unpaired training data, we propose a cross-camera feature prediction method to mine cross-camera self supervision information from camera-specific feature distribution by transforming fake cross-camera positive feature pairs and minimize the distances of the fake pairs. Furthermore, we automatically localize and extract local-level feature by a transformer. Joint learning of global-level and local-level features forms a global-local cross-camera feature prediction scheme for mining fine-grained cross-camera self supervision information. Finally, cross-camera self supervision and intra-camera supervision are aggregated in a framework. The experiments are conducted in the ICS-DS setting on Market-SCT, Duke-SCT and MSMT17-SCT datasets. The evaluation results demonstrate the superiority of our method, which gains significant improvements of 15.4 Rank-1 and 22.3 mAP on Market-SCT as compared to the second best method. Our code is availab.e at https://***/g3956/CCFP. Copyright © 2021, The Authors.

关键词： machine learning

来源：评论

学校读者我要写书评

暂无评论

BTI Aging Monitoring based on SRAM Start-up Behavior

BTI Aging Monitoring based on SRAM Start-up Behavior

引用

Asian Test Symposium (ATS)

作者： Shengyu Duan Peng Wang Gaole Sai School of Computer Engineering and Science Shanghai University Shanghai China State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences China State Key Laboratory of Mathematical Engineering and Advanced Computing Wuxi China Guangdong Provincial Key Lab of Robotics and Intelligent System Shenzhen Institutes of Advanced Technology Chinese Academy of Sciences China CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems Shenzhen Institutes of Advanced Technology China

ISBN: (数字)9781728174679

ISBN: (纸本)9781728174686

Bias Temperature Instability (BTI) is one of the dominant CMOS aging mechanisms. It causes time-dependent variation, threatening circuit lifetime reliability. BTI-induced circuit errors are not detectable at the fabrication stage. On-line monitoring schemes are therefore necessary to capture the degradations during the operational time. Traditional aging monitoring techniques exhibit high implementation complexity and low stability. In this paper, we propose a BTI monitoring approach by simply tracking the start-up behavior of SRAM cells. SRAM is a widely used on-chip device in many applications. We study the impact of BTI for SRAM start-up values and age some cells in a manipulated manner. The BTI degradation is evaluated based on the number of SRAM cells starting with a certain value. This technique can be used to estimate the degradation for on-chip logic circuits without introducing additional circuitry, and thus has very low implementation complexity. We use an SRAM array with 1024 cells to estimate the degradations for multiple logic circuits, and show the average mean absolute percentage error as 8.48%. In addition, this technique is robust considering process, voltage and temperature variations.

关键词： Degradation Monitoring Logic circuits Transistors SRAM cells Aging System-on-chip

来源：评论

学校读者我要写书评

暂无评论

Gray Learning from Non-IID Data with Out-of-distribution Samples

arXiv

引用

arXiv 2022年

作者： Zhao, Zhilin Cao, Longbing Wang, Chang-Dong The Data Science Lab School of Computing and DataX Research Centre Macquarie University SydneyNSW2109 Australia The School of Computer Science and Engineering Sun Yat-sen University Guangzhou China Guangdong Province Key Laboratory of Computational Science Guangzhou China Key Laboratory of Machine Intelligence and Advanced Computing Ministry of Education China

The integrity of training data, even when annotated by experts, is far from guaranteed, especially for non-IID datasets comprising both in- and out-of-distribution samples. In an ideal scenario, the majority of samples would be in-distribution, while samples that deviate semantically would be identified as out-of-distribution and excluded during the annotation process. However, experts may erroneously classify these out-of-distribution samples as in-distribution, assigning them lab.ls that are inherently unreliable. This mixture of unreliable lab.ls and varied data types makes the task of learning robust neural networks notably challenging. We observe that both in- and out-of-distribution samples can almost invariably be ruled out from belonging to certain classes, aside from those corresponding to unreliable ground-truth lab.ls. This opens the possibility of utilizing reliable complementary lab.ls that indicate the classes to which a sample does not belong. Guided by this insight, we introduce a novel approach, termed Gray Learning (GL), which leverages both ground-truth and complementary lab.ls. Crucially, GL adaptively adjusts the loss weights for these two lab.l types based on prediction confidence levels. By grounding our approach in statistical learning theory, we derive bounds for the generalization error, demonstrating that GL achieves tight constraints even in non-IID settings. Extensive experimental evaluations reveal that our method significantly outperforms alternative approaches grounded in robust statistics. Copyright © 2022, The Authors. All rights reserved.

关键词： machine learning

来源：评论

学校读者我要写书评

暂无评论

Exploring architectural ingredients of adversarially robust deep neural networks 21

Exploring architectural ingredients of adversarially robust ...

引用

Proceedings of the 35th International Conference on Neural Information Processing Systems

作者： Hanxun Huang Yisen Wang Sarah Erfani Quanquan Gu James Bailey Xingjun Ma School of Computing and Information Systems The University of Melbourne Victoria Australia Key Lab. of Machine Perception School of Artificial Intelligence Peking University Beijing China and Institute for Artificial Intelligence Peking University Beijing China University of California Los Angeles School of Computer Science Fudan University Shanghai China

ISBN: (纸本)9781713845393

Deep neural networks (DNNs) are known to be vulnerable to adversarial attacks. A range of defense methods have been proposed to train adversarially robust DNNs, among which adversarial training has demonstrated promising results. However, despite preliminary understandings developed for adversarial training, it is still not clear, from the architectural perspective, what configurations can lead to more robust DNNs. In this paper, we address this gap via a comprehensive investigation on the impact of network width and depth on the robustness of adversarially trained DNNs. Specifically, we make the following key observations: 1) more parame- ters (higher model capacity) does not necessarily help adversarial robustness; 2) reducing capacity at the last stage (the last group of blocks) of the network can actually improve adversarial robustness; and 3) under the same parameter budget, there exists an optimal architectural configuration for adversarial robustness. We also provide a theoretical analysis explaning why such network configuration can help robustness. These architectural insights can help design adversarially robust DNNs.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Multi-Stage Speaker Extraction with Utterance and Frame-Level Reference Signals

Multi-Stage Speaker Extraction with Utterance and Frame-Leve...

引用

IEEE International Conference on Acoustics, Speech and Signal Processing

作者： Meng Ge Chenglin Xu Longbiao Wang Eng Siong Chng Jianwu Dang Haizhou Li Tianjin Key Laboratory of Cognitive Computing and Application College of Intelligence and Computing Tianjin University Tianjin China School of Computer Science and Engineering Nanyang Technological University Singapore National University of Singapore Singapore Japan Advanced Institute of Science and Technology Ishikawa Japan Machine Listening Lab University of Bremen Germany

ISBN: (纸本)9781728176055;9781728176062

Speaker extraction requires a sample speech from the target speaker as the reference. However, enrolling a speaker with a long speech is not practical. We propose a speaker extraction technique, that performs in multiple stages to take full advantage of short reference speech sample. The extracted speech in early stages is used as the reference speech for late stages. For the first time, we use frame-level sequential speech embedding as the reference for target speaker. This is a departure from the traditional utterance-based speaker embedding reference. In addition, a signal fusion scheme is proposed to combine the decoded signals in multiple scales with automatically learned weights. Experiments on WSJ0-2mix and its noisy versions (WHAM! and WHAMR!) show that SpEx++ consistently outperforms other state-of-the-art baselines.

关键词： Conferences Signal processing Acoustics Noise measurement Time-domain analysis Speech processing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：