检索结果-内蒙古大学图书馆

IEEE International Conference on e-Business engineering (ICEBE)

作者： Jiayin Tian Yaozhi Wang Jiaxin Liu Yan Chen School of Computer Science and Technology Xi'an Jiaotong University Xi'an Shaanxi China Shaanxi Key Lab of Big Data Knowledge Engineering Xi'an Jiaotong University Xi'an Shaanxi China Shaanxi Key Lab of Big Data Knowledge Engineering School of Computer Science and Technology Xi'an Jiaotong University Xi'an Jiaotong University Xi'an Shaanxi China

ISBN: (数字)9798350365856

ISBN: (纸本)9798350365863

In a world brimming with new products continually, novel waste types are ubiquitous. This makes current image-based garbage classification systems difficult to perform well due to the long-tailed effects of distribution of garbage types, and necessitates an urgent and efficient garbage classification with abilities of detecting new and rare wastes and class-incremental learning for environmental sustainability. Therefore, we propose a framework of Online System of Garbage Image-Oriented Intelligent Classification, Submission, and Examination, facilitating the incremental garbage classification efforts. In which, to identify novel garbage effectively, we also introduced few-shot object detection method with two key algorithms: Two-Stage Object Detection Learning Algorithm and Dynamic Query-based Incremental Few-shot Learning Algorithm. Our experiment results show that Both outperform the current existing ones in dataset, MS COCO. Then, a strategy of Class-Incremental learning based Residual Network is proposed to meet the need of new waste class-incremental learning. The experimental results support our strategy. Finally, a prototype system employed the above algorithms and the strategy is described.

关键词： Image recognition Heavily-tailed distribution Heuristic algorithms Green products Prototypes Object detection Classification algorithms Object recognition Few shot learning Residual neural networks

来源：评论

学校读者我要写书评

暂无评论

UniMoCo: Unsupervised, Semi-Supervised and Fully-Supervised Visual Representation Learning

UniMoCo: Unsupervised, Semi-Supervised and Fully-Supervised ...

引用

2022 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2022

作者： Dai, Zhigang Cai, Bolun Chen, Junying South China University of Technology Key Lab. of Big Data & Intell. Robot MoE School of Software Engineering Guangzhou China Tencent Wechat Ai Guangzhou China

ISBN: (数字)9781665452588

ISBN: (纸本)9781665452588

Momentum Contrast (MoCo) achieves great success for unsupervised visual representation learning. However, there are a lot of supervised and semi-supervised datasets, which are already labeled. To fully utilize the label annotations, we propose Unified Momentum Contrast (UniMoCo), which extends MoCo to support arbitrary ratios of labeled data and unlabeled data training. Compared with MoCo, UniMoCo has two modifications as follows: (1) Different from a single positive pair in MoCo, we maintain multiple positive pairs on-the-fly by comparing the query label to a label queue. (2) We propose a Unified Contrastive (UniCon) loss to support an arbitrary number of positives and negatives in a unified pair-wise optimization perspective. Our UniCon is more reasonable and powerful than the supervised contrastive loss in theory and practice. In our experiments, we pre-train multiple UniMoCo models with different ratios of ImageNet labels and evaluate the performance on various downstream tasks. Experiment results show that UniMoCo generalizes well for unsupervised, semi-supervised and fully-supervised visual representation learning. Besides, we surprisingly find that UniMoCo performs best with 60% ImageNet labels for COCO and VOC transfer learning. The code is available: https://***/dddzg/unimoco. © 2022 IEEE.

关键词： Momentum

来源：评论

学校读者我要写书评

暂无评论

SP3: Enhancing Structured Pruning via PCA Projection

arXiv

引用

arXiv 2023年

作者： Hu, Yuxuan Zhang, Jing Zhao, Zhe Zhao, Chen Chen, Xiaodong Li, Cuiping Chen, Hong School of Information Renmin University of China Beijing China Key Laboratory of Data Engineering and Knowledge Engineering MOE China Engineering Research Center of Database and Business Intelligence MOE China Tencent AI Lab Tencent Beijing China School of Computer Science and Technology Xi'an Jiaotong University Xi'An China

Structured pruning is a widely used technique for reducing the size of pre-trained language models (PLMs), but current methods often overlook the potential of compressing the hidden dimension (d) in PLMs, a dimension critical to model size and efficiency. This paper introduces a novel structured pruning approach, Structured Pruning with PCA Projection (SP3), targeting the effective reduction of d by projecting features into a space defined by principal components before masking. Extensive experiments on benchmarks (GLUE and SQuAD) show that SP3 can reduce d by 70%, compress 94% of the BERTbase model, and maintain over 96% accuracy and outperform other methods that compress d by 6% in accuracy at the same compression ratio. SP3 has also proven effective with other models, including OPT and Llama. Our data and code are available at ours repo. © 2023, CC BY.

关键词：

来源：评论

学校读者我要写书评

暂无评论

SALI: A Scalable Adaptive Learned Index Framework based on Probability Models

arXiv

引用

arXiv 2023年

作者： Ge, Jiake Zhang, Huanchen Shi, Boyu Luo, Yuanhui Guo, Yunda Chai, Yunpeng Chen, Yuxing Pan, Anqun Key Laboratory of Data Engineering and Knowledge Engineering MOE China Engineering Research Center of Database and Business Intelligence MOE China School of Information Renmin University of China China China and Shanghai Qi Zhi Institute China Tsinghua University China Tencent Inc China

The growth in data storage capacity and the increasing demands for high performance have created several challenges for concurrent indexing structures. One promising solution is learned indexes, which use a learning-based approach to fit the distribution of stored data and predictively locate target keys, significantly improving lookup performance. Despite their advantages, prevailing learned indexes exhibit constraints and encounter issues of scalability on multi-core data storage. This paper introduces SALI, the Scalable Adaptive Learned Index framework, which incorporates two strategies aimed at achieving high scalability, improving efficiency, and enhancing the robustness of the learned index. Firstly, a set of node-evolving strategies is defined to enable the learned index to adapt to various workload skews and enhance its concurrency performance in such scenarios. Secondly, a lightweight strategy is proposed to maintain statistical information within the learned index, with the goal of further improving the scalability of the index. Furthermore, to validate their effectiveness, SALI applied the two strategies mentioned above to the learned index structure that utilizes fine-grained write locks, known as LIPP. The experimental results have demonstrated that SALI significantly enhances the insertion throughput with 64 threads by an average of 2.04× compared to the second-best learned index. Furthermore, SALI accomplishes a lookup throughput similar to that of LIPP+. © 2023, CC BY-NC-ND.

关键词： Scalability

来源：评论

学校读者我要写书评

暂无评论

DIFF3DS: GENERATING VIEW-CONSISTENT 3D SKETCH VIA DIFFERENTIABLE CURVE RENDERING

arXiv

引用

arXiv 2024年

作者： Zhang, Yibo Wang, Lihong Zou, Changqing Wu, Tieru Ma, Rui Jilin University China State Key Lab of CAD&CG Zhejiang University China Zhejiang Lab China Engineering Research Center of Knowledge-Driven Human-Machine Intelligence MOE China

3D sketches are widely used for visually representing the 3D shape and structure of objects or scenes. However, the creation of 3D sketch often requires users to possess professional artistic skills. Existing research efforts primarily focus on enhancing the ability of interactive sketch generation in 3D virtual systems. In this work, we propose Diff3DS, a novel differentiable rendering framework for generating view-consistent 3D sketch by optimizing 3D parametric curves under various supervisions. Specifically, we perform perspective projection to render the 3D rational Bézier curves into 2D curves, which are subsequently converted to a 2D raster image via our customized differentiable rasterizer. Our framework bridges the domains of 3D sketch and raster image, achieving end-to-end optimization of 3D sketch through gradients computed in the 2D image domain. Diff3DS can enable a series of novel 3D sketch generation tasks, including text-to-3D sketch and image-to-3D sketch, supported by the popular distillation-based supervision, such as Score Distillation Sampling. Extensive experiments have yielded promising results and demonstrated the potential of our framework. Project page is at https://***/Diff3DS/ © 2024, CC BY.

关键词： Three dimensional computer graphics

来源：评论

学校读者我要写书评

暂无评论

Frontiers of Artificial Intelligent and Quantitative Management: Preface for ITQM 2024

引用

Procedia Computer Science 2024年 242卷 1-8页

作者： Filip, Florin Gheorghe Shi, Yong Pocatilu, Paul Ciurea, Cristian-Eugen Li, Jianping Tien, James M. Berg, Daniel Romanian Academy Information Science and Technology Section Bucharest Romania Research Centre on Fictitious Economy and Data Science Chinese Academy of Sciences Beijing100190 China Key Lab of Big Data Mining and Knowledge Management Chinese Academy of Sciences Beijing100190 China School of Economics and Management University of Chinese Academy of Sciences Beijing100190 China College of Information Science and Technology University of Nebraska at Omaha OmahaNE68182 United States Bucharest University of Economic Studies Bucharest Romania College of Engineering University of Miami Coral GablesFL33124 United States

来源：评论

学校读者我要写书评

暂无评论

TREC: transient redundancy elimination-based convolution 22

TREC: transient redundancy elimination-based convolution

引用

Proceedings of the 36th International Conference on Neural Information Processing Systems

作者： Jiawei Guan Feng Zhang Jiesong Liu Hsin-Hsuan Sung Ruofan Wu Xiaoyong Du Xipeng Shen Key Laboratory of Data Engineering and Knowledge Engineering (MOE) and School of Information Renmin University of China Computer Science Department North Carolina State University

ISBN: (纸本)9781713871088

The intensive computations in convolutional neural networks (CNNs) pose challenges for resource-constrained devices; eliminating redundant computations from convolution is essential. This paper gives a principled method to detect and avoid transient redundancy, a type of redundancy existing in input data or activation maps and hence changing across inferences. By introducing a new form of convolution (TREC), this new method makes transient redundancy detection and avoidance an inherent part of the CNN architecture, and the determination of the best configurations for redundancy elimination part of CNN backward propagation. We provide a rigorous proof of the robustness and convergence of TREC-equipped CNNs. TREC removes over 96% computations and achieves 3.51× average speedups on microcontrollers with minimal (about 0.7%) accuracy loss.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Top-one Recommendation with Anonymous User Behaviors 39

Top-one Recommendation with Anonymous User Behaviors

引用

39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025

作者： Lu, Xiangkui Wu, Jun MoE Key Lab of Big Data & Artificial Intelligence in Transportation Beijing Jiaotong University Beijing 100044 China MoE Engineering Research Center of Integration and Application of Digital Learning Technology The Open University of China Beijing 100039 China

ISBN: (纸本)157735897X

Top-one recommendation with anonymous user behaviors, also known as session-based recommendation (SBR), faces challenges of top-one ranking and short anonymous sequences. To this end, we propose a novel objective that combines (i) a reciprocal rank loss to directly optimize the benchmark metric of top-one recommendation, with (ii) a listwise contrastive loss to handle short sequences through listwise augmented consistency regularization. Empirical studies demonstrate that optimizing the proposed objective significantly improves the performance of existing SBR baselines. Copyright © 2025, Association for the Advancement of Artificial Intelligence (***). All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Semi-supervised Multi-label Learning with Missing labels via Correlation Information

Semi-supervised Multi-Label Learning with Missing Labels via...

引用

International Joint Conference on Neural Networks (IJCNN)

作者： Zexian Xie Peipei Li Jinling Jiang Xindong Wu Key Laboratory of Knowledge Engineering with Big Data (the Ministry of Education of China) Hefei University of Technology School of Computer Science and Information Engineering Hefei University of Technology Hefei Anhui China Knowledge Engineering Research Center Zhejiang Lab Hangzhou Zhejiang China

In multi-label learning, each instance is associated with a set of labels simultaneously. Most existing studies assume that the set of labels for each instance is complete. However, it is generally difficult to obtain all the relevant labels of each instance, and only a partial or even empty set of relevant labels is available, which is called semi-supervised multi-label learning with missing labels. To tackle this problem, we propose a novel framework that considers label correlations and instance correlations to recover the missing labels and utilizes a large amount of unlabeled data simultaneously to improve the classification performance. Specifically, a new supplementary label matrix is firstly obtained by learning the label correlation. Secondly, considering each class label may be decided by some specific characteristics of its own, a label-specific data representation is hence learned for each class label. Thirdly, instance correlations are utilized not only to recover the missing labels, but also to propagate the supervision information from labeled instances to unlabeled ones. In addition, a united objective function is designed to facilitate the above processing and an accelerated proximal gradient method is adopted to solve the optimization problem. Finally, extensive experimental results conducted on several benchmark datasets demonstrate the effectiveness of the proposed method compared to competing ones.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Multi-lingual acquisition on multimodal pre-training for cross-modal retrieval 22

Multi-lingual acquisition on multimodal pre-training for cro...

引用

Proceedings of the 36th International Conference on Neural Information Processing Systems

作者： Liang Zhang Anwen Hu Qin Jin School of Information Renmin University of China School of Information Renmin University of China and Key Laboratory of Data Engineering and Knowledge Engineering (MOE) Renmin University of China

ISBN: (纸本)9781713871088

Vision and diverse languages are important information sources in our living world. A model that understands multi-modalities and multi-languages can be applied to a wider range of real-life scenarios. To build such a multimodal and multilingual model, existing works try to ensemble vision-language data from multiple languages in pre-training. However, due to the large number of languages, these works often require huge computing resources and cannot be flexibly extended to new languages. In this work, we propose a Multi-Lingual Acquisition (MLA) framework that can easily empower a monolingual Vision-Language Pre-training (VLP) model with multilingual capability. Specifically, we design a lightweight language acquisition encoder based on state-of-the-art monolingual VLP models. We further propose a two-stage training strategy to optimize the language acquisition encoder, namely the Native Language Transfer stage and the Language Exposure stage. With much less multilingual training data and computing resources, our model achieves state-of-the-art performance on multilingual image-text and video-text retrieval benchmarks.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：