Can we reduce the search cost of Neural Architecture Search (NAS) from days down to only a few hours? NAS methods automate the design of Convolutional Networks (ConvNets) under hardware constraints, and they have emerged as key components of AutoML frameworks. However, the NAS problem remains challenging due to the combinatorially large design space and the significant search time (at least 200 GPU-hours). In this article, we reduce the NAS search cost to less than 3 hours, while achieving state-of-the-art image classification results under mobile latency constraints. We propose a novel differentiable NAS formulation, namely Single-Path NAS, that uses one single-path over-parameterized ConvNet to encode all architectural decisions based on shared convolutional kernel parameters, hence drastically decreasing the search overhead. Single-Path NAS achieves state-of-the-art top-1 ImageNet accuracy (75.62%), outperforming existing mobile NAS methods in similar latency settings (~80 ms). In particular, we enhance the accuracy-runtime trade-off in differentiable NAS by treating the Squeeze-and-Excitation path as a fully searchable operation with our novel single-path encoding. Our method has an overall cost of only 8 epochs (24 TPU-hours), which is up to 5,000x faster compared to prior work. Moreover, we study how different NAS formulation choices affect the performance of the designed ConvNets. Furthermore, we exploit the efficiency of our method to answer an interesting question: instead of empirically tuning the hyperparameters of the NAS solver (as in prior work), can we automatically find the hyperparameter values that yield the desired accuracy-runtime trade-off (e.g., a target runtime for different platforms)? We view our extensive experimental results as a valuable exploration for NAS-based cloud AutoML services, and we open-source our entire codebase at: https://***/dstamoulis/single-path-nas.
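The shared-kernel idea behind the single-path encoding can be sketched in a few lines. This is an illustrative toy, not the authors' implementation: in the actual method the 3x3-vs-5x5 decision is a learned, differentiable threshold on kernel-weight norms, whereas here a plain boolean stands in for it.

```python
import numpy as np

def single_path_kernel(w5x5, use_5x5):
    """Derive the effective kernel from one shared 5x5 'superkernel'.

    The 3x3 candidate is encoded as the centre slice of the 5x5 weights,
    so both candidate operations share parameters. `use_5x5` plays the
    role of the (normally learned, differentiable) architecture decision.
    """
    if use_5x5:
        return w5x5
    # Zero out the outer ring: the effective kernel is the inner 3x3.
    k = np.zeros_like(w5x5)
    k[1:4, 1:4] = w5x5[1:4, 1:4]
    return k

w = np.arange(25, dtype=float).reshape(5, 5)
k3 = single_path_kernel(w, use_5x5=False)   # 3x3 kernel, same parameters
k5 = single_path_kernel(w, use_5x5=True)    # full 5x5 kernel
```

Because every candidate reads from the same weight tensor, the search never instantiates one path per candidate operation, which is what keeps the memory and time overhead close to training a single ConvNet.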
The rapid advancement of artificial intelligence and widespread use of smartphones have resulted in an exponential growth of image data, both real (camera-captured) and virtual (AI-generated). This surge underscores t...
There have been several successful deep learning models that perform audio super-resolution. Many of these approaches involve using preprocessed feature extraction which requires a lot of domain-specific signal proces...
Despite the tremendous success of deep neural networks in various learning problems, it has been observed that adding intentionally designed adversarial perturbations to the inputs of these architectures leads to erroneous classification with high confidence in the prediction. In this work, we show that adversarial examples can be generated using a generic approach that relies on the perturbation analysis of learning algorithms. Formulated as a convex program, the proposed approach retrieves many current adversarial attacks as special cases. It is used to propose novel attacks against learning algorithms for classification and regression tasks under various new constraints, with closed-form solutions in many instances. In particular, we derive new attacks against classification algorithms which are shown to be top-performing on various architectures. Although classification tasks have been the main focus of adversarial attacks, we use the proposed approach to generate adversarial perturbations for various regression tasks. Designed for single-pixel and single-subset attacks, these attacks are applied to autoencoding, image colorization, and real-time object detection tasks, showing that adversarial perturbations can degrade the output of regression tasks just as severely. In the spirit of encouraging reproducible research, the implementations used in this paper have been made available at: ***/ebalda/adversarialconvex.
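As an illustration of how a known attack falls out of such a perturbation analysis (a hypothetical toy, not the paper's convex solver): linearizing the loss and maximizing it under an l-infinity budget gives the familiar closed-form sign-of-gradient perturbation, shown here for a toy linear regression model.

```python
import numpy as np

def linf_perturbation(grad, eps):
    """Closed-form worst-case perturbation under an l-infinity budget.

    For a loss linearized as L(x + d) ~ L(x) + grad . d, the maximizer
    over ||d||_inf <= eps is d = eps * sign(grad) -- the FGSM-style
    special case that a convex formulation of this kind recovers.
    """
    return eps * np.sign(grad)

# Toy regression model f(x) = w.x; the squared-error gradient wrt the
# input x is 2 * (f(x) - y) * w.
w = np.array([1.0, -2.0, 0.5])
x = np.array([0.2, 0.1, -0.3])
y = 0.0
grad = 2.0 * (w @ x - y) * w
delta = linf_perturbation(grad, eps=0.1)
loss_before = (w @ x - y) ** 2
loss_after = (w @ (x + delta) - y) ** 2
```

The same recipe with an l-2 budget yields a gradient-direction perturbation instead; restricting the support of `delta` gives the single-pixel and single-subset variants.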
ISBN:
(print) 9781450362047
Convolutional Neural Networks (CNNs) are able to learn basic and high-level features hierarchically, with the highlight that they implement an end-to-end learning method. However, their limited ability to utilize prior information and domain knowledge makes such networks hard to train. In this paper, we propose a method that uses prior information by appending prior feature maps through a bypass input structure. As an implementation, we evaluate a convolutional neural network integrated with the Self-Quotient Image (SQI) algorithm. Through the bypass, we import the feature maps from the SQI algorithm and concatenate them with the output of the first convolution layer. With the help of traditional image processing methods, CNNs can directly improve accuracy and training stability, and the bypass is exactly the point of synergy. Finally, the necessity of this bypass pattern is that it avoids direct modification of the original images. As CNNs are able to focus on far richer features than basic image processing methods, it is advisable to expose CNNs to the original data. This is exactly our main design idea: the output of the synergistic processing algorithm enters the network from the side as a bypass.
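A minimal sketch of the bypass pattern (the `self_quotient` helper below is a crude stand-in for the real SQI algorithm, which uses weighted Gaussian smoothing, and plain NumPy arrays stand in for the network's tensors):

```python
import numpy as np

def self_quotient(img, ksize=3):
    """Crude Self-Quotient-style map: the image divided by a local mean.
    (Illustrative stand-in for the actual SQI algorithm.)"""
    pad = ksize // 2
    padded = np.pad(img, pad, mode="edge")
    smooth = np.zeros_like(img, dtype=float)
    h, w = img.shape
    for i in range(h):
        for j in range(w):
            smooth[i, j] = padded[i:i + ksize, j:j + ksize].mean()
    return img / (smooth + 1e-8)

def bypass_concat(conv_out, prior_map):
    """Append the prior feature map as an extra channel (channels-first),
    leaving the original image input and conv weights untouched."""
    return np.concatenate([conv_out, prior_map[None, :, :]], axis=0)

rng = np.random.default_rng(0)
img = rng.random((8, 8))
conv_out = rng.random((16, 8, 8))        # e.g. 16 channels from conv1
merged = bypass_concat(conv_out, self_quotient(img))
```

Subsequent layers then see both the learned channels and the prior channel, which is the "synergy point" the abstract describes: the prior enters from the side instead of replacing the raw input.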
This tutorial covers biomedical image reconstruction, from the foundational concepts of system modeling and direct reconstruction to modern sparsity and learning-based approaches. Imaging is a critical tool in biological research and medicine, and most imaging systems necessarily use an image reconstruction algorithm to create an image; the design of these algorithms has been a topic of research since at least the 1960s. In the last few years, machine learning-based approaches have shown impressive performance on image reconstruction problems, triggering a wave of enthusiasm and creativity around the paradigm of learning. Our goal is to unify this body of research, identifying common principles and reusable building blocks across decades and among diverse imaging modalities. We first describe system modeling, emphasizing how a few building blocks can be used to describe a broad range of imaging modalities. We then discuss reconstruction algorithms, grouping them into three broad generations. The first are the classical direct methods, including Tikhonov regularization; the second are the variational methods based on sparsity and the theory of compressive sensing; and the third are the learning-based (also called data-driven) methods, especially those using deep convolutional neural networks. There are strong links between these generations: classical (first-generation) methods appear as modules inside the latter two, and the former two are used to inspire new designs for learning-based (third-generation) methods. As a result, a solid understanding of all three generations is necessary for the design of state-of-the-art algorithms.
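For the first generation, the Tikhonov solution can be written down in closed form. The sketch below (a generic random forward operator standing in for a real imaging system, not tied to any particular modality) solves x_hat = (A^T A + lambda I)^(-1) A^T y for a 1-D signal.

```python
import numpy as np

def tikhonov(A, y, lam):
    """Closed-form Tikhonov-regularized reconstruction:
    x = argmin ||A x - y||^2 + lam ||x||^2 = (A^T A + lam I)^-1 A^T y."""
    n = A.shape[1]
    return np.linalg.solve(A.T @ A + lam * np.eye(n), A.T @ y)

# Toy "imaging system": a well-conditioned random forward operator,
# measurements corrupted by a small amount of noise.
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 10))
x_true = rng.standard_normal(10)
y = A @ x_true + 0.01 * rng.standard_normal(20)
x_hat = tikhonov(A, y, lam=1e-3)
```

Increasing `lam` trades data fidelity for a smaller-norm (more heavily regularized) solution; second- and third-generation methods replace this quadratic penalty with sparsity priors or learned regularizers, but the same `A` and `y` building blocks reappear inside them.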
ISBN:
(print) 9781538662496
We analyze the performance of feedforward vs. recurrent neural network (RNN) architectures and associated training methods for learned frame prediction. To this effect, we trained a residual fully convolutional neural network (FCNN), a convolutional RNN (CRNN), and a convolutional long short-term memory (CLSTM) network for next-frame prediction using the mean square loss. We performed both stateless and stateful training for the recurrent networks. Experimental results show that the residual FCNN architecture performs the best in terms of peak signal-to-noise ratio (PSNR), at the expense of higher training and test (inference) computational complexity. The CRNN can be trained stably and very efficiently using the stateful truncated backpropagation through time procedure, and it requires an order of magnitude less inference runtime to achieve near real-time frame prediction with an acceptable performance.
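The stateless-vs-stateful distinction can be sketched with a minimal recurrent cell (plain NumPy, forward pass only; the shapes and weights here are arbitrary): stateful training carries the hidden state across chunk boundaries and truncates only the backward pass, whereas stateless training resets the state to zero at every chunk, discarding context older than one chunk.

```python
import numpy as np

def rnn_forward(chunks, Wx, Wh, stateful=True):
    """Run a minimal recurrent cell over consecutive chunks of a sequence,
    returning the final hidden state after each chunk."""
    h = np.zeros(Wh.shape[0])
    outputs = []
    for chunk in chunks:
        if not stateful:
            h = np.zeros_like(h)   # stateless: reset at each chunk boundary
        for x in chunk:
            h = np.tanh(Wx @ x + Wh @ h)
        outputs.append(h.copy())
    return outputs

rng = np.random.default_rng(1)
Wx = rng.standard_normal((4, 3))
Wh = rng.standard_normal((4, 4)) * 0.5
seq = rng.standard_normal((6, 3))          # one sequence of 6 frames
chunks = [seq[:3], seq[3:]]                # split into two training chunks
out_stateful = rnn_forward(chunks, Wx, Wh, stateful=True)
out_stateless = rnn_forward(chunks, Wx, Wh, stateful=False)
```

Both runs agree on the first chunk (the initial state is zero either way), but diverge on the second: only the stateful run lets frame context flow across the chunk boundary, which is why stateful truncated BPTT can be both cheap per step and effective for frame prediction.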