Scientific user facilities present a unique set of challenges for imageprocessing due to the large volume of data generated from experiments and simulations. Furthermore, developing and implementing algorithms for re...
详细信息
View synthesis (VS) for light field images is a very time-consuming task due to the great quantity of involved pixels and intensive computations, which may prevent it from the practical three-dimensional real-time sys...
详细信息
View synthesis (VS) for light field images is a very time-consuming task due to the great quantity of involved pixels and intensive computations, which may prevent it from the practical three-dimensional real-time systems. In this article, we propose an acceleration approach for deep learning-based light field view synthesis, which can significantly reduce calculations by using compact-resolution (CR) representation and super-resolution (SR) techniques, as well as light-weight neuralnetworks. The proposed architecture has three cascaded neuralnetworks, including a CR network to generate the compact representation for original input views, a VS network to synthesize new views from down-scaled compact views, and a SR network to reconstruct high-quality views with full resolution. All these networks are jointly trained with the integrated losses of CR, VS, and SR networks. Moreover, due to the redundancy of deep neuralnetworks, we use the efficient light-weight strategy to prune filters for simplification and inference acceleration. Experimental results demonstrate that the proposed method can greatly reduce the processing time and become much more computationally efficient with competitive image quality.
There have been many advances in the artificial intelligence field due to the emergence of deep learning. In almost all sub-fields, artificialneuralnetworks have reached or exceeded human-level performance. However,...
详细信息
ISBN:
(纸本)9789897584862
There have been many advances in the artificial intelligence field due to the emergence of deep learning. In almost all sub-fields, artificialneuralnetworks have reached or exceeded human-level performance. However, most of the models are not interpretable. As a result, it is hard to trust their decisions, especially in life and death scenarios. In recent years, there has been a movement toward creating explainable artificial intelligence, but most work to date has concentrated on imageprocessing models, as it is easier for humans to perceive visual patterns. There has been little work in other fields like natural language processing. In this paper, we train a convolutional model on textual data and analyze the global logic of the model by studying its filter values. In the end, we find the most important words in our corpus to our model's logic and remove the rest (95%). New models trained on just the 5% most important words can achieve the same performance as the original model while reducing training time by more than half. Approaches such as this will help us to understand NLP models, explain their decisions according to their word choices, and improve them by finding blind spots and biases.
The emerging technology of adversarial information hiding can generate adversarial example by embedding useful information instead of meaningless noise into the host image. The obtained image can deceive classificatio...
详细信息
The emerging technology of adversarial information hiding can generate adversarial example by embedding useful information instead of meaningless noise into the host image. The obtained image can deceive classification models and therefore equip the function of privacy protection. However, existing research mainly focuses on the effectiveness of attack rather than preserving the integrity of the protected image. It cannot meet the requirements in the fields such as medical imaging, financial transactions, and copyright protection where users' raw data should not be damaged. Therefore, we propose a novel reversible adversarial information hiding based on interpretability of neuralnetworks. The Grad-CAM-generated heatmap is employed to identify a minimal set of high-impact pixels for embedding, ensuring that minor modifications can induce significant misclassification. Then, the user-defined secret bits are embedded into the identified pixels by using difference expansion. After extraction, the original image can be perfectly restored. The technologies of adversarial example and reversible information hiding are combined to accommodate wider applications. The proposed method is tested on CIFAR-10 with three different neural architectures (NiN, AlexNet, ResNet). Experimental results show that the proposed method can completely restore the cover image while ensuring the extraction of the embedded data and misleading the neural network.
Depth-estimation from a single input image can be used in applications such as robotics and autonomous driving. Recently, depth-estimation networks with UNet encoder/decoder structures have been widely used. In these ...
详细信息
Depth-estimation from a single input image can be used in applications such as robotics and autonomous driving. Recently, depth-estimation networks with UNet encoder/decoder structures have been widely used. In these decoders, operations are repeated to gradually increase the image resolution, while decreasing the channel size. If the upsampling operation at a high magnification can be processed at once, the amount of computation in the decoder can be dramatically reduced. To achieve this, we propose a new network structure, i.e., a cocktail glass network. In this network, convolution layers in the decoder are reduced, and a novel fast upsampling method is used that is known as channel-to-space unrolling, which converts thick channel data into high-resolution data. The proposed method can be easily implemented using simple reshaping operations;therefore, it is suitable for reducing the depth-estimation network. Considering the experimental results based on the NYU V2 and KITTI datasets, we demonstrate that the proposed method reduces the amount of computation in the decoder by half, while maintaining the same level of accuracy;it can be used in both lightweight and large-model-capacity networks.
The proceedings contain 63 papers. The special focus in this conference is on Recent Trends in Machine Learning, IoT, Smart Cities and applications. The topics include: Early-Onset Identification of Stomach Cancer Usi...
ISBN:
(纸本)9789811960871
The proceedings contain 63 papers. The special focus in this conference is on Recent Trends in Machine Learning, IoT, Smart Cities and applications. The topics include: Early-Onset Identification of Stomach Cancer Using CNN;multifold Secured Bank Application Authentication Service Using Random Visual Cryptography and Multimodal Steganography with Blockchain Technology;role of Blockchain in Health Care: A Comprehensive Study;a Novel Three Dense Layered Deep Fully Connected neural Network for Hyperspectral image Classification;research Trends in artificial Intelligence and Nature Inspired Techniques;comparative Analysis of Software Defect Prediction Using Dimensionality Reduction;an Approach to Learn Structural Similarity Between Decision Trees Using Hungarian Algorithm;design and Development of IoT-Based Intelligent Cattle Shed Management;review Paper on Technologies to Curb Noise Pollution in No Honking Zones;security Threats in Healthcare Systems—A Bibliometric Study;news Channel Debate Analysis: A Detailed Insight;disease Recognization of Plant Using Different imageprocessing Algorithm;a Detailed Survey on Sentimental Analysis on Social Media;perceiving Correlation Among Spatiotemporal Gait Parameters and Verifying Its Relation Using Machine Learning Classification Technique Pilot Study for Indian Population;radial Basis neural Network Trained Minimum Snap Trajectory for Quadrotor;REMICARE—Medicine Intake Tracker and Healthcare Assistant;Detection of Fraudulent Credit Card Transactions in Real Time Using SparkML and Kafka;real-Time Face Detection and Face Recognition: Study of Approaches;ASIC Implementation of AI Edge Network on Chip (NoC) at 28 nm Technology Node and Its Various Timing Optimization Techniques at Each Stage;3D IC Integration Using Blockchain;Impact of COVID-19 on Import and Export of Petroleum Products and Crude Oil in India.
Deep learning has driven remarkable advancements in medical image segmentation. The requirement for comprehensive annotations, however, poses a significant challenge due to the labor-intensive and expensive nature of ...
详细信息
Deep learning has driven remarkable advancements in medical image segmentation. The requirement for comprehensive annotations, however, poses a significant challenge due to the labor-intensive and expensive nature of expert annotation. Addressing this challenge, we introduce a multiple mixed-supervisory signals learning (MSL) strategy, MixSegNet, that synergistically harnesses the benefits of Fully-Supervised (FSL), Weakly-Supervised (WSL), and Semi-Supervised Learning (SSL). This approach enables the utilization of various data-efficient annotations for network training, promoting efficient medical image segmentation within realistic clinical scenarios. MixSegNet concurrently trains networks with a combination of limited dense labels, a larger proportion of cost-efficient sparse labels, and unlabeled data. The networks utilized in this system comprise Vision Transformer (ViT) and Convolutional neuralnetworks (CNN), which work together via an effective strategy including network self-ensembling and label dynamic-ensembling. This strategy adeptly handles the training challenges arising from datasets with limited or absent supervisory signals. We validated MixSegNet on a public Magnetic Resonance Imaging (MRI) cardiac segmentation benchmark dataset. It demonstrated superior performance compared to 21 other SSL or WSL baseline methods under similar labeling-cost conditions, as supported by comprehensive evaluation metrics, and slightly outperform classical FSL methods. The code for MixSegNet, all baseline methods, and the data pre-processing techniques with the datasets for different annotation situations are available at https://***/ziyangwang007/MixSegNet.
Earth Observation Satellite constellations are more and more requested by customers to fit their strategic needs. They offer the ability to collect large amounts of data from various places, from various sensors. They...
详细信息
Earth Observation Satellite constellations are more and more requested by customers to fit their strategic needs. They offer the ability to collect large amounts of data from various places, from various sensors. They offer versatility, robustness and an undeniable strategic advantage of revisit. Classical way of operations, where every satellite, subsystems and the ground segment were operated by a team, does fit neither technically nor commercially for constellations. artificial intelligence (AI) has proven its value in multiple applications. Automated mission operations, especially in multi-missions context, have a high impact in terms of efficiency where reduction of human interactions is foreseen: routine operations, monitoring, data processing and ground segment health. Airbus has already developed automated collision avoidance maneuver (CAM), automated mission plan uplink and advanced monitoring that already allow us to operate with a limited number of operators. AI-based image production and analysis is one of the latest developments Airbus has performed: change detection and Deepzoom are now available on the market. AI is set to play a major role in the automation of future EO systems, enabled by the significant advances in Machine Learning techniques of recent years. Data-driven AI-based solutions can be developed to improve operations effectiveness. Multi-variable multi-subsystems analysis (based on neuralnetworks) for time series can be used to predict future behaviors. Identifying a potential upcoming failure before it occurs and proposing preventive procedures to reduce the downtime is one of the leaps where the classical way of operations is outdated. The spacecraft can no longer be controlled individually by operators: the impact of on-board AI will drastically shake current CONOPS. Today, Airbus is working on artificial intelligence concepts for the next-gen EO constellations: AI on board and on ground to reach the lights-out center, where AI and a
Object detection algorithms are widely used in image classification and detection tasks recently. Among the other algorithms, You Only Look Once (YOLO) performs better in real-time detection. Convolutional neural netw...
详细信息
The proceedings contain 26 papers. The topics discussed include: deep transfer learning of traversability assessment for heterogeneous robots;on the incremental construction of deep rule theories;text-to-ontology mapp...
The proceedings contain 26 papers. The topics discussed include: deep transfer learning of traversability assessment for heterogeneous robots;on the incremental construction of deep rule theories;text-to-ontology mapping via natural language processing models;tams: text augmentation using most similar synonyms for SMS spam filtering;using artificialneuralnetworks to determine ontologies most relevant to scientific texts;learning to segment from object sizes;beyond sensor data analysis: unexpected challenges in a honeybee monitoring project;image classifier with dynamic set of known classes;analysis of the semantic vector space induced by a neural language model and a corpus;on reducing automata and their normalizations;and keyphrase extraction from Slovak court decisions.
暂无评论