Nowadays, deeplearning architectures like CNN have proven their superiority in image recognition tasks. To effectively deploy CNN networks in practice, especially for AIoT applications, it is essential to find a netw...
详细信息
This research work introduces a novel application of Artificial Intelligence (AI) in monument identification, utilizing the state-of-the-art object detection model YOLOv8. With the aim of revolutionizing the study, pr...
详细信息
The increasing complexity and interconnectedness of Internet of Things (IoT) software systems necessitate the development of intelligent solutions for predictive maintenance and security. Conventional techniques often...
详细信息
Accurate delineation of agricultural fields from satellite imagery is crucial for digital agriculture and conservation. The Segment Anything Model (SAM), a state-of-the-art image segmentation model, brings new possibi...
详细信息
Accurate delineation of agricultural fields from satellite imagery is crucial for digital agriculture and conservation. The Segment Anything Model (SAM), a state-of-the-art image segmentation model, brings new possibilities for this task. However, its feasibility under different agricultural contexts remains unclear, and there are open questions regarding model parametrization, image preprocessing, and integration into an operational framework. This study proposes a new SAM-assisted crop field extraction framework (FieldSeg) using 2022 Sentinel-2 temporal composites and presents the lessons learned using this foundational model in eight agricultural regions across the world. Through rigorous experiments, this study optimized FieldSeg in three stages: input data preparation, model parametrization and patch management, and final fine parametrization. This study explored different bands and temporal metrics combinations and defined a set of optimal configurations for the framework based on performance and processingtime. Non-agricultural objects segmented using SAM were removed using an annual crop mask derived from Google Dynamic World. While performance was low to moderate in regions with small fields (<5ha in China, South Africa, and Spain), FieldSeg achieved a promising performance in the study areas with medium-large fields (>= 5 ha in Argentina, Australia, Brazil, USA-California, and USA-Iowa), with the rates of correctly extracted fields ranging from 0.541 to 0.814. The extracted fields showed a good segmentation quality, with mean dice coefficients ranging from 0.735 to 0.847. The large-scale applicability of FieldSeg was also demonstrated in four countries (1 million square kilometers), showing promising results and the ability to generalize across different regions.
This senior thesis develops a real-time handwritten digit identification system using a Raspberry Pi 3B+ with a camera module, leveraging a lightweight CNN optimized with MNIST. The project highlights the effective im...
详细信息
ISBN:
(数字)9798350350821
ISBN:
(纸本)9798350350838
This senior thesis develops a real-time handwritten digit identification system using a Raspberry Pi 3B+ with a camera module, leveraging a lightweight CNN optimized with MNIST. The project highlights the effective implementation of deeplearning on edge computing devices through seamless integration of CNN, TensorFlow Lite, and OpenCV's real-timeimageprocessing. The system is both cost-effective and precise, enabling real-time digit recognition tasks. This proposed work illustrates the potential of AI applications in education, industry, and commerce, setting the stage for future advancements in embedded AI systems.
This research work introduces an innovative approach to multimedia content creation by incorporating emotion and sentiment analysis into a Generative Adversarial Network (GAN) framework. The system dynamically detects...
详细信息
This research paper introduces a novel pipeline for image generation and enhancement, integrating advanced techniques from stable diffusion and deep neural network (DNN) super-resolution. The pipeline consists of thre...
详细信息
Agriculture production plays a significant role in the country's economy. Diseases are quite natural and common among plants. Identification of diseases in plants is necessary for averting losses in the yield of a...
详细信息
Agriculture production plays a significant role in the country's economy. Diseases are quite natural and common among plants. Identification of diseases in plants is necessary for averting losses in the yield of agricultural products. Manual monitoring of plants requires expertise, immense effort, and excessive time. Automatic detection will not only help in reducing time and effort but will also help in detecting disease at an early stage, as soon as it will start appearing on plant leaves. Recently, imageprocessing in agriculture has attained a surge of interest by researchers. This study presents a five-layered CNN model for automatic detection of plant disease utilizing leaf images. In order to better train a CNN model, 20,000 augmented images are generated. Experimental results demonstrate that proposed optimized-CNN model can predict pepper bell plant leaf as healthy or bacterial with 99.99% accuracy. Robust results make the proposed optimized-CNN model a preliminary warning tool that can be applied as a disease identification system in a real cultivation environment.
The transformer-based deep neural network (DNN) models have shown considerable success across diverse tasks, prompting widespread adoption of distributed training methods such as data parallelism and pipeline parallel...
详细信息
The transformer-based deep neural network (DNN) models have shown considerable success across diverse tasks, prompting widespread adoption of distributed training methods such as data parallelism and pipeline parallelism. With the increasing parameter number, hybrid parallel training becomes imperative to scale training. The primary bottleneck in scaling remains the communication overhead. The communication scheduling technique, emphasizing the overlap of communication with computation, has demonstrated its benefits in scaling. However, most existing works focus on data parallelism, overlooking the nuances of hybrid parallel training. In this paper, we propose TriRace, an efficient communication scheduling framework for accelerating communications in hybrid parallel training of asynchronous pipeline parallelism and data parallelism. To achieve effective computation-communication overlap, TriRace introduces 3D communication scheduling, which adeptly leverages data dependencies between communication and computations, efficiently scheduling AllReduce communication, sparse communication, and peer-to-peer communication in hybrid parallel training. To avoid possible communication contentions, TriRace also incorporates a topology-aware runtime which optimizes the execution of communication operations by considering ongoing communication operations and real-time network status. We have implemented a prototype of TriRace based on PyTorch and Pipedream-2BW, and conducted comprehensive evaluations with three representative baselines. Experimental results show that TriRace achieves up to 1.07-1.45x speedup compared to the state-of-the-art pipeline parallelism training baseline Pipedream-2BW, and 1.24-1.81x speedup compared to the Megatron.
In the dynamic realm of digital advertising, enhancing click-through and conversion rates within ad extensions remains a significant challenge for agencies. Ad extensions play a pivotal role in amplifying traditional ...
详细信息
暂无评论