In the context of small software development teams, this research article gives a thorough investigation of the adoption of test-driven development (TDD) approaches. It aims to highlight the benefits that TDD offers, ...
详细信息
We proposed new prediction models based on multilayer perceptron(MLP)which successfully predict the maximum run-up of landslide-generated tsunami waves and assess the role of parameters affecting *** input is approxim...
详细信息
We proposed new prediction models based on multilayer perceptron(MLP)which successfully predict the maximum run-up of landslide-generated tsunami waves and assess the role of parameters affecting *** input is approximately 55,000 rows of data generated through an analytical solution employing slide’s cross section,initial submergence,vertical thickness,horizontal length,beach slope angle and the maximum run-up itself,along with its occurrence *** parameters are first ranked through a feature selection algorithm and six models are constructed for a 9,000-row randomly sampled *** MLP-based models led predictions with a minimum Mean Absolute Percentage Error of 1.1%and revealed that vertical slide thickness has the largest impact on the maximum tsunami run-up,whereas beach slope angle has minimal *** parison with existing literature showed the reliability and applicability of the offered *** methodology introduced here can be suggested as fast and flexible method for prediction of landslide-induced tsunami run-up.
Head pose estimation methods can be generally classified into two categories: model-based and appearance-based methods. The model-based approach relies on facial landmarks for three-dimensional reconstruction, aiming ...
详细信息
PROBLEM Recent years have witnessed the rapid progress of self-supervised language models (LMs)[1],especially large language models (LLMs)[2].LLMs not only achieved state-of-the-art performance on many natural languag...
PROBLEM Recent years have witnessed the rapid progress of self-supervised language models (LMs)[1],especially large language models (LLMs)[2].LLMs not only achieved state-of-the-art performance on many natural language processing tasks,but also captured widespread attention from the public due to their great potential in a variety of real-world applications (***,search engines,writing assistants,etc.)through providing general-purpose intelligent services.A few of the LLMs are becoming foundation models,an analogy to infrastructure,that empower hundreds of downstream applications.
Feature selection (FS) is an important data pre-processing technique in classification. It aims to remove redundant and irrelevant features from the data, which reduces the dimensionality of data and improves the perf...
详细信息
Testing is an inevitable part of any softwareengineering process to ensure quality and reliability. Model-based testing is a successful approach for the automated generation of test cases but requires a model of the ...
详细信息
Automatic crack detection of cement pavement chiefly benefits from the rapid development of deep learning,with convolutional neural networks(CNN)playing an important role in this ***,as the performance of crack detect...
详细信息
Automatic crack detection of cement pavement chiefly benefits from the rapid development of deep learning,with convolutional neural networks(CNN)playing an important role in this ***,as the performance of crack detection in cement pavement improves,the depth and width of the network structure are significantly increased,which necessitates more computing power and storage *** limitation hampers the practical implementation of crack detection models on various platforms,particularly portable devices like small mobile *** solve these problems,we propose a dual-encoder-based network architecture that focuses on extracting more comprehensive fracture feature information and combines cross-fusion modules and coordinated attention mechanisms formore efficient feature ***,we use small channel convolution to construct shallow feature extractionmodule(SFEM)to extract low-level feature information of cracks in cement pavement images,in order to obtainmore information about cracks in the shallowfeatures of *** addition,we construct large kernel atrous convolution(LKAC)to enhance crack information,which incorporates coordination attention mechanism for non-crack information filtering,and large kernel atrous convolution with different cores,using different receptive fields to extract more detailed edge and context ***,the three-stage feature map outputs from the shallow feature extraction module is cross-fused with the two-stage feature map outputs from the large kernel atrous convolution module,and the shallow feature and detailed edge feature are fully fused to obtain the final crack prediction *** evaluate our method on three public crack datasets:DeepCrack,CFD,and *** results on theDeepCrack dataset demonstrate the effectiveness of our proposed method compared to state-of-the-art crack detection methods,which achieves Precision(P)87.2%,Recall(R)87.7%,and F-score(F1)87.4%.Thanks to our lightweight cr
Dialogue generation systems (DGS) is an important topic in the field of Natural Language Processing (NLP). It enables a wide range of real-world applications to interact with humans in various languages naturally and ...
详细信息
Generating uniform design tables (UDTs) is the first step to experimenting efficiently and effectively, and is also one of the most critical steps. Thus, the construction of uniform design tables has received much att...
详细信息
In this paper, we introduce InternVL 1.5, an open-source multimodal large language model(MLLM) to bridge the capability gap between open-source and proprietary commercial models in multimodal understanding. We introdu...
详细信息
In this paper, we introduce InternVL 1.5, an open-source multimodal large language model(MLLM) to bridge the capability gap between open-source and proprietary commercial models in multimodal understanding. We introduce three simple improvements.(1) Strong vision encoder: we explored a continuous learning strategy for the large-scale vision foundation model — InternViT-6B, boosting its visual understanding capabilities, and making it can be transferred and reused in different LLMs.(2) Dynamic high-resolution: we divide images into tiles ranging from 1 to 40 of 448×448 pixels according to the aspect ratio and resolution of the input images, which supports up to 4K resolution input.(3) High-quality bilingual dataset: we carefully collected a high-quality bilingual dataset that covers common scenes, document images,and annotated them with English and Chinese question-answer pairs, significantly enhancing performance in optical character recognition(OCR) and Chinese-related tasks. We evaluate InternVL 1.5 through a series of benchmarks and comparative studies. Compared to both open-source and proprietary commercial models, InternVL 1.5 shows competitive performance, achieving state-of-the-art results in 8 of 18 multimodal benchmarks. Code and models are available at https://***/OpenGVLab/InternVL.
暂无评论