To meet the current demand in China for Eucalyptus globulus and Acacia mangium mixed pulping, a study was conducted to collect the near infrared (NIR) spectra of 150 mixed samples of E. globulus and A. mangium in whic...
详细信息
To meet the current demand in China for Eucalyptus globulus and Acacia mangium mixed pulping, a study was conducted to collect the near infrared (NIR) spectra of 150 mixed samples of E. globulus and A. mangium in which the content of E. globulus was manually controlled. After the original spectra were pretreated by first derivative and standard normal variate (SNV), the least absolute shrinkage and selection operator (lasso) algorithm and cross-validation were used to calculate the optimal adjustment parameters of 14.30, 19.16, 12.10, and 9.74, respectively. The optimal calibration models for the content of E. globulus, holocellulose, pentosan, and acid insoluble lignin were generated. An independent verification of the calibration models showed that the root mean square error of prediction (RMSEP) for these models was 1.59%, 0.54%, 0.66%, and 0.40%, respectively. The absolute deviation (AD) was -2.58% to 2.73%, -0.91% to 0.84%, -1.19% to 1.06%, and -0.61% to 0.64%, respectively. The prediction performance of the four models was sufficient for real-time analysis in the pulping production line. The lasso algorithm was judged to be efficient for the prediction and analysis of mixed raw materials in pulping industry.
With T + 0 and short selling mechanism, the stock index futures are attractive to short-term traders in China, where stocks cannot be liquidated within the day and are difficult to short. So in terms of futures, how t...
详细信息
With T + 0 and short selling mechanism, the stock index futures are attractive to short-term traders in China, where stocks cannot be liquidated within the day and are difficult to short. So in terms of futures, how to improve the accuracy and speed of intraday price forecasting always fascinates short-term traders and researchers. Here we propose a novel forecasting model, VIX-lasso-GRU Model, which based on the gated recurrent unit (GRU) by adding VIX information and a method called Least absolute shrinkage and selection operator (lasso). The volatility index (VIX) can reduce the prediction errors and the lasso algorithm significantly improve the training speed of the model. We predict the 5-minute closing prices of three datasets of index futures by VIX-lasso-GRU Model. Comparing to the pure GRU and LSTM, we find that this new prediction model can improve the prediction efficiency with faster speed and higher accuracy.
Many factors impact water quality (WQ), such as climate change and population growth. Thus, the present work aims to propose an accurate and potent solution for the WQ instabilities challenge in the South Platte River...
详细信息
Many factors impact water quality (WQ), such as climate change and population growth. Thus, the present work aims to propose an accurate and potent solution for the WQ instabilities challenge in the South Platte River in United States. The data driven model based on the machine learning model tuned with Kalman filter (KF) was considered to reduce input data noise. The least absolute shrinkage and selection operator (lasso) algorithm were used to analyze the importance of features and select the best inputs. The US Geological Survey (USGS) archive provided the primary database related to 2023-2024, with over 38,000 samples. The random forest (RF) was combined with KF and lasso to reduce noise and analyze the importance of features due to the high number of samples. Artificial neural network (ANN), linear regression (LR), and support vector machine (SVM) were developed to compare the accuracy of the proposed model. The proposed model had the highest coefficient of determination values, which were between 0.95 and 0.99. Modeling the indicators revealed that some WQ variations could negatively affect aquatic ecosystems.
It is crucial for rural residents to consciously separate their domestic waste to improve their living environment and build beautiful villages. Exploring the decisive factors of household waste separation in rural Ch...
详细信息
It is crucial for rural residents to consciously separate their domestic waste to improve their living environment and build beautiful villages. Exploring the decisive factors of household waste separation in rural China can offer more precise guidance for waste segregation, thereby contributing to the establishment of a more efficient and sustainable waste management system. Using the 2022 China rural revitalization comprehensive survey data (CRRS), this paper combined the least absolute shrinkage and selection operator (lasso) algorithm with the Logit model to identify the determinants of waste separation in rural China from multiple aspects. It was found that families with better internet conditions, more equipment, and online training are more willing to participate in waste separation. Families who are more satisfied with the various tasks of the village committee are more willing to separate waste. Additionally, greater concern about food safety, and active learning of health knowledge are more inclined to engage in waste separation. Thus, this paper proposes improving internet conditions, enhancing satisfaction with village cadre work and promoting health awareness education.
Background This study aimed to identify differentially expressed genes (DEGs) that are associated with hepatocarcinogenesis and metastasis in hepatocellular carcinoma (HCC) and to explore their value in predicting ove...
详细信息
Background This study aimed to identify differentially expressed genes (DEGs) that are associated with hepatocarcinogenesis and metastasis in hepatocellular carcinoma (HCC) and to explore their value in predicting overall survival (OS). The methods used included bioinformatics analysis of gene expression datasets and in vitro experiments using HCC cell lines. Methods Gene expression profiles from metastatic and non-metastatic liver cancer specimens were analyzed using the limma R package. Functional enrichment was performed using Metascape. A prognostic 5-gene signature was constructed using the lasso algorithm based on TCGA-LIHC data. Kaplan-Meier survival analysis assessed the association of these genes with clinical outcomes (DFI, DSS, OS, and PFS). In vitro, Huh7 and Hep3B cells were transfected with shRNA for SPP1 knockdown. Cell viability was measured with CCK-8 assays, and migration was assessed with Transwell and wound-healing assays. Protein expression was evaluated via western blotting. Results The analysis of gene expression profiles led to the identification of 11 DEGs associated with immune response, phagocytosis, and cell migration. From these DEGs, the lasso algorithm identified a 5-DEG signature (MASP1, MASP2, MUC1, TREM1, and SPP1) that was predictive of OS in liver cancer patients. Among the five genes, SPP1 was the most upregulated in cancer samples and was significantly associated with poorer outcomes, including DFI, DSS, OS, and PFS. In vitro experiments confirmed that SPP1 knockdown in Huh7 and Hep3B cells significantly inhibited cancer cell viability and migration. Western blot analysis showed alterations in key proteins, with a reduction in vimentin and Ki-67 and an increase in E-cadherin following SPP1 knockdown. Conclusion This study highlights the pivotal effect of SPP1 on HCC development and underscores its potential as a biomarker for the OS of liver cancer patients. The identified DEGs may serve as predictive markers for OS and potentia
Chronic obstructive pulmonary disease (COPD) is a common lung disease that can lead to restricted airflow and respiratory problems, causing a significant health, economic, and social burden. Detecting the COPD stage c...
详细信息
Chronic obstructive pulmonary disease (COPD) is a common lung disease that can lead to restricted airflow and respiratory problems, causing a significant health, economic, and social burden. Detecting the COPD stage can provide a timely warning for prompt intervention in COPD patients. However, existing methods based on inspiratory (IN) and expiratory (EX) chest CT images are not sufficiently accurate and efficient in COPD stage detection. The lung region images are autonomously segmented from IN and EX chest CT images to extract the 1, 781 x 2 lung radiomics and 13, 824 x 2 3D CNN features. Furthermore, a strategy for concatenating and selecting features was employed in COPD stage detection based on radiomics and 3D CNN features. Finally, we combine all the radiomics, 3D CNN features, and factor risks (age, gender, and smoking history) to detect the COPD stage based on the Auto-Metric Graph Neural Network (AMGNN). The AMGNN with radiomics and 3D CNN features achieves the best performance at 89.7% of accuracy, 90.9% of precision, 89.5% of F1-score, and 95.8% of AUC compared to six classic machine learning (ML) classifiers. Our proposed approach demonstrates high accuracy in detecting the stage of COPD using both IN and EX chest CT images. This method can potentially establish an efficient diagnostic tool for patients with COPD. Additionally, we have identified radiomics and 3D CNN as more appropriate biomarkers than Parametric Response Mapping (PRM). Moreover, our findings indicate that expiration yields better results than inspiration in detecting the stage of COPD.
In the dynamic landscape of China's booming economy, the surge in e-commerce customer volume presents both opportunities and challenges, notably in managing customer churn (CC). Addressing this critical issue, thi...
详细信息
In the dynamic landscape of China's booming economy, the surge in e-commerce customer volume presents both opportunities and challenges, notably in managing customer churn (CC). Addressing this critical issue, this study introduces an innovative approach employing a radial basis function neural network for predicting CC within the e-commerce sector. To enhance the model's performance in handling the vast and complex data inherent to e-commerce, the least absolute shrinkage and selection operator regression algorithm is employed, optimizing the model's predictive accuracy. By meticulously analyzing the customer lifecycle, this refined model adeptly predicts churn at various stages, enabling the identification of features most correlated with churn. Empirical results underscore the model's exceptional capability, achieving a prediction accuracy of 95% and a remarkably low loss rate of 3%. Furthermore, during the excavation, advanced, stable, and decline stages of the customer lifecycle, accuracy levels of 97.6, 93.1, 92.7, and 91.8% are attained, respectively, facilitating the precise selection of highly correlated customer features. Thus, the advanced churn prediction model proposed herein significantly contributes to the e-commerce domain, offering a robust tool for strategizing customer retention and mitigating churn.
Rapid and accurate estimation of soil petroleum hydrocarbon content is crucial for analyzing the degree of soil pollution and evaluating pollution status. Surface soil samples and hyperspectral measurements in the res...
详细信息
Rapid and accurate estimation of soil petroleum hydrocarbon content is crucial for analyzing the degree of soil pollution and evaluating pollution status. Surface soil samples and hyperspectral measurements in the reservoir area in Lenghu Town, Qinghai Province, China, were viewed as research objects, and the correlation between different spectral forms of original data and petroleum hydrocarbon content in soil was analyzed. To improve the estimation accuracy, we proposed a solution that introduces least absolute shrinkage and selection operator (lasso) combined with extremely randomized trees (ERT) and gradient boosting decision tree (GBDT) ensemble learning for constructing hyperspectral estimation model. The results show: lasso algorithm can not only solve the spectral multicollinearity problem effectively but also reduce the number and calculation complexity of soil hyperspectral variables to a great extent. Compared with traditional machine learning, ERT and GBDT perform superior. In particular, the estimation accuracy of the lasso-GBDT model is the highest.
PurposeTo retrospectively evaluate the diagnostic potential of magnetic resonance imaging (MRI)-based features and radiomics analysis (RA)-based features for discriminating ovarian clear cell carcinoma (CCC) from endo...
详细信息
PurposeTo retrospectively evaluate the diagnostic potential of magnetic resonance imaging (MRI)-based features and radiomics analysis (RA)-based features for discriminating ovarian clear cell carcinoma (CCC) from endometrioid carcinoma (EC).Materials and methodsThirty-five patients with 40 ECs and 42 patients with 43 CCCs who underwent pretherapeutic MRI examinations between 2011 and 2022 were enrolled. MRI-based features of the two groups were compared. RA-based features were extracted from the whole tumor volume on T2-weighted images (T2WI), contrast-enhanced T1-weighted images (cT1WI), and apparent diffusion coefficient (ADC) maps. The least absolute shrinkage and selection operator (lasso) regression with tenfold cross-validation method was performed to select features. Logistic regression analysis was conducted to construct the discriminating models. Receiver operating characteristic curve (ROC) analyses were performed to predict *** features with the highest absolute value of the lasso algorithm were selected for the MRI-based, RA-based, and combined models: the ADC value, absence of thickening of the uterine endometrium, absence of peritoneal dissemination, and growth pattern of the solid component for the MRI-based model;Gray-Level Run Length Matrix (GLRLM) Long Run Low Gray-Level Emphasis (LRLGLE) on T2WI, spherical disproportion and Gray-Level Size Zone Matrix (GLSZM), Large Zone High Gray-Level Emphasis (LZHGE) on cT1WI, and GLSZM Normalized Gray-Level Nonuniformity (NGLN) on ADC map for the RA-based model;and the ADC value, spherical disproportion and GLSZM_LZHGE on cT1WI, and GLSZM_NGLN on ADC map for the combined model. Area under the ROC curves of those models were 0.895, 0.910, and 0.956. The diagnostic performance of the combined model was significantly superior (p = 0.02) to that of the MRI-based model. No significant differences were observed between the combined and RA-based *** MRI-based analysis can effecti
Under the guidance of the goal of the "carbon peaking and carbon neutrality", due to the high proportion of renewable energy and the high proportion of power electronic equipment, the power system will bring...
详细信息
Under the guidance of the goal of the "carbon peaking and carbon neutrality", due to the high proportion of renewable energy and the high proportion of power electronic equipment, the power system will bring strong randomness, low inertia and other characteristics, causing a large number of frequency stability problems. In order to solve the problems of traditional power system frequency prediction methods, such as difficulty in modeling and poor prediction accuracy, and to determine whether the frequency stability problem will occur after the wind power grid-connected system is disturbed, the lasso algorithm is first used to reduce the dimension of the input data, and then the attention mechanism based long and short memory (attention LSTM) neural network is used to predict the output frequency curve, The network parameters are optimized by the global search algorithm Whale Optimization algorithm (WOA). Finally, the accuracy of the algorithm is verified by taking the improved wind power grid-connected 39 bus as an example. The results show that this method has a good guiding significance for evaluating the frequency stability of the wind power grid-connected system after interference, and can effectively predict the frequency curve of the system after interference, and evaluate the frequency stability of the system after topology change. (c) 2023 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://***/licenses/by-nc-nd/4.0/).
暂无评论