Browsing by Keyword "Random forest"
Now showing 1 - 3 of 3
Results Per Page
Sort Options
Item Data Driven Performance Prediction in Steel Making(2022-01-18) Boto, Fernando; Murua, Maialen; Gutierrez, Teresa; Casado, Sara; Carrillo, Ana; Arteaga, Asier; Tecnalia Research & Innovation; FACTORY; CIRMETAL; PROMETALThis work presents three data-driven models based on process data, to estimate different indicators related to process performance in a steel production process. The generated models allow the optimization of the process parameters to achieve optimal performance and quality levels. A new approach based on ensembles has been developed with feature selection methods and four state-of-the-art regression approximations (random forest, gradient boosting, xgboost and neural networks). The results show that the proposed approach makes the prediction more stable reducing the variance for all cases, even in one case, slightly reducing the bias. Furthermore, from the four machine learning paradigms presented, random forest is the one with the best results in a quantitative way, obtaining a coefficient of determination of 0.98 as a maximum, depending on the target sub-process.Item Relationship between electrocardiogram‐based features and personality traits: Machine learning approach: Machine learning approach(2022-01) Boljanić, Tanja; Miljković, Nadica; Lazarevic, Ljiljana B.; Knezevic, Goran; Milašinović, Goran; SGBackground: Based on the known relationship between the human emotion and standard surface electrocardiogram (ECG), we explored the relationship between features extracted from standard ECG recorded during relaxation and seven personality traits (Honesty/humility, Emotionality, eXtraversion, Agreeableness, Conscientiousness, Openness, and Disintegration) by using the machine learning (ML) approach which learns from the ECG-based features and predicts the appropriate personality trait by adopting an automated software algorithm. Methods: A total of 71 healthy university students participated in the study. For quantification of 62 ECG-based parameters (heart rate variability, as well as temporal and amplitude-based parameters) for each ECG record, we used computation procedures together with publicly available data and code. Among 62 parameters, 34 were segregated into separate features according to their diagnostic relevance in clinical practice. To examine the feature influence on personality trait classification and to perform classification, we used random forest ML algorithm. Results: Classification accuracy when clinically relevant ECG features were employed was high for Disintegration (81.3%) and Honesty/humility (75.0%) and moderate to high for Openness (73.3%) and Conscientiousness (70%), while it was low for Agreeableness (56.3%), eXtraversion (47.1%), and Emotionality (43.8%). When all calculated features were used, the classification accuracies were the same or lower, except for the eXtraversion (52.9%). Correlation analysis for selected features is presented. Conclusions: Results indicate that clinically relevant features might be applicable for personality traits prediction, although no remarkable differences were found among selected groups of parameters. Physiological associations of established relationships should be further explored.Item Sensitivity Studies for a Hybrid Numerical–Statistical Short-Term Wind and Gust Forecast at Three Locations in the Basque Country (Spain)(MDPI AG, 2020) Carreno-Madinabeitia, Sheila; Ibarra-Berastegi, Gabriel; Sáenz, Jon; Zorita, Eduardo; Ulazia, AlainThis study evaluates the performance of statistical models applied to the output of numerical models for short-term (1–24 h) hourly wind forecasts at three locations in the Basque Country. The target variables are horizontal wind components and the maximum wind gust at 3 h intervals. Statistical approaches such as persistence, analogues, linear regression, and random forest (RF) are used. The verification statistics used are coefficient of determination (R2) and root mean square error (RMSE). Statistical models use three inputs: (1) Local wind observations; (2) extended EOFs (empirical orthogonal functions) derived from past local observations and ERA-Interim variables in a previous 24-h period covering a domain around the area of study; and (3) wind forecasts provided by ERA-Interim. Results indicate that, for horizons less than 1–4 h, persistence is the best model. For longer predictions, RF provides the best forecasts. For horizontal components at 4–24 h horizons, RF slightly outperformed ERA-Interim wind forecasts. For gust, RF performs better than ERA-Interim for all the horizons. Persistence is the most influential factor for 2–5 h. Beyond this horizon, predictors from the ERA-Interim wind forecasts led the contribution. Hybrid numerical–statistical methods can be used to improve short-term wind forecasts.