solar_plant.ipynb that demonstrates an end-to-end pipeline for detecting failure events in solar plants. The notebook covers data preparation, visualization, model training, evaluation, and finally, model persistence. Below is an overview of the main sections and processes:pandas, numpymatplotlib, seaborntrain_test_split, StandardScaler, and LabelEncoderRandomOverSampler from imblearnXGBClassifier from the XGBoost packagejoblibwarnings (to ignore unnecessary warnings) are also configured.AC_POWER is zero during daytime (i.e., when the current hour is between the sunrise and sunset times).failure) is created based on specific conditions:AC_POWER equals 0.Humidity is 80 or above.Radiation is either below 200 or above 900.SOURCE_KEY column is label encoded to transform any non-numeric identifiers into numerical values.failure column to maintain the same class distribution across splits.scale_dataset) is defined to:DATE_TIME).StandardScaler.RandomOverSampler) to the training set to handle any class imbalance.XGBClassifier) is set up with the following hyperparameters:SolarPlant.joblib) using the joblib library. This allows for later reuse of the model without needing to retrain it.Posted Apr 13, 2025
Developed a Jupyter Notebook for solar plant failure prediction.
0
4