solar_plant.ipynb
that demonstrates an end-to-end pipeline for detecting failure events in solar plants. The notebook covers data preparation, visualization, model training, evaluation, and finally, model persistence. Below is an overview of the main sections and processes:pandas
, numpy
matplotlib
, seaborn
train_test_split
, StandardScaler
, and LabelEncoder
RandomOverSampler
from imblearn
XGBClassifier
from the XGBoost packagejoblib
warnings
(to ignore unnecessary warnings) are also configured.AC_POWER
is zero during daytime (i.e., when the current hour is between the sunrise and sunset times).failure
) is created based on specific conditions:AC_POWER
equals 0.Humidity
is 80 or above.Radiation
is either below 200 or above 900.SOURCE_KEY
column is label encoded to transform any non-numeric identifiers into numerical values.failure
column to maintain the same class distribution across splits.scale_dataset
) is defined to:DATE_TIME
).StandardScaler
.RandomOverSampler
) to the training set to handle any class imbalance.XGBClassifier
) is set up with the following hyperparameters:SolarPlant.joblib
) using the joblib
library. This allows for later reuse of the model without needing to retrain it.Posted Apr 13, 2025
Developed a Jupyter Notebook for solar plant failure prediction.
0
0