Electricity load forecasting is an important task in ensuring power systems'' safety and efficiency. Addressing the pivotal issue of short-term electricity load forecasting in distribution areas, this paper presents a novel approach that combines electrical feature processing and Informer model optimization. The study employs Discrete Wavelet Transform (DWT) for denoising current data while utilizing Prophet model for extracting temporal features to enhance input data quality. Additionally, the method incorporates ProbSparse self-attention and self-attention distillation, thereby bolstering feature capture and prediction speed within the model. Validation with instance data demonstrates that the DWT-Informer model, enriched through both data preprocessing and model optimization, outperforms the baseline model across various performance metrics.