AI for Time Series and Anomaly Detection

Table 2.

Summary of Empirical Results


Model Category	Representative Models	Primary Strengths	Weaknesses / Limitations	Average Performance (F1 / RMSE)

Traditional Statistical	ARIMA, Holt-Winters	Interpretable, low complexity	Poor scalability, weak with nonlinear data	F1 ≈ 0.60 / RMSE ↑ 15–20%
Machine Learning	SVM, Random Forest, XGBoost	Moderate accuracy, interpretable	Heavy feature engineering	F1 ≈ 0.75 / RMSE ↓ 10%
Deep Sequential	LSTM, GRU	Captures temporal dependencies	Slow training, gradient issues	F1 ≈ 0.85 / RMSE ↓ 18%
Deep Convolutional	TCN	Fast inference, robust to noise	Limited long-term context	F1 ≈ 0.88 / RMSE ↓ 20%
Transformer-Based	TFT, Informer, TimesNet	High accuracy, interpretable via attention	Computationally expensive	F1 ≈ 0.91 / RMSE ↓ 25%
Generative / Hybrid	Autoencoder, VAE, GAN	Excellent anomaly detection	Hard to tune, interpretability issues	F1 ≈ 0.93 / RMSE ↓ 22%