A Comparative Study of PyCaret AutoML and CNN-BiLSTM for Binary Hate Speech Detection in Indonesian Twitter
Researchers benchmarked AutoML and neural sequence models on Indonesian hate speech detection, finding CNN-BiLSTM outperforms traditional feature engineering with 83.8% accuracy on a 13K-row dataset. The work highlights a persistent pattern in NLP: deep bidirectional architectures still edge out automated classical pipelines on language tasks with directional context, even as AutoML tools mature. For practitioners building content moderation systems in non-English languages, the result underscores that neural approaches remain necessary when capturing nuanced linguistic abuse, though the controlled comparison methodology offers a useful template for evaluating tool trade-offs.48
























