Spaces:
Sleeping
Sleeping
| # Evaluation Summary for IndoHoaxDetector Space | |
| ## Metrics Overview | |
| - **Model Architecture**: Logistic Regression trained on Indonesian news labeled as HOAX vs FAKTA. | |
| - **Vectorizer**: TF-IDF transform created with `tfidf_vectorizer.pkl` after applying Indonesian-specific preprocessing. | |
| - **Accuracy**: ~97.83% on the held-out validation split used during training (metadata stored in `model_metadata.txt`). | |
| - **Precision & Recall**: Balanced on the styled labels. Precision indicates how often the model flags hoax-style text correctly; recall shows how many hoax-like examples are captured. | |
| - **Confidence Scores**: The Gradio app exposes probability values for both labels. Use the HOAX probability as a stylistic warning, not a verdict. | |
| ## Testing Guidelines | |
| 1. Prepare a set of Indonesian news snippets (title + body) with known labels. | |
| 2. Run the preprocessing steps defined in `app.py` (lowercasing, URL stripping, non-letter removal, stop word removal, Sastrawi stemming, TF-IDF transform). | |
| 3. Use the loaded model to infer probabilities via `predict_news`. | |
| 4. Compare predictions with labels and compute metrics with any evaluation script (e.g., run the repository-level `evaluate.py` if you copy it inside this folder). | |
| ## Reporting | |
| - Document any changes to the dataset or vectorizer. | |
| - If you retrain the model, update this file with the new accuracy, precision, recall, and dataset description to keep the Space trustworthy. | |
| ## Caveats | |
| - Metrics refer only to stylistic label consistency, not factual verification. | |
| - The evaluation set may not include every possible writing style; monitor drift over time and retrain as needed. |