Web4/LS-W4-Mini-RF_Addiction_Impact

Model Summary

This model is a Random Forest Classifier designed to predict whether social media usage has an impact on a student's academic performance. The model was trained using the "Social Media Addiction vs. Relationships" dataset, which is a collection of survey responses from students aged 16 to 25. The model is built as a scikit-learn pipeline, which includes data preprocessing steps and the classifier itself, making it easy to load and use in a Python environment.

Intended Use

The model's primary use case is to provide a statistical tool for trend analysis regarding the relationship between social media use and academic performance in a specific student demographic. It is suitable for academic research and data exploration, but should not be used as a clinical diagnostic tool.

Training Details

The model was trained on the Social Media Addiction vs. Relationships dataset from Kaggle. The dataset contains 705 records and 13 features, all based on self-reported survey data. The pipeline's structure includes a ColumnTransformer for one-hot encoding categorical features before they are passed to the classifier.

Key hyperparameters used for the RandomForestClassifier were:

Performance

The model's performance was evaluated on a held-out test set from the original dataset, achieving an accuracy of 0.93.

Limitations and Ethical Considerations

It is crucial to understand the limitations of this model for responsible use:

Usage Guide

The model can be loaded and used in a Python environment using the joblib library, as shown in the example below.

https://huggingface.co/Web4/LS-W4-Mini-RF_Addiction_Impact