Detecting Toxic Text

How To Use

Under 'Demos'

Select a demo to view an example of a certain type of toxicity in action

Under 'Input Text'

Type the text you want to analyze for toxicity

Under 'Select Model'

My Models

Toxicity - 1 Epoch: First version of my finetuned model, trained for one epoch

Toxicity - 8 Epochs: Second version of my finetuned model, trained for 8 epochs

Toxicity - Weighted: Final version of my model, which applies class weights so that underrepresented categories are classified correctly (see the sketch after this list)

Base Model: DistilBERT Base Uncased (SST-2) - Classifies text as either positive or negative
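
The idea behind the weighted variant can be illustrated with a short sketch. This is not the actual training code: it assumes a PyTorch setup, single-label classification over the six categories, and made-up class counts.

    import torch
    import torch.nn as nn

    # Hypothetical per-class example counts; rare classes (e.g. threat) get larger weights.
    class_counts = torch.tensor([15000., 1600., 8400., 480., 7800., 1400.])
    class_weights = class_counts.sum() / (len(class_counts) * class_counts)

    # Weighted loss: mistakes on underrepresented classes cost more during training.
    loss_fn = nn.CrossEntropyLoss(weight=class_weights)

    logits = torch.randn(4, 6)             # batch of 4 examples, 6 toxicity classes
    targets = torch.tensor([0, 3, 5, 1])   # toxic, threat, identity_hate, severe_toxic
    print(loss_fn(logits, targets).item())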

Output

Hit the submit button to view the output

My Model:

Tweet (portion): A portion of the text that was entered

Toxicity Class: toxic / severe_toxic / obscene / threat / insult / identity_hate

Probability: The model's confidence in the predicted class
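
The three fields above can also be reproduced outside the app with a Transformers pipeline. This is only a sketch; the model repo name below is a placeholder, not the actual checkpoint ID.

    from transformers import pipeline

    # Placeholder repo name -- substitute the actual finetuned toxicity checkpoint.
    classifier = pipeline("text-classification", model="your-username/toxicity-weighted")

    text = "some tweet text to analyze"
    result = classifier(text)[0]                 # {'label': ..., 'score': ...}

    print("Tweet (portion):", text[:100])
    print("Toxicity Class:", result["label"])    # toxic / severe_toxic / obscene / ...
    print("Probability:", round(result["score"], 4))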

Base Model

Tweet (portion): A portion of the text that was entered

Result: POSITIVE / NEGATIVE

Probability: The model's confidence in the predicted label
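
The base model corresponds to the stock DistilBERT SST-2 checkpoint on the Hugging Face Hub, so its output can be reproduced with the standard sentiment pipeline:

    from transformers import pipeline

    sentiment = pipeline("sentiment-analysis",
                         model="distilbert-base-uncased-finetuned-sst-2-english")

    text = "some tweet text to analyze"
    result = sentiment(text)[0]                  # {'label': 'POSITIVE' | 'NEGATIVE', 'score': ...}

    print("Tweet (portion):", text[:100])
    print("Result:", result["label"])
    print("Probability:", round(result["score"], 4))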


If Model Does Not Load

Visit the Hugging Face site to use the model directly, or reload the page to retry loading it on this site.

Links

GitHub Repo

GitHub Account

Hugging Face App

Hugging Face Account