How does F-Score work for text categorization? When we create a model and upload the training and test data, the F-Score is calculated based on the test-data entries. What happens to the F-Score when new emails start coming in (in the context of the Email Channel) and they sometimes map correctly or incorrectly to a topic? Does the F-Score change automatically if an incoming email fails to map to a topic?
When a new email comes in and goes for manual triaging, any case created or reply sent by a CSR generates a feedback record, which lands in the Training Data tab of the email channel. Only after you verify these feedback records and rebuild the model does the F-Score change. The F-score is the harmonic mean of the precision and recall of a given model, and it should ideally increase as you provide a variety of labels (topics) and a variety of text. It may not improve if you keep providing the same text and labels.
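To make the "harmonic mean" part concrete, here is a minimal sketch (not the product's internal code) of how an F-score is derived from precision and recall:

```python
# F-score as the harmonic mean of precision and recall.
# The harmonic mean penalizes imbalance: a model with high precision
# but low recall (or vice versa) still gets a low F-score.
def f_score(precision: float, recall: float) -> float:
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Example: precision 0.8, recall 0.6 -> F-score ~0.686,
# lower than the arithmetic mean (0.7).
print(round(f_score(0.8, 0.6), 3))
```

This is why feeding the model varied labels and text helps: it can lift recall (fewer missed topics) without sacrificing precision, and both must rise for the F-score to rise.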
Both the train and test sets should be good, accurate, non-junk data. You may choose not to provide anything there, in which case the model-build process will automatically take random samples. Typically a 70:30 train:test split is used for ML models.
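A hypothetical illustration of that 70:30 random split, assuming the build process simply shuffles and cuts the labeled records when you don't supply your own test set (the product's actual sampling logic may differ):

```python
import random

def split_70_30(records, seed=42):
    """Shuffle records and return (train, test) at a 70:30 ratio."""
    rng = random.Random(seed)  # fixed seed for a reproducible example
    shuffled = records[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * 0.7)
    return shuffled[:cut], shuffled[cut:]

emails = [f"email_{i}" for i in range(10)]  # placeholder records
train, test = split_70_30(emails)
print(len(train), len(test))  # 7 3
```

The point of holding out the 30% is that the model never sees those records during training, so scoring against them estimates performance on genuinely new emails.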
The model's F-score is generated from the test data only. The process builds the model with the 'train' data and then evaluates the generated model against the 'test' data. The truth table (TP, FP, etc.) produced this way is used to calculate the scores. Specify 'Test' data explicitly if you want to be sure the F-score is computed against your test data rather than a random sample.
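A sketch of how the scores fall out of that truth table, assuming per-topic counting of the model's predictions against the true labels on the test set (topic names and data here are invented for illustration):

```python
def score_from_predictions(actual, predicted, topic):
    """Compute precision, recall, and F-score for one topic
    from parallel lists of true and predicted labels."""
    tp = sum(1 for a, p in zip(actual, predicted) if p == topic and a == topic)
    fp = sum(1 for a, p in zip(actual, predicted) if p == topic and a != topic)
    fn = sum(1 for a, p in zip(actual, predicted) if p != topic and a == topic)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

# Hypothetical test-set labels for a "billing" topic:
actual    = ["billing", "billing", "refund", "billing", "refund"]
predicted = ["billing", "refund",  "refund", "billing", "billing"]
print(score_from_predictions(actual, predicted, "billing"))
```

This also shows why the F-score only changes on rebuild: the counts come from re-running the model over the test set, not from live emails trickling in.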