New phase assessModel #296

sonalgoyal · 2022-05-26T07:36:28Z

Write a python script which whill expose the model stats - confusion matrix and number of records marked, unmarked, matches, non matches, not sure.

We will use the Labeller class. The python script takes the conf and passes it to the Client. Client will invoke the Labeller. Refer to the python api example at https://github.com/zinggAI/zingg/blob/main/api/scala/FebrlExample.py.

The script calls getMarkedRecords, getMarkedRecordsStat, getUnmarkedRecords on the Client and provides the stats. You can convert the df returned by the Client to python df. To build the confusion matrix, following can be used.

import pandas as pd
import seaborn as sn
import matplotlib.pyplot as plt

confusion_matrix = pd.crosstab(markedRecords['z_isMatch'], markedRecords['z_prediction'], rownames=['Actual'], colnames=['Predicted'])

sn.heatmap(confusion_matrix, annot=True)
plt.show()

sonalgoyal · 2022-05-26T16:12:19Z

I have added new methods on the client - getMatchedMarkedRecordsStat(Dataset markedRecords), getUnmatchedMarkedRecordsStat(Dataset markedRecords), getUnsureMarkedRecordsStat and getMarkedRecords()

you can use them to build the logic

sonalgoyal · 2022-05-26T16:12:50Z

@RavirajBaraiya

navinrathore · 2022-06-06T19:39:20Z

Confusion Matrix looks like below

navinrathore · 2022-06-06T19:46:44Z

Generated Config File from Arguments object
ArgumentsToFile.txt

navinrathore · 2022-06-06T19:47:36Z

Statistics for model 100

No. of Records Marked   :  76
No. of Records UnMarked :  72
No. of Matches          :  14
No. of Non-Matches      :  24
No. of Not Sure         :  0

sonalgoyal · 2022-06-15T18:22:54Z

need to look at the right model internally for this - should be expose label model or should we expose the actual model

sonalgoyal assigned RavirajBaraiya May 26, 2022

sonalgoyal self-assigned this May 28, 2022

navinrathore mentioned this issue Jun 6, 2022

Updates in Python classes and 'assessModel' python phase #313

Merged

sonalgoyal added this to the 0.3.4 milestone Jun 15, 2022

sonalgoyal unassigned RavirajBaraiya Jun 17, 2022

sonalgoyal modified the milestones: 0.3.4, 0.3.5 Jul 26, 2022

sonalgoyal removed this from the 0.3.5 milestone Nov 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New phase assessModel #296

New phase assessModel #296

sonalgoyal commented May 26, 2022

sonalgoyal commented May 26, 2022

sonalgoyal commented May 26, 2022

navinrathore commented Jun 6, 2022

navinrathore commented Jun 6, 2022

navinrathore commented Jun 6, 2022

sonalgoyal commented Jun 15, 2022

New phase assessModel #296

New phase assessModel #296

Comments

sonalgoyal commented May 26, 2022

sonalgoyal commented May 26, 2022

sonalgoyal commented May 26, 2022

navinrathore commented Jun 6, 2022

navinrathore commented Jun 6, 2022

navinrathore commented Jun 6, 2022

sonalgoyal commented Jun 15, 2022