Field Confidence Threshold Optimizer Tool

Kofax Transformation defaults to 80% as the default field confidence threshold. Finding the optimum threshold for each field is a tedious task.
This tool

visualizes true/false positives/negatives for thresholds 0%,5%,10%,15%,...90%,95% and 100%
enables you to find the optimum threshold for each field.
helps you find the optimal configuration for each locator, evaluator, formatter, validation rule, and OCR profile.

Color	Type	Correct	Valid	Human Action
🟢	True Positive	Correct	Valid	-
🔵	False Negative	Correct	Invalid	Press ENTER
🟡	True Negative	Incorrect	Invalid	Correct Text
🔴	False Positive	InCorrect	Valid	Lost trust in system

Accuracy = 🟢+🔵. Human Effort = 🔵+🟡. Errors = 🟡+🔴.

Optimizing a Project

Unfortunately many think that the way to improve a KT project is to increase accuracy 🟢+🔵. This will not lead to success. Efforts that focus on Accuracy 🟢+🔵, ignore false positives 🔴 and don't focus on reducing human effort 🔵+🟡.

Accuracy cannot be converted into Time Savings, Cost Savings, FTE (Fulltime equivalent) reduction, ROI or productivity.
Human effort 🔵+🟡 can directly and accuractely be converted to Time Savings, Cost Savings, FTE (Fulltime equivalent) reduction, ROI or productivity. Therefore each project should focus on human effort 🔵+🟡.

The two goals of every project should be

Reduce the number of errors 🔴 that humans are currently producing.
Reduce human effort 🔵+🟡.

The data comes from chart underneath. We can see that 82% is the lowest threshold that still gives 0% false positives 🔴.

Confidence	TP	FN	TN	FP	Accuracy	Human Effort
	🟢	🔵	🟡	🔴	🟢+ 🔵	🔵+🟡
82%	10.5%	79.9%	9.6%	0.0%	90.4%	89.5%
80%	17.4%	72.9%	9.4%	0.2%	90.4%	82.3%

Reducing the threshold by 2% to 80% gives a false positive rate 🔴 of 0.2% (1 in 500) and a reduction in fields requiring human effort by 7.2 percentage points. It is always a game of compromise.

The following data shows that Accuracy metrics provide no value for productivity of a solution, nor do they help in anyway with optimizing the solution. Notice that an accuracy of about 90% can have a productivity gain between 1.4 and 2.8, and clearly 2.8 is twice as good at 1.4!! 2.8 means that 1.8 people can be reassigned to other work - if an employee costs $100,000 /year, then this solution will save $180,000 / year. The next step is to see how the productivity gain can be improved even further.
This is an example of how the Field Confidence Threshold Optimizer Tool can be used.

This example uses 1 second as the time to confirm a correct and invalid field 🔵, and 1.5 seconds to enter the correct value of a incorrect field 🟡.

Config	Confidence	TP	FN	TN	FP	Accuracy	Human Effort	Correction Ratio	Productivity gain
		🟢	🔵	🟡	🔴	🟢+ 🔵	🔵+🟡	🟡/(🔵+🟡)	🔵x 1s+ 🟡x 1.5s
Parascript Hand Alphanumeric	82%	0.2%	83.0%	16.8%	0.0%	83.2%	99.8%	17%	1.4
Parascript Hand Alpha	80.0%	18.3%	72.5%	8.9%	0.2%	90.8%	81.4%	11%	1.7
Parascript FirstName	79.0%	15.2%	72.3%	12.5%	0.0%	87.5%	84.8%	15%	1.6
Parascript LastName	80.0%	17.4%	72.9%	9.4%	0.2%	90.4%	82.3%	11%	1.7
Parascript Alpha + Truth Dictionary	73.0%	60.2%	35.8%	4.0%	0.0%	96.0%	39.8%	10%	3.6
Parascript Alpha + 2300 Names	67.0%	51.9%	37.4%	10.7%	0.0%	89.3%	48.1%	22%	2.8
Parascript Last + 2300 Names	67.0%	51.9%	38.0%	10.1%	0.0%	89.9%	48.1%	21%	2.8

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
Anonymizing Zones from Documents.md		Anonymizing Zones from Documents.md
README.md		README.md
Step-by-step Guide.md		Step-by-step Guide.md
Threshold Optimizer.xlsx		Threshold Optimizer.xlsx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Anonymizing Zones from Documents.md

Anonymizing Zones from Documents.md

README.md

README.md

Step-by-step Guide.md

Step-by-step Guide.md

Threshold Optimizer.xlsx

Threshold Optimizer.xlsx

Repository files navigation

Field Confidence Threshold Optimizer Tool

Optimizing a Project

About

Releases

Packages

TungstenTransformation/ThresholdOptimizer

Folders and files

Latest commit

History

Repository files navigation

Field Confidence Threshold Optimizer Tool

Optimizing a Project

About

Resources

Stars

Watchers

Forks