Probabilities of False Claims Toolkit

False claims estimator

Using a Bayesian approach, this tool estimates the probability that a reported outperformance claim is false. It evaluates whether the observed difference between two methods could have occurred by chance, given the test-set size and the reported performance values. The methodology adapts to the task type: for classification, it models case-wise agreement patterns between methods; for segmentation, it incorporates both the variability of performance and the correlation between the methods' per-case scores. The estimator currently supports comparisons based on accuracy for classification and on the Dice Similarity Coefficient (DSC) for segmentation.

Data consent, terms of use and citation

I have read and agree to the Data consent and Terms of use.

I understand that I have to cite the referenced article(s) listed in the Publication page.

Please accept both checkboxes to proceed.

Select your scenario

For reviewers / For researchers

Select the task type

Segmentation / Classification

Provide the following values

Size of the test set

Enter an integer value (minimum 2)

Test set size must be between 2 and 100,000.

Is the standard deviation reported?

Mean DSC of the 1st ranked method

Enter a value between 0 and 1

Value must be between 0 and 1.

Mean DSC of the 2nd ranked method

Enter a value between 0 and 1

Value must be between 0 and 1.
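For segmentation, the fields above (test-set size, the two mean DSC values and, if reported, a standard deviation) are enough for a rough calculation. The sketch below is not the tool's actual model, which this page does not spell out. It assumes a normal approximation with a flat prior on the true mean difference, and the function names, the per-method standard deviations `sd_1` and `sd_2`, and the correlation `rho` are hypothetical inputs chosen for illustration.

```python
import math


def validate_inputs(n, mean_1, mean_2):
    """Apply the range checks stated in the form."""
    if not isinstance(n, int) or not 2 <= n <= 100_000:
        raise ValueError("Test set size must be between 2 and 100,000.")
    for value in (mean_1, mean_2):
        if not 0.0 <= value <= 1.0:
            raise ValueError("Value must be between 0 and 1.")


def prob_false_claim_dsc(n, mean_1, mean_2, sd_1, sd_2, rho):
    """Approximate P(true mean difference <= 0) for paired DSC scores.

    Assumption (not from the tool): the posterior of the mean difference is
    treated as Normal(observed difference, var_diff / n).
    """
    validate_inputs(n, mean_1, mean_2)
    # Variance of the per-case difference, from the two SDs and their correlation.
    var_diff = sd_1 ** 2 + sd_2 ** 2 - 2.0 * rho * sd_1 * sd_2
    se = math.sqrt(max(var_diff, 0.0) / n)
    if se == 0.0:
        # Degenerate case: no variability in the difference at all.
        return 0.0 if mean_1 > mean_2 else 1.0
    z = (mean_1 - mean_2) / se
    # Standard normal CDF evaluated at -z.
    return 0.5 * math.erfc(z / math.sqrt(2.0))


# Illustrative values only; none of these numbers come from the tool.
print(prob_false_claim_dsc(n=50, mean_1=0.86, mean_2=0.85,
                           sd_1=0.10, sd_2=0.10, rho=0.5))
```

Under these assumptions, a higher correlation between the two methods shrinks the variance of the paired difference, so the same gap in mean DSC becomes less likely to be a chance finding.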

Expected CSV format (segmentation: one DSC value per case and algorithm)

case_id	alg_01	alg_02	alg_03
1	0.9254	0.8712	0.9701
2	0.6753	0.7330	0.8902
3	0.8120	0.7991	0.9405
4	0.9012	0.8450	0.9603
...	...	...	...
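A per-case score table like the one above contains everything needed to derive the summary quantities the segmentation analysis relies on: the test-set size, each method's mean and standard deviation, and the correlation between the two top-ranked methods. The following is a minimal sketch using only the Python standard library. It assumes a comma-delimited file with a `case_id` column, and the helper names `load_scores`, `pearson` and `summarize_top_two` are hypothetical.

```python
import csv
import io
import math
import statistics

# The four sample rows shown above, written with commas as the delimiter.
SAMPLE_CSV = """case_id,alg_01,alg_02,alg_03
1,0.9254,0.8712,0.9701
2,0.6753,0.7330,0.8902
3,0.8120,0.7991,0.9405
4,0.9012,0.8450,0.9603
"""


def load_scores(text):
    """Return {algorithm name: [per-case DSC values]}."""
    reader = csv.DictReader(io.StringIO(text))
    algorithms = [name for name in reader.fieldnames if name != "case_id"]
    scores = {name: [] for name in algorithms}
    for row in reader:
        for name in algorithms:
            scores[name].append(float(row[name]))
    return scores


def pearson(x, y):
    """Pearson correlation between two equally long score lists."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def summarize_top_two(scores):
    """Rank methods by mean DSC and summarize the two best ones."""
    ranking = sorted(scores, key=lambda k: statistics.fmean(scores[k]),
                     reverse=True)
    first, second = ranking[0], ranking[1]
    return {
        "ranking": ranking,
        "n": len(scores[first]),
        "mean_first": statistics.fmean(scores[first]),
        "mean_second": statistics.fmean(scores[second]),
        "sd_first": statistics.stdev(scores[first]),
        "sd_second": statistics.stdev(scores[second]),
        "correlation": pearson(scores[first], scores[second]),
    }


summary = summarize_top_two(load_scores(SAMPLE_CSV))
print(summary)
```

With only four cases the estimates are of course unstable; the sample rows are used here purely to show the expected layout.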

Expected CSV format (classification: labels 0 and 1)

case_id	ground_truth	alg_01	alg_02	alg_03
1	1	1	0	1
2	0	0	1	0
3	1	1	1	0
4	1	0	1	0
...	...	...	...	...

Expected CSV format (classification: more than two labels)

case_id	ground_truth	alg_01	alg_02	alg_03
1	0	0	1	0
2	2	2	2	1
3	1	1	0	1
4	3	3	2	3
5	2	2	2	3
6	0	0	0	0
...	...	...	...	...
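For classification, the tool is described as modelling case-wise agreement between methods. One simple way to illustrate that idea, which is an assumption here and not necessarily the tool's model, is to look only at discordant cases (exactly one of the two methods is correct) and place a uniform Beta(1, 1) prior on the chance that the first-ranked method is the correct one. The posterior probability that this chance is at most 0.5 then serves as a rough "claim is false" estimate. The helper names are hypothetical and the input is assumed to be comma-delimited.

```python
import csv
import io
from math import comb

# The six sample rows shown above, written with commas as the delimiter.
SAMPLE_CSV = """case_id,ground_truth,alg_01,alg_02,alg_03
1,0,0,1,0
2,2,2,2,1
3,1,1,0,1
4,3,3,2,3
5,2,2,2,3
6,0,0,0,0
"""


def load_correctness(text):
    """Return {algorithm name: [True if the prediction matches ground_truth]}."""
    reader = csv.DictReader(io.StringIO(text))
    algorithms = [name for name in reader.fieldnames
                  if name not in ("case_id", "ground_truth")]
    correct = {name: [] for name in algorithms}
    for row in reader:
        truth = row["ground_truth"].strip()
        for name in algorithms:
            correct[name].append(row[name].strip() == truth)
    return correct


def discordant_counts(correct_a, correct_b):
    """Count cases where only A is correct and cases where only B is correct."""
    only_a = sum(a and not b for a, b in zip(correct_a, correct_b))
    only_b = sum(b and not a for a, b in zip(correct_a, correct_b))
    return only_a, only_b


def prob_not_better(only_a, only_b):
    """P(theta <= 0.5) under a Beta(only_a + 1, only_b + 1) posterior.

    For integer parameters the Beta CDF equals a binomial tail sum,
    so no external library is needed.
    """
    a = only_a + 1
    n = only_a + only_b + 1
    return sum(comb(n, j) for j in range(a, n + 1)) / 2 ** n


correct = load_correctness(SAMPLE_CSV)
accuracy = {name: sum(flags) / len(flags) for name, flags in correct.items()}
ranking = sorted(accuracy, key=accuracy.get, reverse=True)
first, second = ranking[0], ranking[1]
only_first, only_second = discordant_counts(correct[first], correct[second])
result = prob_not_better(only_first, only_second)
print(first, second, only_first, only_second, result)
```

Because only agreement with `ground_truth` matters, the same sketch applies to both classification layouts shown above.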

Select the classification mode

Expected CSV format

Upload your CSV file
