Classification is widely used in medical applications. However, the quality of the 
classifier depends critically on the accurate labeling of the training data, and 
in many medical applications, labeling a sample or grading a biopsy can be 
subjective. Existing studies confirm this phenomenon and show that even a very 
small number of mislabeled samples can severely degrade the performance of the 
resulting classifier, particularly when the sample size is small. The problem we 
address in this paper is the automatic detection of samples that are possibly 
mislabeled. We propose two algorithms for this task: a classification-stability 
algorithm and a leave-one-out-error-sensitivity algorithm. For both algorithms, 
the key step 
is the computation of the leave-one-out perturbation matrix. The 
classification-stability algorithm measures how stable the label of a sample is 
when the labels of other samples are changed. The version of this algorithm based 
on the support vector machine appears to be quite accurate on three real datasets, 
and the suspect list it produces is of high quality. 
Furthermore, when human intervention is not available, the correction heuristic 
appears to be beneficial.
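
As a rough illustration of the classification-stability idea, the sketch below builds one possible leave-one-out perturbation matrix and derives a suspect list from it. The text above does not define the matrix, so the construction used here is an assumption: flip the label of sample i, rerun leave-one-out classification, and record whether the prediction for sample j disagrees with its given label. The 0.5 threshold, the toy one-dimensional data, and all function names are likewise hypothetical, and a nearest-centroid classifier stands in for the support vector machine purely to keep the example free of dependencies.

```python
# Illustrative sketch only. The matrix definition, the threshold, the toy data,
# and the nearest-centroid classifier are assumptions made for this example.

def predict_nearest_centroid(train_x, train_y, x):
    """Return the label whose class mean is closest to x (1-D features)."""
    means = {}
    for label in set(train_y):
        members = [v for v, l in zip(train_x, train_y) if l == label]
        means[label] = sum(members) / len(members)
    return min(means, key=lambda label: abs(means[label] - x))


def loo_perturbation_matrix(xs, ys):
    """Entry [i][j] is 1 if, after flipping the label of sample i, the
    leave-one-out prediction for sample j disagrees with j's given label."""
    n = len(xs)
    matrix = []
    for i in range(n):
        flipped = list(ys)
        flipped[i] = 1 - flipped[i]  # perturb one binary label (0 or 1)
        row = []
        for j in range(n):
            # Train on every sample except j, using the perturbed labels.
            train_x = xs[:j] + xs[j + 1:]
            train_y = flipped[:j] + flipped[j + 1:]
            pred = predict_nearest_centroid(train_x, train_y, xs[j])
            row.append(int(pred != ys[j]))
        matrix.append(row)
    return matrix


def instability_scores(matrix):
    """Fraction of perturbations of *other* samples under which sample j
    is classified against its given label."""
    n = len(matrix)
    return [
        sum(matrix[i][j] for i in range(n) if i != j) / (n - 1)
        for j in range(n)
    ]


def suspect_list(xs, ys, threshold=0.5):
    """Indices whose given label is contradicted under most perturbations."""
    scores = instability_scores(loo_perturbation_matrix(xs, ys))
    return [j for j, s in enumerate(scores) if s > threshold]


# Toy data: two well-separated groups; sample 3 (x = 1.5) sits in the low
# group but carries label 1, imitating a labeling error.
xs = [0.0, 1.0, 2.0, 1.5, 9.0, 10.0, 11.0, 9.5]
ys = [0, 0, 0, 1, 1, 1, 1, 1]

print(suspect_list(xs, ys))  # prints [3]
```

In this toy case only the deliberately mislabeled sample is contradicted under every perturbation of the other labels, so it is the sole entry in the suspect list. The construction retrains the classifier on the order of n^2 times, which is manageable mainly for the small sample sizes discussed above.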