[OTDev] Validation: classification statistics for non-binary class values

Martin Guetlein martin.guetlein at googlemail.com
Tue Dec 8 14:51:51 CET 2009


Hi Nina, All,

very good Point. Here is how it could look like:

[[
default:confusionmatrix
  a ot:ConfusionMatrix ;

  # contains numClassValues**2 entries like the following

  ot:confusionMatrixValue
  [
    a ot:ConfusionMatrixValue ;
    dc:value "25"^^xsd:int ;
    ot:confusionMatrixCoordinates ;
    [
	a ot:ConfusionMatrixCoordinate ;
	dc:predictedValue "active"^^xsd:String ;
	dc:actualValue "moderately_active"^^xsd:String ;
    ]
  ]
  ...
]]

I think we will end up with quite a lot of Classes in our opentox.owl.

Best Regards,
Martin



On Tue, Dec 8, 2009 at 12:57 PM, Nina Jeliazkova <nina at acad.bg> wrote:
> Hi Martin,
>
> Do we have confusion matrix somewhere in the classification statistics?
> It provides more information than just true positives.
>
> Best regards,
> Nina
>
>
> Martin Guetlein wrote:
>> Hello All,
>>
>> as Harry noted in one of the last meetings, the classification
>> statistics in the validation object only take binary classification
>> into account so far. There can of course be more than one class value
>> (e.g. inacitve, moderately-active, active).
>> Hence, some classification results (e.g. numTruePositives) are now
>> available multiple times (once for each class-value).
>>
>> As collections are not allowed in OWL-DL, I had to create
>> intermediate classes (following the scheme Nina proposed for the
>> dataset). Here is how an example of the Classification Statistics
>> Object may look like:
>>
>> [[
>> default:thisClassificationStatistics
>>   a ot:classificationStatistics ;
>>
>>   ot:accuracy "99.0"^^xsd:float ; # accuracy is only available once
>>   ot:numberUnclassified "26"^^xsd:int ;
>>   ...
>>
>>   ot:classStatisticEntry
>>     [ a ot:classStatisticEntry ;
>>       ot:classValue "moderately_active"^^String ;
>>       ot:classStatisticValue
>>         [ a ot:ClassStatisticValue ;
>>           ot:classStatistic default:areaUnderRocCurve ;
>>           ot:value "0.77"^^:xsd:float ;
>>         ] ;
>>       ot:classStatisticValue
>>         [ a ot:ClassStatisticValue ;
>>           ot:classStatistic default:numTruePositives ;
>>           ot:value "123"^^:xsd:int ;
>>         ] ;
>>       ot:classStatisticValue
>>       ...
>>
>>   ot:classStatisticEntry
>>     [ a ot:classStatisticEntry ;
>>       ot:classValue "intactive"^^String ;
>>       ...
>>
>>   ot:classStatisticEntry
>>     [ a ot:classStatisticEntry ;
>>       ot:classValue "active"^^String ;
>>       ...
>> ]]
>>
>> Here is the old classification statistics object (I renamed it from
>> ClassifcationInformation to ClassificationStatistics):
>> http://www.opentox.org/data/documents/development/RDF%20files/Validation/#-ot-classificationinfo-rdf
>>
>> Any comments, corrections before I add that to the opentox.owl?
>>
>> Best regards,
>> Martin
>>
>>
>>
>
> _______________________________________________
> Development mailing list
> Development at opentox.org
> http://www.opentox.org/mailman/listinfo/development
>



-- 
Dipl-Inf. Martin Gütlein
Phone:
+49 (0)761 203 8442 (office)
+49 (0)177 623 9499 (mobile)
Email:
guetlein at informatik.uni-freiburg.de



More information about the Development mailing list