Difference between revisions of "F-Scores and Accuracy/zh-hans"

From Eyewire
Jump to: navigation, search
(Created page with "召回率指相同体积里正确添加片段占总共需要添加片段的比例。比如某玩家的召回率是0.9409,那就遗漏了6%的片段。计算时,正确添...")
Line 16: Line 16:
  
  
Recall measures how much of the volume was missed. Let’s say Player A has a recall of 0.9409. That means that Player A missed about 6% of the correct segments in the cubes Player A worked on. To determine a player’s recall we use their true positive (tp) results, correctly added, and false negative (fn) results, incorrectly missed, in this formula: [[File:Recallformula.png|center]]
+
召回率指相同体积里正确添加片段占总共需要添加片段的比例。比如某玩家的召回率是0.9409,那就遗漏了6%的片段。计算时,正确添加片段相当于真阳性结果,记为 tp ;遗落的片段相当于假阴性结果 ,记为 fn 。则有如下召回率计算公式:[[File:Recallformula.png|center]]
  
  

Revision as of 17:05, 21 October 2016

在Eyewire,您将获得基于您的F分数的准确率评级。F分数是决定准确度占准确率和召回这两种统计方法。或者更简单地说,F分数是HQ根据添加多的、缺失的模块来决定你的准确率。传统F分数的公式为:

Error creating thumbnail: Unable to save thumbnail to destination

在计算最终的F分数时,要提前计算好玩家个人的准确率和召回。方块由玩家搭建完后,程序识别并输出4种判断的结果:真阳性、假阳性、真阴性、假阳性。真阳性(tp)是你正确添加了遗失的片段;假阳性(fp)是你错误添加了不该有的片段;真阴性(tn)是你移除原来模型中错误的片段;假阴性(fn)是你遗漏了需要添加的片段。你可以在下图看到假阳性与假阴性的区别。


NewFScoreEyeWire.png
左边是玩家提交的一个方块。 这个示例中,绿色和红色是玩家已经提交的增添了的片段,紫色是玩家遗漏的部分片段。


图中红色片段是假阳性,紫色片段是假阴性。 玩家错误地添加了红色段,而不是区添加紫色片段。 绿色片段正确。


再次说明,准确度是单位体积里正确片段的个数。比如某玩家的准确度是0.9221,那92%片段正确添加,8%片段错误添加。正确添加的片段也就是真阳性,记为 tp ;相应有 fp 。则可以用如下公式计算准确度:


召回率指相同体积里正确添加片段占总共需要添加片段的比例。比如某玩家的召回率是0.9409,那就遗漏了6%的片段。计算时,正确添加片段相当于真阳性结果,记为 tp ;遗落的片段相当于假阴性结果 ,记为 fn 。则有如下召回率计算公式:
Error creating thumbnail: Unable to save thumbnail to destination


Now we would take the results from both of those formulas and plug them into the formula above to get a player’s F-score. Another way to look at it is we take the harmonic mean of a player’s precision and recall to get their overall accuracy rating.

How Accurate are F-Scores?

One question we a get a lot is how do we know what is correct and what isn’t? What is correct is determined by combining the GrimReaper’s corrections with the EyeWirer consensus. If a cube does not have a GrimReaper correction we just use the EyeWirer consensus. EyeWire consensuses have proven to be quite accurate. However, there is still a small chance that a consensus may contain a wrong piece. This means that F-scores cannot prove user accuracy 100% of the time. However, they are accurate enough that we feel confident using them as a player guide.