Difference between revisions of "F-Scores and Accuracy/zh-hans"
(Created page with "F-成绩和准确度") |
(Created page with "在Eyewire,您将获得基于您的F分数的准确率评级。F分数是决定准确度占准确率和召回这两种统计方法。或者更简单地说,F分数是HQ根...") |
||
Line 1: | Line 1: | ||
− | + | 在Eyewire,您将获得基于您的F分数的准确率评级。F分数是决定准确度占准确率和召回这两种统计方法。或者更简单地说,F分数是HQ根据添加多的、缺失的模块来决定你的准确率。传统F分数的公式为: | |
[[File:F_score_calculation.png|center]] | [[File:F_score_calculation.png|center]] |
Revision as of 17:19, 19 October 2016
在Eyewire,您将获得基于您的F分数的准确率评级。F分数是决定准确度占准确率和召回这两种统计方法。或者更简单地说,F分数是HQ根据添加多的、缺失的模块来决定你的准确率。传统F分数的公式为:
Before we can calculate the final F-score first we must calculate your individual precision and recall. When a player does a cube there are four possible outcomes for every segment in that cube: a true positive result, a false positive result, a false negative result and a true negative result. A true positive (tp) result is when a player adds a segment that should be added. A false positive (fp) is when a player adds a segment that should not be added. A false negative (fn) is when a player misses a segment they should have added. A true negative (tn) is when a player correctly leaves out a segment that does not belong. In the figure below you can see an example of false negative and of false positive.
The red segment here is a false positive and the purple segment is a false negative. The player mistakenly added the red segment when they should have added the purple segment instead. The green segment is correct.
Now we would take the results from both of those formulas and plug them into the formula above to get a player’s F-score. Another way to look at it is we take the harmonic mean of a player’s precision and recall to get their overall accuracy rating.
How Accurate are F-Scores?
One question we a get a lot is how do we know what is correct and what isn’t? What is correct is determined by combining the GrimReaper’s corrections with the EyeWirer consensus. If a cube does not have a GrimReaper correction we just use the EyeWirer consensus. EyeWire consensuses have proven to be quite accurate. However, there is still a small chance that a consensus may contain a wrong piece. This means that F-scores cannot prove user accuracy 100% of the time. However, they are accurate enough that we feel confident using them as a player guide.