[1] C. Buckley and E.M. Voorhees, "Evaluating Evaluation Measure Stability," ACM SIGIR, pp. 33-34, 2000.
[2] J.S. Downie, "MIREX Next Generation," music-ir email list, 2011. Available at: http://listes.ircam.fr/wws/info/music-ir.
[3] J.S. Downie, A.F. Ehmann, M. Bay, and M.C. Jones, "The Music Information Retrieval Evaluation eXchange: Some Observations and Insights," in Advances in Music Information Retrieval, Z.W. Raś and A.A. Wieczorkowska (eds.), Springer, pp. 93-115, 2010.
[5] K. Järvelin and J. Kekäläinen, "Cumulated Gain-Based Evaluation of IR Techniques," ACM Transactions on Information Systems, vol. 20, no. 4, pp. 422-446, 2002.
[6] J.H. Lee, "Crowdsourcing Music Similarity Judgments using Mechanical Turk," ISMIR, pp. 183-188, 2010.
[7] T. Sakai, "On the Reliability of Information Retrieval Metrics Based on Graded Relevance," Information Processing and Management, vol. 43, no. 2, pp. 531-548, 2007.
[8] M. Sanderson and J. Zobel, "Information Retrieval System Evaluation: Effort, Sensitivity, and Reliability," ACM SIGIR, pp. 162-169, 2005.
[9] M.A. Seaman, J.R. Levin, and R.C. Serlin, "New Developments in Pairwise Multiple Comparisons: Some Powerful and Practicable Procedures," Psychological Bulletin, vol. 110, no. 3, pp. 577-586, 1991.
[10] R. Typke, M. den Hoed, J. de Nooijer, F. Wiering, and R.C. Veltkamp, "A Ground Truth for Half a Million Musical Incipits," Journal of Digital Information Management, vol. 3, no. 1, pp. 34-39, 2005.
[11] R. Typke, R.C. Veltkamp, and F. Wiering, "A Measure for Evaluating Retrieval Techniques based on Partially Ordered Ground Truth Lists," IEEE International Conference on Multimedia and Expo, pp. 1793-1796, 2006.
[12] J. Urbano, "Information Retrieval Meta-Evaluation: Challenges and Opportunities in the Music Domain," ISMIR, 2011.
[13] J. Urbano, M. Marrero, D. Martín, and J. Lloréns, "Improving the Generation of Ground Truths based on Partially Ordered Lists," ISMIR, pp. 285-290, 2010.
[14] J. Urbano, J. Morato, M. Marrero, and D. Martín, "Crowdsourcing Preference Judgments for Evaluation of Music Similarity Tasks," ACM SIGIR Workshop on Crowdsourcing for Search Evaluation, pp. 9-16, 2010.
[15] E.M. Voorhees, "Topic Set Size Redux," ACM SIGIR, pp. 806-807, 2009.
[16] E.M. Voorhees and C. Buckley, "The Effect of Topic Set Size on Retrieval Experiment Error," ACM SIGIR, pp. 316-323, 2002.
[17] W. Webber, A. Moffat, J. Zobel, and T. Sakai, "Precision-at-Ten Considered Redundant," ACM SIGIR, pp. 695-696, 2008.