
Are you planning to release the relevance judgements? #55

Closed

vitojph opened this issue Oct 4, 2019 · 3 comments

Comments

@vitojph commented Oct 4, 2019

Hi,

Fantastic initiative, thanks a lot :-)

Are you planning to publish the relevance judgements, ie, the 4k expert relevance annotations?

@vitojph vitojph changed the title Are you planning to relase the relevance judgements? Are you planning to release the relevance judgements? Oct 4, 2019
@hamelsmu (Member) commented Oct 7, 2019

I am not sure we are going to release that; I'll let my colleagues chime in:

@mmjb @hohsiangwu @mallamanis

@mallamanis (Collaborator) commented Oct 8, 2019

Hi,
I am against publicly releasing the annotations at this point. By keeping them "hidden" behind the leaderboard evaluation, we are in less danger of overfitting on the dataset (or of someone "cheating" by looking at the test set). The test set is quite small, and sooner or later solutions will start overfitting to it.

Having said that, (a) I think that we should eventually release them (e.g. after a year or so), and/or (b) share them with individuals who have a good reason (e.g. an alternate use case) and who verbally agree not to share the test set further and not to use it for the CodeSearchNet challenge.

Let me know what you think.

@vitojph (Author) commented Oct 8, 2019

Hi @mallamanis! I understand your reasons for keeping the annotations away from curious eyes, especially when the competition has just started. Still, I encourage you folks to release them in the near future in order to foster the evaluation of NLP techniques applied to search engines.

AFAIK, it's quite difficult to find freely available datasets and annotations for fully evaluating information retrieval systems. The TREC collections are one example, but your data collection would definitely add a lot of value for a different domain.
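For context on why such annotations matter: graded relevance judgements like these are what let you score a search system with ranking metrics such as NDCG (which is the kind of metric the CodeSearchNet leaderboard evaluation relies on judgements for). A minimal illustrative sketch, with made-up relevance grades rather than the actual (unreleased) annotations:

```python
import math

def dcg(relevances):
    """Discounted cumulative gain of a ranked list of graded relevance scores."""
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances))

def ndcg(system_relevances):
    """NDCG: DCG of the system's ranking divided by DCG of the ideal ranking."""
    ideal = dcg(sorted(system_relevances, reverse=True))
    return dcg(system_relevances) / ideal if ideal > 0 else 0.0

# Hypothetical expert grades (0 = irrelevant .. 3 = highly relevant) for the
# results a search system returned, in the order it returned them.
system_ranking = [3, 2, 0, 1]
print(round(ndcg(system_ranking), 3))
```

A perfect ranking (results sorted by relevance grade) scores 1.0; any swap that pushes a relevant result down lowers the score, which is why held-out judgements are so valuable for comparing systems.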

Thanks anyway for your effort :-)

@hamelsmu hamelsmu pinned this issue Oct 10, 2019
@hamelsmu hamelsmu closed this Oct 15, 2019
@hamelsmu hamelsmu unpinned this issue Nov 5, 2019