Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upGitHub is where the world builds software
Millions of developers and companies build, ship, and maintain their software on GitHub — the largest and most advanced development platform in the world.
Shuffle Validation Set when calculating MRR in included scripts #87
Comments
|
Hi @hamelsmu @mallamanis , According to my understanding, the order of the data should be fixed when we calculate MRR. But I find that this is not the case in the code. In model_test.py at line 82, the data is shuffled. |
|
Hi @fengzhangyin, I wouldn't expect this to be the case. Yes, the data is shuffled here but the numpy seed is fixed to I still haven't found a reason that the loss seems to be monotonically increasing... It's worth investigating more, of course. |
from @mallamanis
One masters' student in Berkeley has asked me the following question for CodeSearchNet.
The validation loss logging shows that the MRR performance decreases as it is being computed (at the end of the epoch). This seems to be the case with many of the runs on W&B. Do you have any idea why this might happening? I don't see anything obviously wrong.
For example,