Bug Fix: KeyError and Repeat Counts in evaluation #178

realgump · 2023-10-13T07:23:48Z

When running a multi-round evaluation ('args.evaluate_times' greater than 1), 'qid' is already present in 'prefer_dict' from the second round onward, so execution goes straight into the elif branch. The key 'qid' has been initialized, but the per-round key 'round_i' (for example "round_1") has not, which raises a KeyError.

When the pass rates are compared directly, the count for 'reference_model' or 'output_model' has already been incremented, but the entry is not marked as complete. As a result, when the JSON file is loaded and the evaluation runs again, the 'qids' from this section appear to be counted a second time.
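The two fixes described above can be illustrated with a minimal sketch. It is not the code changed in this PR: the function `evaluate_rounds`, the `compare` callback, the entry layout (`winner`, `complete`), and the `round_0`-based key naming are assumptions made for illustration. Only the names `prefer_dict`, `qid`, `reference_model`, and `output_model` come from the description.

```python
import json
from collections import Counter


def evaluate_rounds(qids, evaluate_times, compare, prefer_dict, counts):
    """Hypothetical evaluation loop showing both fixes.

    compare(qid) is an assumed callback returning "reference_model"
    or "output_model" for a single comparison.
    """
    for round_idx in range(evaluate_times):
        round_key = f"round_{round_idx}"
        for qid in qids:
            # Fix 1: create the qid entry AND the per-round entry on demand,
            # so a qid seen in an earlier round does not raise KeyError.
            rounds = prefer_dict.setdefault(qid, {})
            entry = rounds.setdefault(
                round_key, {"winner": None, "complete": False}
            )
            if entry["complete"]:
                continue  # this comparison was already counted; skip it
            winner = compare(qid)
            counts[winner] += 1
            entry["winner"] = winner
            # Fix 2: mark the entry complete in the same step as the count
            # update, so a resumed run does not count it again.
            entry["complete"] = True


def toy_compare(qid):
    # Assumed stand-in for the real comparison.
    return "output_model" if qid == "q1" else "reference_model"


counts = Counter()
prefer_dict = {}
evaluate_rounds(["q1", "q2"], 2, toy_compare, prefer_dict, counts)

# Simulate saving to and reloading from a JSON file, then resuming.
prefer_dict = json.loads(json.dumps(prefer_dict))
evaluate_rounds(["q1", "q2"], 2, toy_compare, prefer_dict, counts)
```

In this sketch the resumed run leaves `counts` unchanged, because every entry was marked complete when it was first counted.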

skzhang1 · 2024-07-09T19:18:08Z

Good PR!

realgump added 2 commits October 13, 2023 15:05

Bug Fix: Resolved KeyError Occurring During the Round Loop

82473c5

When 'args.evaluate_times' is greater than 1, during the second evaluation, the key 'qid' is initialized, but 'round_i' is not. This leads to a KeyError.

Bug Fix: Duplicate Preference Counts in Some Cases.

92f8a33

The comparison result has been updated, but the completion status has not. As a result, when loading the JSON file, the 'qids' from this section will be counted repeatedly.
