@lihebi lihebi commented Jan 17, 2023

This is the up-to-date code & results for PairFact, including the factcc, qags, and frank datasets. It is an all-in-one repo ready for ACL code review. Changes in this PR:

  1. add the factcc, qags, and frank datasets (from PR "(WIP) add factcc, frank, qags datasets" #6)
  2. use pooled-level instead of summary-level correlation for factcc & qags, because there is only one system per docID, so summary-level correlation cannot be computed.
  3. run on both cnndm & xsum
  4. move the bertscore & mnli metrics into DocAsRef_0
  5. add the generated results to this repo
  6. skip experiments whose corresponding result files already exist.
  7. add some descriptions to the README.

TODO: anonymize it.

The important files are factcc.py and eval.py (the entry point). The other files are mostly copied from the DocAsRef_0 repo.

Current results:

|               | frank-cnndm | frank-xsum | qags-cnndm | qags-xsum | factcc |
|---------------|-------------|------------|------------|-----------|--------|
| system-level  | done        | done       | done       | done      | done   |
| summary-level | done        | done       | -          | -         | -      |
| pooled-level  | ?           | ?          | done       | done      | done   |

Note: the pooled-level setting collects the results for all docIDs and computes correlations over the entire vectors. The reason is that qags and factcc have only one system per docID, so summary-level correlation cannot be computed (the scipy.stats correlation functions throw an error on vectors of size 1).
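The pooled- vs summary-level distinction can be sketched as below. This is an illustrative toy, not the repo's actual evaluation code: the group structure and scores are made up, and plain NumPy Pearson correlation stands in for whatever scipy.stats call eval.py uses.

```python
# Hypothetical sketch: summary-level vs pooled-level correlation.
# Each group is (metric_scores, human_scores) for one docID.
import numpy as np

def pearson(x, y):
    x, y = np.asarray(x, float), np.asarray(y, float)
    if len(x) < 2:
        # This is why summary-level fails on qags/factcc:
        # one system per docID means vectors of size 1.
        raise ValueError("correlation undefined for vectors of size < 2")
    return float(np.corrcoef(x, y)[0, 1])

def summary_level(groups):
    # One correlation per docID, then averaged over docIDs.
    return float(np.mean([pearson(m, h) for m, h in groups]))

def pooled_level(groups):
    # Concatenate all scores across docIDs, then one correlation
    # over the entire vectors.
    metric = [v for m, _ in groups for v in m]
    human = [v for _, h in groups for v in h]
    return pearson(metric, human)

# Three docIDs, one system each: summary-level is impossible,
# pooled-level still works.
groups = [([0.9], [1.0]), ([0.2], [0.0]), ([0.6], [1.0])]
print(pooled_level(groups))
```

Pooling trades per-document granularity for computability: it yields a single correlation even when individual docIDs have too few systems to correlate on their own.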

Additional missing results (currently running):

  • qags-xsum-pooled Summary-Level
  • frank-xsum Summary-Level

lihebi commented Jan 21, 2023

Update: added classification results in the results-classification folder, using the following thresholds: 0.3, 0.4, 0.5, 0.6, 0.7.
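The thresholding step can be sketched as follows. This is a hypothetical illustration of turning continuous metric scores into binary factuality labels at each threshold; the function name and example scores are made up.

```python
# Hypothetical sketch: binarize continuous metric scores at a threshold,
# as done for the results-classification experiments.
def classify(scores, threshold):
    """Label a summary factual (1) if its score meets the threshold, else 0."""
    return [1 if s >= threshold else 0 for s in scores]

scores = [0.25, 0.45, 0.65, 0.8]
for t in (0.3, 0.4, 0.5, 0.6, 0.7):
    print(t, classify(scores, t))
```

Sweeping several thresholds shows how sensitive the classification results are to the cutoff, since metrics are not calibrated to a single natural decision boundary.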
