
new optional input (corpus weights) #2

Open
lzlniu wants to merge 14 commits into larsjuhljensen:master from lzlniu:master


lzlniu commented Feb 9, 2022

Specify the optional argument --corpus-weights=filename.tsv to input the corpus weights.
The new optional input file should be in TSV format, ideally with 2 columns. Other column counts are tolerated: if a line has fewer than 2 columns, no corpus weight is specified and it is set to 1.0; if it has more, the columns beyond the second are not read.
Not every PubMed ID that occurs in "documents.tsv" has to appear in the corpus weights file; any PubMed ID that does not appear gets a corpus weight of 1.0.
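
To make the rules above concrete, here is a minimal Python sketch of how such a file could be read. It is only an illustration, not the code in this PR. It assumes the first column holds the PubMed ID and the second the weight, and the names `read_corpus_weights`, `weight_for`, and `DEFAULT_WEIGHT` are hypothetical.

```python
import io

# Assumed default, matching the 1.0 fallback described above.
DEFAULT_WEIGHT = 1.0


def read_corpus_weights(handle):
    """Parse TSV lines into a {pubmed_id: weight} dict.

    Assumption: column 1 is the PubMed ID and column 2 is the weight.
    Columns beyond the second are ignored; a missing or empty second
    column falls back to DEFAULT_WEIGHT.
    """
    weights = {}
    for line in handle:
        fields = line.rstrip("\r\n").split("\t")
        doc_id = fields[0].strip()
        if not doc_id:
            continue  # skip blank lines
        if len(fields) >= 2 and fields[1].strip():
            # A non-numeric weight raises ValueError in this sketch.
            weights[doc_id] = float(fields[1])
        else:
            weights[doc_id] = DEFAULT_WEIGHT
    return weights


def weight_for(weights, doc_id):
    """Return the weight for a document, or the default if it is not listed."""
    return weights.get(doc_id, DEFAULT_WEIGHT)


# Example input: a normal line, a one-column line, and a three-column line.
sample = io.StringIO("101\t2.5\n102\n103\t0.5\textra\n")
weights = read_corpus_weights(sample)
```

With this input, `101` keeps its weight, `102` falls back to the default, the third column for `103` is dropped, and looking up an ID that is not in the file through `weight_for` also returns the default.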

Fix the standard copy constructor of the Document class.

