Why is there a big gap between my ROUGE evaluation results and those in the paper for single-document summarization?

My ROUGE installation should be fine, as I have no problems at all with the CNN/DailyMail dataset, but my ROUGE scores on the Multi-News dataset are: ROUGE-1 = 40.4, ROUGE-2 = 15.7, ROUGE-L = 35.5
 




------------------ Original Message ------------------
From: "Danqing ***@***.***>;
Sent: Wednesday, August 17, 2022, 4:10 PM
To: ***@***.***>;
Cc: ***@***.***>; ***@***.***>;
Subject: Re: [dqwang122/HeterSumGraph] Question about R1, R2, RL score (Issue #32)





  
Yes, I computed ROUGE on the published outputs, and on the multipurpose news dataset my scores differ by 6% from those listed by the author.
  
What does "multipurpose news dataset" refer to? Is it Multi-News?
What exactly is "a ROUGE score"? Is it R1 40.4? If you cannot reproduce the reported scores (R1 46.05) from the released outputs, you should check your ROUGE installation. You can follow the instructions here: https://github.com/dqwang122/HeterSumGraph#rouge-installation
Besides, you should also recheck the data format and preprocessing.
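
For anyone hitting the same gap, a rough sanity check that does not depend on the ROUGE installation may help separate an installation problem from a data-format problem. The sketch below is only an illustration: it is a plain n-gram-overlap F1, not the ROUGE-1.5.5 script the repository points to (no stemming, no ROUGE-L), so its numbers will not match the official ones exactly. The function names are hypothetical; the two R1 values are the ones quoted in this thread.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_f1(candidate, reference, n=1):
    """Simplified ROUGE-N F1: whitespace tokens, lowercased, no stemming."""
    cand = ngram_counts(candidate.lower().split(), n)
    ref = ngram_counts(reference.lower().split(), n)
    # Clipped overlap: each n-gram counts at most as often as it occurs in both.
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Size of the gap discussed in this thread, in absolute ROUGE-1 points.
reported_r1 = 46.05  # R1 quoted from the paper in the reply above
observed_r1 = 40.4   # R1 obtained by the issue author on Multi-News
gap = reported_r1 - observed_r1

if __name__ == "__main__":
    print(rouge_n_f1("the cat sat on the mat", "the cat lay on the mat", n=1))
    print(f"R1 gap: {gap:.2f} points")
```

If this simplified score on the released outputs is also well below the reported range, the data format or preprocessing (for example tokenization or how the summaries are split) is a more likely suspect than the ROUGE installation itself.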
 

_Originally posted by @suwu-suwu in https://github.com/dqwang122/HeterSumGraph/issues/32#issuecomment-1217666084_
            
