--------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.17292 (95%-conf.int. 0.16449 - 0.17925) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.40014 (95%-conf.int. 0.36428 - 0.43572) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.24103 (95%-conf.int. 0.22863 - 0.25228) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.02654 (95%-conf.int. 0.01250 - 0.03600) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.06665 (95%-conf.int. 0.03333 - 0.08333) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.03787 (95%-conf.int. 0.01818 - 0.05013) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.03899 (95%-conf.int. 0.03395 - 0.04374) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.22227 (95%-conf.int. 0.19074 - 0.25370) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.06601 (95%-conf.int. 0.05823 - 0.07190) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.07476 (95%-conf.int. 0.06710 - 0.08324) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.30026 (95%-conf.int. 0.27500 - 0.31250) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.11953 (95%-conf.int. 0.10817 - 0.13133) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.00935 (95%-conf.int. 0.00748 - 0.01175) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.15579 (95%-conf.int. 0.13333 - 0.16667) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.01761 (95%-conf.int. 0.01417 - 0.02187) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.07476 (95%-conf.int. 0.06710 - 0.08324) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.30026 (95%-conf.int. 0.27500 - 0.31250) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.11953 (95%-conf.int. 0.10817 - 0.13133) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.00935 (95%-conf.int. 0.00748 - 0.01175) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.15579 (95%-conf.int. 0.13333 - 0.16667) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.01761 (95%-conf.int. 0.01417 - 0.02187) --------------------------------------------- baseline ROUGE-1 Average_R: 0.44562 (95%-conf.int. 0.42552 - 0.46506) baseline ROUGE-1 Average_P: 0.06733 (95%-conf.int. 0.06262 - 0.07196) baseline ROUGE-1 Average_F: 0.11687 (95%-conf.int. 0.10953 - 0.12407) --------------------------------------------- baseline ROUGE-2 Average_R: 0.08069 (95%-conf.int. 0.05985 - 0.10123) baseline ROUGE-2 Average_P: 0.01134 (95%-conf.int. 0.00896 - 0.01368) baseline ROUGE-2 Average_F: 0.01987 (95%-conf.int. 0.01558 - 0.02407) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.15645 (95%-conf.int. 0.14521 - 0.16628) baseline ROUGE-SU* Average_P: 0.00419 (95%-conf.int. 0.00359 - 0.00475) baseline ROUGE-SU* Average_F: 0.00816 (95%-conf.int. 0.00701 - 0.00922) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.18416 (95%-conf.int. 0.17643 - 0.19098) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.70042 (95%-conf.int. 0.66250 - 0.73750) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.29159 (95%-conf.int. 0.27857 - 0.30334) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.07037 (95%-conf.int. 0.05280 - 0.08488) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.33417 (95%-conf.int. 0.25000 - 0.40000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.11623 (95%-conf.int. 0.08716 - 0.14004) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.04175 (95%-conf.int. 0.03923 - 0.04399) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.57835 (95%-conf.int. 0.52222 - 0.62778) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.07785 (95%-conf.int. 0.07281 - 0.08218) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.23679 (95%-conf.int. 0.22975 - 0.24273) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.51453 (95%-conf.int. 0.48572 - 0.53571) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.32424 (95%-conf.int. 0.31318 - 0.33339) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.07037 (95%-conf.int. 0.05280 - 0.08488) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.16708 (95%-conf.int. 0.12500 - 0.20000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.09900 (95%-conf.int. 0.07421 - 0.11917) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.06904 (95%-conf.int. 0.06503 - 0.07204) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.31882 (95%-conf.int. 0.28704 - 0.34259) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.11343 (95%-conf.int. 0.10631 - 0.11879) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.23679 (95%-conf.int. 0.22975 - 0.24273) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.51453 (95%-conf.int. 0.48572 - 0.53571) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.32424 (95%-conf.int. 0.31318 - 0.33339) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.07037 (95%-conf.int. 0.05280 - 0.08488) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.16708 (95%-conf.int. 0.12500 - 0.20000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.09900 (95%-conf.int. 0.07421 - 0.11917) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.06904 (95%-conf.int. 0.06503 - 0.07204) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.31882 (95%-conf.int. 0.28704 - 0.34259) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.11343 (95%-conf.int. 0.10631 - 0.11879) --------------------------------------------- baseline ROUGE-1 Average_R: 0.42080 (95%-conf.int. 0.40119 - 0.44074) baseline ROUGE-1 Average_P: 0.05204 (95%-conf.int. 0.04878 - 0.05529) baseline ROUGE-1 Average_F: 0.09262 (95%-conf.int. 0.08701 - 0.09820) --------------------------------------------- baseline ROUGE-2 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000) baseline ROUGE-2 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000) baseline ROUGE-2 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.18771 (95%-conf.int. 0.17479 - 0.20043) baseline ROUGE-SU* Average_P: 0.00307 (95%-conf.int. 0.00276 - 0.00338) baseline ROUGE-SU* Average_F: 0.00604 (95%-conf.int. 0.00543 - 0.00664) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.21505 (95%-conf.int. 0.19210 - 0.23788) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.38025 (95%-conf.int. 0.34500 - 0.41000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.27414 (95%-conf.int. 0.24897 - 0.29909) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.08491 (95%-conf.int. 0.06273 - 0.10699) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.15570 (95%-conf.int. 0.12222 - 0.18889) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.10963 (95%-conf.int. 0.08278 - 0.13633) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.04614 (95%-conf.int. 0.03688 - 0.05535) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.15943 (95%-conf.int. 0.13426 - 0.18148) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.07130 (95%-conf.int. 0.05827 - 0.08425) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.19229 (95%-conf.int. 0.17606 - 0.20845) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.26169 (95%-conf.int. 0.24231 - 0.27692) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.22118 (95%-conf.int. 0.20633 - 0.23589) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.08491 (95%-conf.int. 0.06273 - 0.10699) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.11677 (95%-conf.int. 0.09166 - 0.14166) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.09808 (95%-conf.int. 0.07436 - 0.12165) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.04167 (95%-conf.int. 0.03492 - 0.04837) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.08677 (95%-conf.int. 0.07445 - 0.09611) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.05604 (95%-conf.int. 0.04841 - 0.06358) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.21445 (95%-conf.int. 0.20289 - 0.22579) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.16529 (95%-conf.int. 0.15652 - 0.17608) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.18627 (95%-conf.int. 0.17914 - 0.19236) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.08491 (95%-conf.int. 0.06273 - 0.10699) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.06370 (95%-conf.int. 0.05000 - 0.07728) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.07260 (95%-conf.int. 0.05555 - 0.08954) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.04992 (95%-conf.int. 0.04484 - 0.05498) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.03421 (95%-conf.int. 0.03091 - 0.03800) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.04039 (95%-conf.int. 0.03712 - 0.04305) --------------------------------------------- baseline ROUGE-1 Average_R: 0.50545 (95%-conf.int. 0.49425 - 0.51474) baseline ROUGE-1 Average_P: 0.08113 (95%-conf.int. 0.07432 - 0.08919) baseline ROUGE-1 Average_F: 0.13967 (95%-conf.int. 0.12936 - 0.15162) --------------------------------------------- baseline ROUGE-2 Average_R: 0.10633 (95%-conf.int. 0.09652 - 0.11851) baseline ROUGE-2 Average_P: 0.01639 (95%-conf.int. 0.01364 - 0.01909) baseline ROUGE-2 Average_F: 0.02837 (95%-conf.int. 0.02390 - 0.03294) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.22029 (95%-conf.int. 0.21577 - 0.22576) baseline ROUGE-SU* Average_P: 0.00676 (95%-conf.int. 0.00584 - 0.00767) baseline ROUGE-SU* Average_F: 0.01311 (95%-conf.int. 0.01137 - 0.01483) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.17145 (95%-conf.int. 0.15385 - 0.19287) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.28556 (95%-conf.int. 0.25000 - 0.32143) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.21355 (95%-conf.int. 0.20000 - 0.24090) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.07118 (95%-conf.int. 0.05556 - 0.08685) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.12492 (95%-conf.int. 0.11111 - 0.15278) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.09034 (95%-conf.int. 0.07408 - 0.11036) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.03765 (95%-conf.int. 0.03051 - 0.04479) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.11103 (95%-conf.int. 0.09259 - 0.12963) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.05558 (95%-conf.int. 0.04787 - 0.06336) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.15133 (95%-conf.int. 0.12821 - 0.17444) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.24993 (95%-conf.int. 0.23810 - 0.27381) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.18786 (95%-conf.int. 0.16667 - 0.20909) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.07118 (95%-conf.int. 0.05556 - 0.08685) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.12492 (95%-conf.int. 0.11111 - 0.15278) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.09034 (95%-conf.int. 0.07408 - 0.11036) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.03219 (95%-conf.int. 0.02373 - 0.04062) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.09256 (95%-conf.int. 0.08642 - 0.10494) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.04717 (95%-conf.int. 0.03724 - 0.05708) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.15133 (95%-conf.int. 0.12821 - 0.17444) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.24993 (95%-conf.int. 0.23810 - 0.27381) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.18786 (95%-conf.int. 0.16667 - 0.20909) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.07118 (95%-conf.int. 0.05556 - 0.08685) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.12492 (95%-conf.int. 0.11111 - 0.15278) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.09034 (95%-conf.int. 0.07408 - 0.11036) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.03219 (95%-conf.int. 0.02373 - 0.04062) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.09256 (95%-conf.int. 0.08642 - 0.10494) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.04717 (95%-conf.int. 0.03724 - 0.05708) --------------------------------------------- baseline ROUGE-1 Average_R: 0.47368 (95%-conf.int. 0.43590 - 0.52277) baseline ROUGE-1 Average_P: 0.06395 (95%-conf.int. 0.06202 - 0.06589) baseline ROUGE-1 Average_F: 0.11252 (95%-conf.int. 0.11056 - 0.11448) --------------------------------------------- baseline ROUGE-2 Average_R: 0.12046 (95%-conf.int. 0.08333 - 0.15757) baseline ROUGE-2 Average_P: 0.01470 (95%-conf.int. 0.01176 - 0.01765) baseline ROUGE-2 Average_F: 0.02616 (95%-conf.int. 0.02061 - 0.03173) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.21017 (95%-conf.int. 0.17627 - 0.25969) baseline ROUGE-SU* Average_P: 0.00441 (95%-conf.int. 0.00419 - 0.00463) baseline ROUGE-SU* Average_F: 0.00862 (95%-conf.int. 0.00823 - 0.00902) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.30701 (95%-conf.int. 0.28571 - 0.35000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.65681 (95%-conf.int. 0.56250 - 0.75000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.41296 (95%-conf.int. 0.39112 - 0.43396) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.07365 (95%-conf.int. 0.06386 - 0.08333) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.17893 (95%-conf.int. 0.11905 - 0.23810) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.10311 (95%-conf.int. 0.08251 - 0.12345) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.08543 (95%-conf.int. 0.06446 - 0.11869) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.50071 (95%-conf.int. 0.38095 - 0.61905) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.14102 (95%-conf.int. 0.11274 - 0.17932) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.30701 (95%-conf.int. 0.28571 - 0.35000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.65681 (95%-conf.int. 0.56250 - 0.75000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.41296 (95%-conf.int. 0.39112 - 0.43396) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.07365 (95%-conf.int. 0.06386 - 0.08333) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.17893 (95%-conf.int. 0.11905 - 0.23810) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.10311 (95%-conf.int. 0.08251 - 0.12345) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.06705 (95%-conf.int. 0.04871 - 0.09390) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.39347 (95%-conf.int. 0.29048 - 0.49524) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.11067 (95%-conf.int. 0.08524 - 0.14191) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.35551 (95%-conf.int. 0.30953 - 0.42222) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.75039 (95%-conf.int. 0.68750 - 0.81250) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.47601 (95%-conf.int. 0.44402 - 0.52172) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.10717 (95%-conf.int. 0.09348 - 0.12083) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.25037 (95%-conf.int. 0.19048 - 0.30952) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.14792 (95%-conf.int. 0.12729 - 0.16667) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.08728 (95%-conf.int. 0.06277 - 0.13284) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.48617 (95%-conf.int. 0.40952 - 0.56190) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.14260 (95%-conf.int. 0.10971 - 0.19943) --------------------------------------------- baseline ROUGE-1 Average_R: 0.50615 (95%-conf.int. 0.47506 - 0.55159) baseline ROUGE-1 Average_P: 0.10683 (95%-conf.int. 0.08740 - 0.12602) baseline ROUGE-1 Average_F: 0.17505 (95%-conf.int. 0.14914 - 0.20065) --------------------------------------------- baseline ROUGE-2 Average_R: 0.13237 (95%-conf.int. 0.10781 - 0.15108) baseline ROUGE-2 Average_P: 0.02782 (95%-conf.int. 0.01749 - 0.03498) baseline ROUGE-2 Average_F: 0.04568 (95%-conf.int. 0.02957 - 0.05667) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.21986 (95%-conf.int. 0.17485 - 0.28706) baseline ROUGE-SU* Average_P: 0.01362 (95%-conf.int. 0.01000 - 0.01719) baseline ROUGE-SU* Average_F: 0.02539 (95%-conf.int. 0.01905 - 0.03164) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.22559 (95%-conf.int. 0.20181 - 0.24949) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.53141 (95%-conf.int. 0.50000 - 0.59375) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.31565 (95%-conf.int. 0.28752 - 0.34385) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.05614 (95%-conf.int. 0.04394 - 0.06822) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.14299 (95%-conf.int. 0.10714 - 0.17857) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.08031 (95%-conf.int. 0.06291 - 0.09626) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.05596 (95%-conf.int. 0.04519 - 0.06928) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.32158 (95%-conf.int. 0.28333 - 0.38095) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.09423 (95%-conf.int. 0.07852 - 0.11045) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.22559 (95%-conf.int. 0.20181 - 0.24949) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.53141 (95%-conf.int. 0.50000 - 0.59375) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.31565 (95%-conf.int. 0.28752 - 0.34385) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.05614 (95%-conf.int. 0.04394 - 0.06822) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.14299 (95%-conf.int. 0.10714 - 0.17857) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.08031 (95%-conf.int. 0.06291 - 0.09626) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.05596 (95%-conf.int. 0.04519 - 0.06928) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.32158 (95%-conf.int. 0.28333 - 0.38095) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.09423 (95%-conf.int. 0.07852 - 0.11045) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.27915 (95%-conf.int. 0.24356 - 0.31353) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.47729 (95%-conf.int. 0.43939 - 0.52272) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.35096 (95%-conf.int. 0.31715 - 0.37738) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.07063 (95%-conf.int. 0.05314 - 0.08818) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.12503 (95%-conf.int. 0.10000 - 0.15000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.08986 (95%-conf.int. 0.06939 - 0.11035) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.07890 (95%-conf.int. 0.06069 - 0.10132) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.24234 (95%-conf.int. 0.21025 - 0.27692) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.11708 (95%-conf.int. 0.09588 - 0.13798) --------------------------------------------- baseline ROUGE-1 Average_R: 0.55732 (95%-conf.int. 0.49880 - 0.61171) baseline ROUGE-1 Average_P: 0.09723 (95%-conf.int. 0.08951 - 0.10494) baseline ROUGE-1 Average_F: 0.16526 (95%-conf.int. 0.15371 - 0.17682) --------------------------------------------- baseline ROUGE-2 Average_R: 0.11028 (95%-conf.int. 0.08962 - 0.13234) baseline ROUGE-2 Average_P: 0.01869 (95%-conf.int. 0.01402 - 0.02337) baseline ROUGE-2 Average_F: 0.03191 (95%-conf.int. 0.02418 - 0.03965) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.26890 (95%-conf.int. 0.22528 - 0.31903) baseline ROUGE-SU* Average_P: 0.00931 (95%-conf.int. 0.00795 - 0.01062) baseline ROUGE-SU* Average_F: 0.01795 (95%-conf.int. 0.01548 - 0.02041) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.27419 (95%-conf.int. 0.25397 - 0.29313) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.59988 (95%-conf.int. 0.58333 - 0.61667) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.37587 (95%-conf.int. 0.35476 - 0.39385) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.14899 (95%-conf.int. 0.13077 - 0.17262) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.35981 (95%-conf.int. 0.33000 - 0.39000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.21039 (95%-conf.int. 0.18670 - 0.23844) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.08355 (95%-conf.int. 0.07230 - 0.09786) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.40987 (95%-conf.int. 0.39000 - 0.43000) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.13825 (95%-conf.int. 0.12226 - 0.15705) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.37977 (95%-conf.int. 0.35619 - 0.40261) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.62477 (95%-conf.int. 0.58750 - 0.66250) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.47172 (95%-conf.int. 0.44489 - 0.49241) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.18199 (95%-conf.int. 0.15715 - 0.21541) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.31409 (95%-conf.int. 0.27857 - 0.35000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.23003 (95%-conf.int. 0.20084 - 0.26417) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.15596 (95%-conf.int. 0.13334 - 0.18256) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.43977 (95%-conf.int. 0.39000 - 0.48428) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.22897 (95%-conf.int. 0.19954 - 0.25635) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.44041 (95%-conf.int. 0.41485 - 0.46407) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.52715 (95%-conf.int. 0.49545 - 0.55909) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.47917 (95%-conf.int. 0.45535 - 0.49664) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.19854 (95%-conf.int. 0.16589 - 0.23732) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.23986 (95%-conf.int. 0.20500 - 0.27000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.21682 (95%-conf.int. 0.18365 - 0.25024) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.20232 (95%-conf.int. 0.17399 - 0.23466) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.30758 (95%-conf.int. 0.27539 - 0.33769) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.24243 (95%-conf.int. 0.21542 - 0.26415) --------------------------------------------- baseline ROUGE-1 Average_R: 0.60567 (95%-conf.int. 0.58584 - 0.62578) baseline ROUGE-1 Average_P: 0.06721 (95%-conf.int. 0.06176 - 0.07185) baseline ROUGE-1 Average_F: 0.12093 (95%-conf.int. 0.11170 - 0.12867) --------------------------------------------- baseline ROUGE-2 Average_R: 0.22772 (95%-conf.int. 0.19543 - 0.25519) baseline ROUGE-2 Average_P: 0.02372 (95%-conf.int. 0.01907 - 0.02712) baseline ROUGE-2 Average_F: 0.04294 (95%-conf.int. 0.03479 - 0.04901) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.38062 (95%-conf.int. 0.35955 - 0.40190) baseline ROUGE-SU* Average_P: 0.00538 (95%-conf.int. 0.00449 - 0.00607) baseline ROUGE-SU* Average_F: 0.01059 (95%-conf.int. 0.00887 - 0.01196) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.26449 (95%-conf.int. 0.23346 - 0.31177) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.66672 (95%-conf.int. 0.63889 - 0.70000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.37699 (95%-conf.int. 0.34191 - 0.42658) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.04584 (95%-conf.int. 0.03849 - 0.05342) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.12494 (95%-conf.int. 0.10625 - 0.14375) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.06674 (95%-conf.int. 0.05600 - 0.07699) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.04696 (95%-conf.int. 0.03525 - 0.06865) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.32719 (95%-conf.int. 0.30227 - 0.34318) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.08054 (95%-conf.int. 0.06325 - 0.11171) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.15874 (95%-conf.int. 0.14170 - 0.18668) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.90005 (95%-conf.int. 0.87500 - 0.92500) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.26907 (95%-conf.int. 0.24438 - 0.30896) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.03660 (95%-conf.int. 0.03174 - 0.04177) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.26668 (95%-conf.int. 0.25000 - 0.30000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.06419 (95%-conf.int. 0.05633 - 0.07201) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.02096 (95%-conf.int. 0.01570 - 0.03083) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.71110 (95%-conf.int. 0.68333 - 0.73889) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.04050 (95%-conf.int. 0.03070 - 0.05887) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.15874 (95%-conf.int. 0.14170 - 0.18668) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.90005 (95%-conf.int. 0.87500 - 0.92500) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.26907 (95%-conf.int. 0.24438 - 0.30896) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.03660 (95%-conf.int. 0.03174 - 0.04177) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.26668 (95%-conf.int. 0.25000 - 0.30000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.06419 (95%-conf.int. 0.05633 - 0.07201) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.02096 (95%-conf.int. 0.01570 - 0.03083) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.71110 (95%-conf.int. 0.68333 - 0.73889) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.04050 (95%-conf.int. 0.03070 - 0.05887) --------------------------------------------- baseline ROUGE-1 Average_R: 0.51243 (95%-conf.int. 0.48508 - 0.54381) baseline ROUGE-1 Average_P: 0.21452 (95%-conf.int. 0.19636 - 0.23091) baseline ROUGE-1 Average_F: 0.30127 (95%-conf.int. 0.28552 - 0.31813) --------------------------------------------- baseline ROUGE-2 Average_R: 0.08936 (95%-conf.int. 0.08030 - 0.09874) baseline ROUGE-2 Average_P: 0.03704 (95%-conf.int. 0.02871 - 0.04260) baseline ROUGE-2 Average_F: 0.05220 (95%-conf.int. 0.04179 - 0.06000) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.21969 (95%-conf.int. 0.19226 - 0.26471) baseline ROUGE-SU* Average_P: 0.04716 (95%-conf.int. 0.03983 - 0.05286) baseline ROUGE-SU* Average_F: 0.07648 (95%-conf.int. 0.06753 - 0.08398) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.27331 (95%-conf.int. 0.22386 - 0.35326) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.47489 (95%-conf.int. 0.45625 - 0.49375) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.34276 (95%-conf.int. 0.30034 - 0.40662) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.14139 (95%-conf.int. 0.10774 - 0.19338) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.25701 (95%-conf.int. 0.23572 - 0.27857) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.17975 (95%-conf.int. 0.14787 - 0.22497) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.07453 (95%-conf.int. 0.04307 - 0.13368) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.26846 (95%-conf.int. 0.24857 - 0.28715) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.10869 (95%-conf.int. 0.07348 - 0.17216) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.27331 (95%-conf.int. 0.22386 - 0.35326) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.47489 (95%-conf.int. 0.45625 - 0.49375) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.34276 (95%-conf.int. 0.30034 - 0.40662) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.14139 (95%-conf.int. 0.10774 - 0.19338) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.25701 (95%-conf.int. 0.23572 - 0.27857) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.17975 (95%-conf.int. 0.14787 - 0.22497) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.07453 (95%-conf.int. 0.04307 - 0.13368) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.26846 (95%-conf.int. 0.24857 - 0.28715) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.10869 (95%-conf.int. 0.07348 - 0.17216) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.24167 (95%-conf.int. 0.20852 - 0.29597) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.42509 (95%-conf.int. 0.41250 - 0.43750) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.30463 (95%-conf.int. 0.27976 - 0.34222) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.12542 (95%-conf.int. 0.09742 - 0.16881) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.22850 (95%-conf.int. 0.21429 - 0.24286) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.15960 (95%-conf.int. 0.13394 - 0.19630) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.05583 (95%-conf.int. 0.03618 - 0.09352) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.21148 (95%-conf.int. 0.20000 - 0.22143) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.08262 (95%-conf.int. 0.06172 - 0.12135) --------------------------------------------- baseline ROUGE-1 Average_R: 0.34573 (95%-conf.int. 0.32152 - 0.37216) baseline ROUGE-1 Average_P: 0.09626 (95%-conf.int. 0.08270 - 0.10866) baseline ROUGE-1 Average_F: 0.14961 (95%-conf.int. 0.13380 - 0.16519) --------------------------------------------- baseline ROUGE-2 Average_R: 0.07728 (95%-conf.int. 0.06495 - 0.09831) baseline ROUGE-2 Average_P: 0.01961 (95%-conf.int. 0.01961 - 0.01961) baseline ROUGE-2 Average_F: 0.03101 (95%-conf.int. 0.03013 - 0.03239) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.10994 (95%-conf.int. 0.08586 - 0.14859) baseline ROUGE-SU* Average_P: 0.01207 (95%-conf.int. 0.00937 - 0.01430) baseline ROUGE-SU* Average_F: 0.02135 (95%-conf.int. 0.01716 - 0.02492) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.21640 (95%-conf.int. 0.20400 - 0.23587) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.38021 (95%-conf.int. 0.34500 - 0.41000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.27379 (95%-conf.int. 0.26701 - 0.28046) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.07222 (95%-conf.int. 0.06581 - 0.07955) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.13349 (95%-conf.int. 0.11667 - 0.15556) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.09295 (95%-conf.int. 0.08628 - 0.10101) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.04738 (95%-conf.int. 0.04076 - 0.05976) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.18905 (95%-conf.int. 0.15833 - 0.21389) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.07367 (95%-conf.int. 0.06711 - 0.08422) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.21640 (95%-conf.int. 0.20400 - 0.23587) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.38021 (95%-conf.int. 0.34500 - 0.41000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.27379 (95%-conf.int. 0.26701 - 0.28046) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.07222 (95%-conf.int. 0.06581 - 0.07955) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.13349 (95%-conf.int. 0.11667 - 0.15556) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.09295 (95%-conf.int. 0.08628 - 0.10101) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.04738 (95%-conf.int. 0.04076 - 0.05976) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.18905 (95%-conf.int. 0.15833 - 0.21389) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.07367 (95%-conf.int. 0.06711 - 0.08422) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.21640 (95%-conf.int. 0.20400 - 0.23587) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.38021 (95%-conf.int. 0.34500 - 0.41000) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.27379 (95%-conf.int. 0.26701 - 0.28046) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.07222 (95%-conf.int. 0.06581 - 0.07955) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.13349 (95%-conf.int. 0.11667 - 0.15556) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.09295 (95%-conf.int. 0.08628 - 0.10101) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.04738 (95%-conf.int. 0.04076 - 0.05976) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.18905 (95%-conf.int. 0.15833 - 0.21389) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.07367 (95%-conf.int. 0.06711 - 0.08422) --------------------------------------------- baseline ROUGE-1 Average_R: 0.59808 (95%-conf.int. 0.58434 - 0.61414) baseline ROUGE-1 Average_P: 0.07741 (95%-conf.int. 0.06642 - 0.08540) baseline ROUGE-1 Average_F: 0.13670 (95%-conf.int. 0.11951 - 0.14909) --------------------------------------------- baseline ROUGE-2 Average_R: 0.07276 (95%-conf.int. 0.06598 - 0.08053) baseline ROUGE-2 Average_P: 0.00882 (95%-conf.int. 0.00809 - 0.00919) baseline ROUGE-2 Average_F: 0.01568 (95%-conf.int. 0.01458 - 0.01635) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.29356 (95%-conf.int. 0.28362 - 0.31209) baseline ROUGE-SU* Average_P: 0.00707 (95%-conf.int. 0.00518 - 0.00834) baseline ROUGE-SU* Average_F: 0.01378 (95%-conf.int. 0.01016 - 0.01621) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.21852 (95%-conf.int. 0.19262 - 0.25241) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.35574 (95%-conf.int. 0.33333 - 0.38889) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.27029 (95%-conf.int. 0.24561 - 0.30596) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.11755 (95%-conf.int. 0.10105 - 0.14017) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.20013 (95%-conf.int. 0.18750 - 0.22500) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.14782 (95%-conf.int. 0.13131 - 0.17263) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.05979 (95%-conf.int. 0.04821 - 0.07323) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.16830 (95%-conf.int. 0.15341 - 0.18864) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.08777 (95%-conf.int. 0.07386 - 0.10530) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.27342 (95%-conf.int. 0.23996 - 0.30869) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.50021 (95%-conf.int. 0.47500 - 0.53750) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.35300 (95%-conf.int. 0.31882 - 0.39328) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.16222 (95%-conf.int. 0.13473 - 0.19498) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.31451 (95%-conf.int. 0.28571 - 0.35714) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.21364 (95%-conf.int. 0.18309 - 0.25210) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.07475 (95%-conf.int. 0.05861 - 0.09128) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.26305 (95%-conf.int. 0.24000 - 0.29714) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.11587 (95%-conf.int. 0.09431 - 0.14024) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.39452 (95%-conf.int. 0.36619 - 0.42246) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.30529 (95%-conf.int. 0.29737 - 0.31316) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.34362 (95%-conf.int. 0.33161 - 0.35867) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.13186 (95%-conf.int. 0.11790 - 0.14784) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.10003 (95%-conf.int. 0.09722 - 0.10555) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.11353 (95%-conf.int. 0.10655 - 0.12309) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.15696 (95%-conf.int. 0.13590 - 0.17776) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.10372 (95%-conf.int. 0.09868 - 0.10873) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.12412 (95%-conf.int. 0.11672 - 0.13309) --------------------------------------------- baseline ROUGE-1 Average_R: 0.40807 (95%-conf.int. 0.37869 - 0.44694) baseline ROUGE-1 Average_P: 0.10716 (95%-conf.int. 0.10178 - 0.11250) baseline ROUGE-1 Average_F: 0.16955 (95%-conf.int. 0.16266 - 0.17849) --------------------------------------------- baseline ROUGE-2 Average_R: 0.13186 (95%-conf.int. 0.11790 - 0.14784) baseline ROUGE-2 Average_P: 0.03274 (95%-conf.int. 0.03182 - 0.03454) baseline ROUGE-2 Average_F: 0.05238 (95%-conf.int. 0.05011 - 0.05598) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.16094 (95%-conf.int. 0.14301 - 0.18495) baseline ROUGE-SU* Average_P: 0.01267 (95%-conf.int. 0.01141 - 0.01376) baseline ROUGE-SU* Average_F: 0.02344 (95%-conf.int. 0.02123 - 0.02552) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.21330 (95%-conf.int. 0.20402 - 0.22562) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.46013 (95%-conf.int. 0.44000 - 0.48500) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.29133 (95%-conf.int. 0.28040 - 0.30760) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.06804 (95%-conf.int. 0.06336 - 0.07189) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.15567 (95%-conf.int. 0.14445 - 0.16667) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.09464 (95%-conf.int. 0.08811 - 0.10020) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.04089 (95%-conf.int. 0.03731 - 0.04424) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.18900 (95%-conf.int. 0.17686 - 0.20185) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.06712 (95%-conf.int. 0.06210 - 0.07209) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.18562 (95%-conf.int. 0.16809 - 0.20312) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.50030 (95%-conf.int. 0.45625 - 0.54375) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.27065 (95%-conf.int. 0.24579 - 0.29574) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.04884 (95%-conf.int. 0.03890 - 0.05861) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.14311 (95%-conf.int. 0.11428 - 0.17143) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.07279 (95%-conf.int. 0.05801 - 0.08732) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.04029 (95%-conf.int. 0.03356 - 0.04647) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.28599 (95%-conf.int. 0.25000 - 0.32143) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.07055 (95%-conf.int. 0.05937 - 0.08035) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.31553 (95%-conf.int. 0.29442 - 0.33342) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.30901 (95%-conf.int. 0.29545 - 0.32273) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.31205 (95%-conf.int. 0.29570 - 0.32522) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.09729 (95%-conf.int. 0.08919 - 0.10477) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.09518 (95%-conf.int. 0.08809 - 0.10238) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.09616 (95%-conf.int. 0.08922 - 0.10286) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.10858 (95%-conf.int. 0.09422 - 0.12110) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.10710 (95%-conf.int. 0.09802 - 0.11528) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.10753 (95%-conf.int. 0.09634 - 0.11591) --------------------------------------------- baseline ROUGE-1 Average_R: 0.28744 (95%-conf.int. 0.27191 - 0.30088) baseline ROUGE-1 Average_P: 0.07474 (95%-conf.int. 0.07048 - 0.07892) baseline ROUGE-1 Average_F: 0.11859 (95%-conf.int. 0.11223 - 0.12482) --------------------------------------------- baseline ROUGE-2 Average_R: 0.05835 (95%-conf.int. 0.04831 - 0.06867) baseline ROUGE-2 Average_P: 0.01466 (95%-conf.int. 0.01220 - 0.01707) baseline ROUGE-2 Average_F: 0.02342 (95%-conf.int. 0.01948 - 0.02735) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.10681 (95%-conf.int. 0.09843 - 0.11642) baseline ROUGE-SU* Average_P: 0.00770 (95%-conf.int. 0.00683 - 0.00853) baseline ROUGE-SU* Average_F: 0.01435 (95%-conf.int. 0.01282 - 0.01587) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.18392 (95%-conf.int. 0.16528 - 0.19810) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.52516 (95%-conf.int. 0.48750 - 0.57500) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.27173 (95%-conf.int. 0.24770 - 0.28822) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.08178 (95%-conf.int. 0.07073 - 0.09347) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.25732 (95%-conf.int. 0.21429 - 0.30000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.12380 (95%-conf.int. 0.10764 - 0.14251) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.03669 (95%-conf.int. 0.03186 - 0.04111) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.33730 (95%-conf.int. 0.30428 - 0.38000) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.06594 (95%-conf.int. 0.05768 - 0.07286) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.18392 (95%-conf.int. 0.16528 - 0.19810) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.52516 (95%-conf.int. 0.48750 - 0.57500) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.27173 (95%-conf.int. 0.24770 - 0.28822) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.09124 (95%-conf.int. 0.07534 - 0.10385) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.28586 (95%-conf.int. 0.23572 - 0.33571) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.13797 (95%-conf.int. 0.11554 - 0.15963) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.03932 (95%-conf.int. 0.03321 - 0.04476) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.36014 (95%-conf.int. 0.32143 - 0.40286) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.07065 (95%-conf.int. 0.06010 - 0.07931) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.34894 (95%-conf.int. 0.33940 - 0.35857) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.08893 (95%-conf.int. 0.08223 - 0.09555) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.14143 (95%-conf.int. 0.13365 - 0.14913) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.09943 (95%-conf.int. 0.09224 - 0.10687) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.02474 (95%-conf.int. 0.02078 - 0.02865) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.03953 (95%-conf.int. 0.03387 - 0.04514) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.12742 (95%-conf.int. 0.12225 - 0.13264) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.01017 (95%-conf.int. 0.00881 - 0.01152) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.01879 (95%-conf.int. 0.01650 - 0.02105) --------------------------------------------- baseline ROUGE-1 Average_R: 0.50415 (95%-conf.int. 0.49810 - 0.51093) baseline ROUGE-1 Average_P: 0.12754 (95%-conf.int. 0.11429 - 0.14066) baseline ROUGE-1 Average_F: 0.20317 (95%-conf.int. 0.18607 - 0.22009) --------------------------------------------- baseline ROUGE-2 Average_R: 0.21659 (95%-conf.int. 0.20326 - 0.22978) baseline ROUGE-2 Average_P: 0.05338 (95%-conf.int. 0.04500 - 0.06167) baseline ROUGE-2 Average_F: 0.08547 (95%-conf.int. 0.07371 - 0.09709) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.21313 (95%-conf.int. 0.21055 - 0.21573) baseline ROUGE-SU* Average_P: 0.01674 (95%-conf.int. 0.01408 - 0.01936) baseline ROUGE-SU* Average_F: 0.03096 (95%-conf.int. 0.02641 - 0.03547) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.16169 (95%-conf.int. 0.15107 - 0.17273) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.44971 (95%-conf.int. 0.41250 - 0.48750) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.23716 (95%-conf.int. 0.22362 - 0.24881) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.02885 (95%-conf.int. 0.02182 - 0.03580) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.08577 (95%-conf.int. 0.07143 - 0.10000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.04306 (95%-conf.int. 0.03334 - 0.05266) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.02886 (95%-conf.int. 0.02471 - 0.03390) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.25689 (95%-conf.int. 0.22286 - 0.29286) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.05168 (95%-conf.int. 0.04454 - 0.06016) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.12637 (95%-conf.int. 0.11504 - 0.13756) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.46651 (95%-conf.int. 0.44167 - 0.49167) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.19837 (95%-conf.int. 0.18447 - 0.21211) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.03840 (95%-conf.int. 0.03160 - 0.04683) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.16007 (95%-conf.int. 0.15000 - 0.18000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.06179 (95%-conf.int. 0.05218 - 0.07425) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.01817 (95%-conf.int. 0.01509 - 0.02190) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.27984 (95%-conf.int. 0.24750 - 0.31000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.03403 (95%-conf.int. 0.02862 - 0.04052) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.20740 (95%-conf.int. 0.18878 - 0.22582) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.28738 (95%-conf.int. 0.26875 - 0.30625) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.24004 (95%-conf.int. 0.22328 - 0.25232) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.03840 (95%-conf.int. 0.03160 - 0.04683) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.05336 (95%-conf.int. 0.05000 - 0.06000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.04449 (95%-conf.int. 0.03869 - 0.05253) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.04405 (95%-conf.int. 0.03635 - 0.05250) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.10068 (95%-conf.int. 0.08778 - 0.11222) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.06072 (95%-conf.int. 0.05146 - 0.06914) --------------------------------------------- baseline ROUGE-1 Average_R: 0.45585 (95%-conf.int. 0.43280 - 0.47163) baseline ROUGE-1 Average_P: 0.13065 (95%-conf.int. 0.11731 - 0.14423) baseline ROUGE-1 Average_F: 0.20256 (95%-conf.int. 0.18561 - 0.21954) --------------------------------------------- baseline ROUGE-2 Average_R: 0.10441 (95%-conf.int. 0.08066 - 0.12686) baseline ROUGE-2 Average_P: 0.02858 (95%-conf.int. 0.02208 - 0.03312) baseline ROUGE-2 Average_F: 0.04474 (95%-conf.int. 0.03461 - 0.05181) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.21393 (95%-conf.int. 0.19386 - 0.22943) baseline ROUGE-SU* Average_P: 0.02192 (95%-conf.int. 0.01870 - 0.02519) baseline ROUGE-SU* Average_F: 0.03963 (95%-conf.int. 0.03443 - 0.04492) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.24516 (95%-conf.int. 0.23369 - 0.26476) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.40656 (95%-conf.int. 0.33333 - 0.45833) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.30411 (95%-conf.int. 0.27191 - 0.33357) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.08078 (95%-conf.int. 0.07549 - 0.08741) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.14301 (95%-conf.int. 0.10714 - 0.17857) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.10253 (95%-conf.int. 0.08836 - 0.11731) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.07297 (95%-conf.int. 0.06896 - 0.07994) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.23601 (95%-conf.int. 0.15715 - 0.28571) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.10934 (95%-conf.int. 0.09232 - 0.12432) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.28757 (95%-conf.int. 0.25861 - 0.31666) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.62525 (95%-conf.int. 0.58334 - 0.66667) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.39158 (95%-conf.int. 0.36794 - 0.41523) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.12138 (95%-conf.int. 0.10683 - 0.13575) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.30042 (95%-conf.int. 0.23333 - 0.36667) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.17183 (95%-conf.int. 0.14550 - 0.19785) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.06554 (95%-conf.int. 0.05634 - 0.07882) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.35031 (95%-conf.int. 0.28333 - 0.40000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.10838 (95%-conf.int. 0.09818 - 0.12125) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.39782 (95%-conf.int. 0.37247 - 0.41875) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.23875 (95%-conf.int. 0.20076 - 0.25758) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.29670 (95%-conf.int. 0.26621 - 0.31779) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.08078 (95%-conf.int. 0.07549 - 0.08741) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.04767 (95%-conf.int. 0.03572 - 0.05952) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.05956 (95%-conf.int. 0.04834 - 0.07078) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.15666 (95%-conf.int. 0.14814 - 0.16396) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.06952 (95%-conf.int. 0.04762 - 0.08135) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.09442 (95%-conf.int. 0.07140 - 0.10679) --------------------------------------------- baseline ROUGE-1 Average_R: 0.57974 (95%-conf.int. 0.52976 - 0.62203) baseline ROUGE-1 Average_P: 0.07530 (95%-conf.int. 0.05744 - 0.08576) baseline ROUGE-1 Average_F: 0.13299 (95%-conf.int. 0.10340 - 0.14991) --------------------------------------------- baseline ROUGE-2 Average_R: 0.17868 (95%-conf.int. 0.13462 - 0.21068) baseline ROUGE-2 Average_P: 0.02208 (95%-conf.int. 0.01389 - 0.02614) baseline ROUGE-2 Average_F: 0.03922 (95%-conf.int. 0.02510 - 0.04651) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.36870 (95%-conf.int. 0.29974 - 0.41491) baseline ROUGE-SU* Average_P: 0.00809 (95%-conf.int. 0.00460 - 0.01002) baseline ROUGE-SU* Average_F: 0.01581 (95%-conf.int. 0.00905 - 0.01956) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.26767 (95%-conf.int. 0.24607 - 0.29190) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.51096 (95%-conf.int. 0.47222 - 0.54445) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.34956 (95%-conf.int. 0.33330 - 0.36736) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.11254 (95%-conf.int. 0.09701 - 0.13240) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.22502 (95%-conf.int. 0.21875 - 0.23750) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.14914 (95%-conf.int. 0.13439 - 0.16571) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.07097 (95%-conf.int. 0.05810 - 0.08884) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.30430 (95%-conf.int. 0.26136 - 0.33864) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.11249 (95%-conf.int. 0.09785 - 0.12946) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.22183 (95%-conf.int. 0.20087 - 0.24682) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.47486 (95%-conf.int. 0.45000 - 0.50000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.30088 (95%-conf.int. 0.28155 - 0.32000) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.11236 (95%-conf.int. 0.09484 - 0.13103) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.25701 (95%-conf.int. 0.23572 - 0.27857) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.15546 (95%-conf.int. 0.13609 - 0.17450) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.05301 (95%-conf.int. 0.04160 - 0.06964) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.27988 (95%-conf.int. 0.25143 - 0.30857) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.08722 (95%-conf.int. 0.07237 - 0.10676) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.27897 (95%-conf.int. 0.25809 - 0.30526) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.25250 (95%-conf.int. 0.23421 - 0.27106) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.26363 (95%-conf.int. 0.25193 - 0.27274) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.12343 (95%-conf.int. 0.11019 - 0.13682) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.11104 (95%-conf.int. 0.10000 - 0.12222) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.11618 (95%-conf.int. 0.10756 - 0.12267) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.07786 (95%-conf.int. 0.06237 - 0.09994) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.07719 (95%-conf.int. 0.06640 - 0.08651) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.07498 (95%-conf.int. 0.06784 - 0.07951) --------------------------------------------- baseline ROUGE-1 Average_R: 0.40569 (95%-conf.int. 0.37561 - 0.43558) baseline ROUGE-1 Average_P: 0.08135 (95%-conf.int. 0.07326 - 0.08779) baseline ROUGE-1 Average_F: 0.13516 (95%-conf.int. 0.12417 - 0.14505) --------------------------------------------- baseline ROUGE-2 Average_R: 0.06106 (95%-conf.int. 0.05743 - 0.06453) baseline ROUGE-2 Average_P: 0.01176 (95%-conf.int. 0.01000 - 0.01353) baseline ROUGE-2 Average_F: 0.01966 (95%-conf.int. 0.01711 - 0.02229) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.15264 (95%-conf.int. 0.13011 - 0.17980) baseline ROUGE-SU* Average_P: 0.00785 (95%-conf.int. 0.00639 - 0.00896) baseline ROUGE-SU* Average_F: 0.01487 (95%-conf.int. 0.01224 - 0.01691) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.25237 (95%-conf.int. 0.21786 - 0.29441) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.53595 (95%-conf.int. 0.46429 - 0.59524) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.34243 (95%-conf.int. 0.29688 - 0.39560) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.07275 (95%-conf.int. 0.04250 - 0.10263) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.16700 (95%-conf.int. 0.09722 - 0.22222) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.10111 (95%-conf.int. 0.06034 - 0.14039) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.06491 (95%-conf.int. 0.04484 - 0.08445) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.30588 (95%-conf.int. 0.20679 - 0.37037) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.10674 (95%-conf.int. 0.07350 - 0.13726) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.26904 (95%-conf.int. 0.22674 - 0.31742) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.57174 (95%-conf.int. 0.47619 - 0.64286) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.36511 (95%-conf.int. 0.30469 - 0.42655) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.10809 (95%-conf.int. 0.07039 - 0.14079) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.25032 (95%-conf.int. 0.15278 - 0.31944) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.15062 (95%-conf.int. 0.09637 - 0.19273) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.08039 (95%-conf.int. 0.05367 - 0.10436) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.38003 (95%-conf.int. 0.25000 - 0.45679) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.13226 (95%-conf.int. 0.08797 - 0.16970) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.26904 (95%-conf.int. 0.22674 - 0.31742) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.57174 (95%-conf.int. 0.47619 - 0.64286) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.36511 (95%-conf.int. 0.30469 - 0.42655) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.10809 (95%-conf.int. 0.07039 - 0.14079) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.25032 (95%-conf.int. 0.15278 - 0.31944) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.15062 (95%-conf.int. 0.09637 - 0.19273) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.08039 (95%-conf.int. 0.05367 - 0.10436) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.38003 (95%-conf.int. 0.25000 - 0.45679) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.13226 (95%-conf.int. 0.08797 - 0.16970) --------------------------------------------- baseline ROUGE-1 Average_R: 0.50010 (95%-conf.int. 0.47547 - 0.52380) baseline ROUGE-1 Average_P: 0.09371 (95%-conf.int. 0.08334 - 0.10417) baseline ROUGE-1 Average_F: 0.15762 (95%-conf.int. 0.14206 - 0.17197) --------------------------------------------- baseline ROUGE-2 Average_R: 0.07020 (95%-conf.int. 0.03974 - 0.09375) baseline ROUGE-2 Average_P: 0.01263 (95%-conf.int. 0.00738 - 0.01688) baseline ROUGE-2 Average_F: 0.02138 (95%-conf.int. 0.01242 - 0.02838) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.22704 (95%-conf.int. 0.21499 - 0.24277) baseline ROUGE-SU* Average_P: 0.00910 (95%-conf.int. 0.00777 - 0.01055) baseline ROUGE-SU* Average_F: 0.01749 (95%-conf.int. 0.01501 - 0.02014) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.19789 (95%-conf.int. 0.16382 - 0.25517) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.62461 (95%-conf.int. 0.56250 - 0.68750) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.29446 (95%-conf.int. 0.26237 - 0.34524) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.05456 (95%-conf.int. 0.03602 - 0.08040) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.17865 (95%-conf.int. 0.15476 - 0.19048) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.08169 (95%-conf.int. 0.05834 - 0.11114) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.02880 (95%-conf.int. 0.01812 - 0.04873) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.37101 (95%-conf.int. 0.30477 - 0.43810) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.05102 (95%-conf.int. 0.03454 - 0.08117) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.17922 (95%-conf.int. 0.14512 - 0.23431) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.64256 (95%-conf.int. 0.59524 - 0.69048) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.27491 (95%-conf.int. 0.23728 - 0.33222) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.05456 (95%-conf.int. 0.03602 - 0.08040) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.20842 (95%-conf.int. 0.18056 - 0.22222) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.08467 (95%-conf.int. 0.05997 - 0.11621) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.02933 (95%-conf.int. 0.01754 - 0.05155) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.47182 (95%-conf.int. 0.40740 - 0.53704) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.05313 (95%-conf.int. 0.03384 - 0.08912) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.21709 (95%-conf.int. 0.18278 - 0.27465) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.49971 (95%-conf.int. 0.45454 - 0.54545) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.29583 (95%-conf.int. 0.27319 - 0.33505) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.05456 (95%-conf.int. 0.03602 - 0.08040) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.12505 (95%-conf.int. 0.10833 - 0.13333) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.07397 (95%-conf.int. 0.05396 - 0.09832) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.04188 (95%-conf.int. 0.02542 - 0.07281) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.28433 (95%-conf.int. 0.23846 - 0.33077) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.06778 (95%-conf.int. 0.04659 - 0.10625) --------------------------------------------- baseline ROUGE-1 Average_R: 0.50808 (95%-conf.int. 0.48074 - 0.54429) baseline ROUGE-1 Average_P: 0.12720 (95%-conf.int. 0.09905 - 0.15252) baseline ROUGE-1 Average_F: 0.20084 (95%-conf.int. 0.16456 - 0.23291) --------------------------------------------- baseline ROUGE-2 Average_R: 0.09353 (95%-conf.int. 0.07454 - 0.12388) baseline ROUGE-2 Average_P: 0.02142 (95%-conf.int. 0.01905 - 0.02381) baseline ROUGE-2 Average_F: 0.03428 (95%-conf.int. 0.03140 - 0.03772) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.21329 (95%-conf.int. 0.17813 - 0.27163) baseline ROUGE-SU* Average_P: 0.02055 (95%-conf.int. 0.01289 - 0.02652) baseline ROUGE-SU* Average_F: 0.03663 (95%-conf.int. 0.02381 - 0.04652) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.17837 (95%-conf.int. 0.15673 - 0.20000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.56290 (95%-conf.int. 0.50000 - 0.62500) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.26981 (95%-conf.int. 0.24280 - 0.29670) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.06498 (95%-conf.int. 0.05238 - 0.07758) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.25019 (95%-conf.int. 0.22222 - 0.30555) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.10274 (95%-conf.int. 0.08467 - 0.12077) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.03669 (95%-conf.int. 0.02857 - 0.04571) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.38924 (95%-conf.int. 0.33333 - 0.44444) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.06648 (95%-conf.int. 0.05302 - 0.08018) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.15995 (95%-conf.int. 0.13245 - 0.18750) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.50027 (95%-conf.int. 0.43750 - 0.56250) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.24137 (95%-conf.int. 0.20526 - 0.27747) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.04510 (95%-conf.int. 0.02619 - 0.06406) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.16669 (95%-conf.int. 0.11111 - 0.22222) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.07065 (95%-conf.int. 0.04233 - 0.09903) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.03014 (95%-conf.int. 0.01992 - 0.04377) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.30574 (95%-conf.int. 0.25000 - 0.35185) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.05434 (95%-conf.int. 0.03696 - 0.07654) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.15995 (95%-conf.int. 0.13245 - 0.18750) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.50027 (95%-conf.int. 0.43750 - 0.56250) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.24137 (95%-conf.int. 0.20526 - 0.27747) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.04510 (95%-conf.int. 0.02619 - 0.06406) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.16669 (95%-conf.int. 0.11111 - 0.22222) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.07065 (95%-conf.int. 0.04233 - 0.09903) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.03014 (95%-conf.int. 0.01992 - 0.04377) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.30574 (95%-conf.int. 0.25000 - 0.35185) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.05434 (95%-conf.int. 0.03696 - 0.07654) --------------------------------------------- baseline ROUGE-1 Average_R: 0.35731 (95%-conf.int. 0.32039 - 0.40625) baseline ROUGE-1 Average_P: 0.09004 (95%-conf.int. 0.08167 - 0.10167) baseline ROUGE-1 Average_F: 0.14330 (95%-conf.int. 0.13186 - 0.15730) --------------------------------------------- baseline ROUGE-2 Average_R: 0.06626 (95%-conf.int. 0.03929 - 0.09685) baseline ROUGE-2 Average_P: 0.01532 (95%-conf.int. 0.01020 - 0.02041) baseline ROUGE-2 Average_F: 0.02475 (95%-conf.int. 0.01618 - 0.03312) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.12165 (95%-conf.int. 0.09626 - 0.16175) baseline ROUGE-SU* Average_P: 0.00903 (95%-conf.int. 0.00817 - 0.01053) baseline ROUGE-SU* Average_F: 0.01671 (95%-conf.int. 0.01518 - 0.01917) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.21523 (95%-conf.int. 0.20971 - 0.22040) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.40014 (95%-conf.int. 0.38500 - 0.41500) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.27979 (95%-conf.int. 0.27257 - 0.28426) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.07954 (95%-conf.int. 0.07398 - 0.08515) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.15554 (95%-conf.int. 0.14445 - 0.16667) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.10521 (95%-conf.int. 0.09779 - 0.11267) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.04509 (95%-conf.int. 0.04341 - 0.04662) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.15560 (95%-conf.int. 0.15093 - 0.16111) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.06986 (95%-conf.int. 0.06808 - 0.07164) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.22602 (95%-conf.int. 0.21552 - 0.23417) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.60035 (95%-conf.int. 0.56428 - 0.62857) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.32828 (95%-conf.int. 0.31287 - 0.33798) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.09094 (95%-conf.int. 0.08722 - 0.09670) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.26681 (95%-conf.int. 0.25000 - 0.28334) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.13559 (95%-conf.int. 0.12932 - 0.14414) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.05904 (95%-conf.int. 0.05606 - 0.06198) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.40766 (95%-conf.int. 0.38148 - 0.42963) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.10309 (95%-conf.int. 0.09824 - 0.10772) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.22602 (95%-conf.int. 0.21552 - 0.23417) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.60035 (95%-conf.int. 0.56428 - 0.62857) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.32828 (95%-conf.int. 0.31287 - 0.33798) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.09094 (95%-conf.int. 0.08722 - 0.09670) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.26681 (95%-conf.int. 0.25000 - 0.28334) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.13559 (95%-conf.int. 0.12932 - 0.14414) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.05904 (95%-conf.int. 0.05606 - 0.06198) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.40766 (95%-conf.int. 0.38148 - 0.42963) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.10309 (95%-conf.int. 0.09824 - 0.10772) --------------------------------------------- baseline ROUGE-1 Average_R: 0.48435 (95%-conf.int. 0.46985 - 0.49863) baseline ROUGE-1 Average_P: 0.13240 (95%-conf.int. 0.12794 - 0.13677) baseline ROUGE-1 Average_F: 0.20789 (95%-conf.int. 0.20174 - 0.21378) --------------------------------------------- baseline ROUGE-2 Average_R: 0.05693 (95%-conf.int. 0.05436 - 0.05848) baseline ROUGE-2 Average_P: 0.01493 (95%-conf.int. 0.01493 - 0.01493) baseline ROUGE-2 Average_F: 0.02365 (95%-conf.int. 0.02341 - 0.02379) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.20085 (95%-conf.int. 0.19173 - 0.20988) baseline ROUGE-SU* Average_P: 0.01596 (95%-conf.int. 0.01533 - 0.01652) baseline ROUGE-SU* Average_F: 0.02955 (95%-conf.int. 0.02846 - 0.03049) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.14448 (95%-conf.int. 0.13973 - 0.15007) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.55992 (95%-conf.int. 0.52000 - 0.60000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.22935 (95%-conf.int. 0.22213 - 0.23820) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.01718 (95%-conf.int. 0.01522 - 0.01952) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.27135 (95%-conf.int. 0.22857 - 0.30357) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.03224 (95%-conf.int. 0.02860 - 0.03644) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.11360 (95%-conf.int. 0.10963 - 0.11735) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.44006 (95%-conf.int. 0.41000 - 0.48000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.18031 (95%-conf.int. 0.17487 - 0.18577) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.01265 (95%-conf.int. 0.01150 - 0.01408) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.20011 (95%-conf.int. 0.17500 - 0.23214) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.02374 (95%-conf.int. 0.02162 - 0.02629) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.11360 (95%-conf.int. 0.10963 - 0.11735) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.44006 (95%-conf.int. 0.41000 - 0.48000) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.18031 (95%-conf.int. 0.17487 - 0.18577) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.01265 (95%-conf.int. 0.01150 - 0.01408) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.20011 (95%-conf.int. 0.17500 - 0.23214) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.02374 (95%-conf.int. 0.02162 - 0.02629) --------------------------------------------- baseline ROUGE-1 Average_R: 0.49218 (95%-conf.int. 0.46046 - 0.51451) baseline ROUGE-1 Average_P: 0.08974 (95%-conf.int. 0.07757 - 0.10093) baseline ROUGE-1 Average_F: 0.15163 (95%-conf.int. 0.13255 - 0.16871) --------------------------------------------- baseline ROUGE-2 Average_R: 0.05349 (95%-conf.int. 0.04258 - 0.06085) baseline ROUGE-2 Average_P: 0.00944 (95%-conf.int. 0.00708 - 0.01132) baseline ROUGE-2 Average_F: 0.01603 (95%-conf.int. 0.01210 - 0.01907) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.22900 (95%-conf.int. 0.20406 - 0.24406) baseline ROUGE-SU* Average_P: 0.00900 (95%-conf.int. 0.00690 - 0.01071) baseline ROUGE-SU* Average_F: 0.01731 (95%-conf.int. 0.01333 - 0.02051) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.15141 (95%-conf.int. 0.13902 - 0.16378) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.23089 (95%-conf.int. 0.21923 - 0.24231) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.18235 (95%-conf.int. 0.17241 - 0.19221) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.04267 (95%-conf.int. 0.03647 - 0.04889) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.06658 (95%-conf.int. 0.06250 - 0.07500) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.05186 (95%-conf.int. 0.04593 - 0.05912) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.02562 (95%-conf.int. 0.02197 - 0.02927) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.06893 (95%-conf.int. 0.06444 - 0.07278) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.03714 (95%-conf.int. 0.03315 - 0.04110) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.19161 (95%-conf.int. 0.17743 - 0.20538) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.42243 (95%-conf.int. 0.40556 - 0.43889) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.26295 (95%-conf.int. 0.25032 - 0.27550) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.04267 (95%-conf.int. 0.03647 - 0.04889) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.09987 (95%-conf.int. 0.09375 - 0.11250) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.05964 (95%-conf.int. 0.05238 - 0.06809) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.03700 (95%-conf.int. 0.03275 - 0.04123) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.20469 (95%-conf.int. 0.19204 - 0.21704) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.06242 (95%-conf.int. 0.05644 - 0.06837) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.19155 (95%-conf.int. 0.17710 - 0.20597) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.42245 (95%-conf.int. 0.40000 - 0.45000) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.26288 (95%-conf.int. 0.24824 - 0.27743) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.04267 (95%-conf.int. 0.03647 - 0.04889) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.09987 (95%-conf.int. 0.09375 - 0.11250) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.05964 (95%-conf.int. 0.05238 - 0.06809) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.03699 (95%-conf.int. 0.03240 - 0.04156) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.20471 (95%-conf.int. 0.18977 - 0.22387) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.06241 (95%-conf.int. 0.05579 - 0.06899) --------------------------------------------- baseline ROUGE-1 Average_R: 0.43206 (95%-conf.int. 0.41276 - 0.44887) baseline ROUGE-1 Average_P: 0.06994 (95%-conf.int. 0.06585 - 0.07398) baseline ROUGE-1 Average_F: 0.12021 (95%-conf.int. 0.11485 - 0.12553) --------------------------------------------- baseline ROUGE-2 Average_R: 0.03132 (95%-conf.int. 0.02887 - 0.03372) baseline ROUGE-2 Average_P: 0.00493 (95%-conf.int. 0.00410 - 0.00574) baseline ROUGE-2 Average_F: 0.00850 (95%-conf.int. 0.00718 - 0.00980) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.14970 (95%-conf.int. 0.14071 - 0.15847) baseline ROUGE-SU* Average_P: 0.00483 (95%-conf.int. 0.00436 - 0.00530) baseline ROUGE-SU* Average_F: 0.00935 (95%-conf.int. 0.00848 - 0.01021) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.14532 (95%-conf.int. 0.13640 - 0.16044) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.68011 (95%-conf.int. 0.66000 - 0.70000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.23915 (95%-conf.int. 0.22773 - 0.25996) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.03615 (95%-conf.int. 0.03091 - 0.04446) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.20009 (95%-conf.int. 0.18750 - 0.22500) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.06115 (95%-conf.int. 0.05306 - 0.07414) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.02246 (95%-conf.int. 0.01930 - 0.02821) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.50014 (95%-conf.int. 0.47500 - 0.52500) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.04290 (95%-conf.int. 0.03718 - 0.05339) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.14532 (95%-conf.int. 0.13640 - 0.16044) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.68011 (95%-conf.int. 0.66000 - 0.70000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.23915 (95%-conf.int. 0.22773 - 0.25996) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.03615 (95%-conf.int. 0.03091 - 0.04446) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.20009 (95%-conf.int. 0.18750 - 0.22500) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.06115 (95%-conf.int. 0.05306 - 0.07414) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.02246 (95%-conf.int. 0.01930 - 0.02821) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.50014 (95%-conf.int. 0.47500 - 0.52500) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.04290 (95%-conf.int. 0.03718 - 0.05339) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.14532 (95%-conf.int. 0.13640 - 0.16044) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.68011 (95%-conf.int. 0.66000 - 0.70000) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.23915 (95%-conf.int. 0.22773 - 0.25996) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.03615 (95%-conf.int. 0.03091 - 0.04446) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.20009 (95%-conf.int. 0.18750 - 0.22500) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.06115 (95%-conf.int. 0.05306 - 0.07414) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.02246 (95%-conf.int. 0.01930 - 0.02821) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.50014 (95%-conf.int. 0.47500 - 0.52500) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.04290 (95%-conf.int. 0.03718 - 0.05339) --------------------------------------------- baseline ROUGE-1 Average_R: 0.55071 (95%-conf.int. 0.53817 - 0.56299) baseline ROUGE-1 Average_P: 0.10401 (95%-conf.int. 0.09520 - 0.11280) baseline ROUGE-1 Average_F: 0.17477 (95%-conf.int. 0.16195 - 0.18757) --------------------------------------------- baseline ROUGE-2 Average_R: 0.15021 (95%-conf.int. 0.13899 - 0.16087) baseline ROUGE-2 Average_P: 0.02742 (95%-conf.int. 0.02419 - 0.03065) baseline ROUGE-2 Average_F: 0.04632 (95%-conf.int. 0.04121 - 0.05141) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.30496 (95%-conf.int. 0.29330 - 0.31637) baseline ROUGE-SU* Average_P: 0.01257 (95%-conf.int. 0.01075 - 0.01439) baseline ROUGE-SU* Average_F: 0.02412 (95%-conf.int. 0.02074 - 0.02749) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.28177 (95%-conf.int. 0.25321 - 0.31034) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.42872 (95%-conf.int. 0.39285 - 0.46428) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.33885 (95%-conf.int. 0.31642 - 0.36000) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.13057 (95%-conf.int. 0.10728 - 0.15385) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.20842 (95%-conf.int. 0.18056 - 0.22222) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.15988 (95%-conf.int. 0.13790 - 0.18182) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.08313 (95%-conf.int. 0.07072 - 0.09551) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.21311 (95%-conf.int. 0.18210 - 0.24691) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.11882 (95%-conf.int. 0.10329 - 0.13128) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.28177 (95%-conf.int. 0.25321 - 0.31034) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.42872 (95%-conf.int. 0.39285 - 0.46428) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.33885 (95%-conf.int. 0.31642 - 0.36000) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.13057 (95%-conf.int. 0.10728 - 0.15385) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.20842 (95%-conf.int. 0.18056 - 0.22222) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.15988 (95%-conf.int. 0.13790 - 0.18182) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.08313 (95%-conf.int. 0.07072 - 0.09551) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.21311 (95%-conf.int. 0.18210 - 0.24691) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.11882 (95%-conf.int. 0.10329 - 0.13128) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.32745 (95%-conf.int. 0.31010 - 0.34483) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.38902 (95%-conf.int. 0.37037 - 0.42592) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.35426 (95%-conf.int. 0.34352 - 0.36201) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.13057 (95%-conf.int. 0.10728 - 0.15385) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.15632 (95%-conf.int. 0.13542 - 0.16667) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.14166 (95%-conf.int. 0.12327 - 0.16000) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.09701 (95%-conf.int. 0.08726 - 0.10674) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.15350 (95%-conf.int. 0.13826 - 0.17803) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.11797 (95%-conf.int. 0.10801 - 0.12324) --------------------------------------------- baseline ROUGE-1 Average_R: 0.44418 (95%-conf.int. 0.40194 - 0.48276) baseline ROUGE-1 Average_P: 0.07312 (95%-conf.int. 0.06410 - 0.08333) baseline ROUGE-1 Average_F: 0.12532 (95%-conf.int. 0.11055 - 0.14022) --------------------------------------------- baseline ROUGE-2 Average_R: 0.05400 (95%-conf.int. 0.03113 - 0.07692) baseline ROUGE-2 Average_P: 0.00781 (95%-conf.int. 0.00521 - 0.01042) baseline ROUGE-2 Average_F: 0.01363 (95%-conf.int. 0.00891 - 0.01835) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.18021 (95%-conf.int. 0.14874 - 0.20787) baseline ROUGE-SU* Average_P: 0.00583 (95%-conf.int. 0.00470 - 0.00703) baseline ROUGE-SU* Average_F: 0.01129 (95%-conf.int. 0.00911 - 0.01353) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.43417 (95%-conf.int. 0.37894 - 0.51207) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.52040 (95%-conf.int. 0.47500 - 0.56500) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.47108 (95%-conf.int. 0.42126 - 0.52308) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.18394 (95%-conf.int. 0.13765 - 0.23208) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.22229 (95%-conf.int. 0.17778 - 0.25556) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.20009 (95%-conf.int. 0.15707 - 0.23532) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.17983 (95%-conf.int. 0.12679 - 0.26457) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.29302 (95%-conf.int. 0.24722 - 0.33333) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.21694 (95%-conf.int. 0.16742 - 0.28386) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.36835 (95%-conf.int. 0.31313 - 0.44118) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.55034 (95%-conf.int. 0.50000 - 0.60000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.43921 (95%-conf.int. 0.38995 - 0.49921) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.11208 (95%-conf.int. 0.06143 - 0.15431) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.17134 (95%-conf.int. 0.10000 - 0.21429) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.13471 (95%-conf.int. 0.07634 - 0.17660) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.12657 (95%-conf.int. 0.08186 - 0.19072) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.31461 (95%-conf.int. 0.25000 - 0.36714) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.17622 (95%-conf.int. 0.12498 - 0.24367) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.64609 (95%-conf.int. 0.60371 - 0.70888) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.13003 (95%-conf.int. 0.12250 - 0.13833) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.21597 (95%-conf.int. 0.20601 - 0.22687) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.16615 (95%-conf.int. 0.12528 - 0.21861) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.03057 (95%-conf.int. 0.02458 - 0.03644) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.05146 (95%-conf.int. 0.04107 - 0.06097) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.38333 (95%-conf.int. 0.32171 - 0.48570) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.01926 (95%-conf.int. 0.01736 - 0.02113) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.03653 (95%-conf.int. 0.03306 - 0.03997) --------------------------------------------- baseline ROUGE-1 Average_R: 0.72453 (95%-conf.int. 0.69587 - 0.75347) baseline ROUGE-1 Average_P: 0.05397 (95%-conf.int. 0.04877 - 0.05798) baseline ROUGE-1 Average_F: 0.10035 (95%-conf.int. 0.09140 - 0.10722) --------------------------------------------- baseline ROUGE-2 Average_R: 0.12657 (95%-conf.int. 0.11410 - 0.14063) baseline ROUGE-2 Average_P: 0.00864 (95%-conf.int. 0.00803 - 0.00926) baseline ROUGE-2 Average_F: 0.01616 (95%-conf.int. 0.01502 - 0.01727) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.43835 (95%-conf.int. 0.40055 - 0.48303) baseline ROUGE-SU* Average_P: 0.00312 (95%-conf.int. 0.00254 - 0.00355) baseline ROUGE-SU* Average_F: 0.00620 (95%-conf.int. 0.00506 - 0.00704) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.36364 (95%-conf.int. 0.33199 - 0.40641) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.53118 (95%-conf.int. 0.50000 - 0.56250) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.43114 (95%-conf.int. 0.40814 - 0.47187) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.16434 (95%-conf.int. 0.14168 - 0.19032) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.24993 (95%-conf.int. 0.23810 - 0.27381) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.19799 (95%-conf.int. 0.17739 - 0.22454) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.13378 (95%-conf.int. 0.11619 - 0.16098) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.29281 (95%-conf.int. 0.26190 - 0.32381) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.18309 (95%-conf.int. 0.16509 - 0.21449) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.36364 (95%-conf.int. 0.33199 - 0.40641) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.53118 (95%-conf.int. 0.50000 - 0.56250) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.43114 (95%-conf.int. 0.40814 - 0.47187) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.16434 (95%-conf.int. 0.14168 - 0.19032) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.24993 (95%-conf.int. 0.23810 - 0.27381) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.19799 (95%-conf.int. 0.17739 - 0.22454) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.13378 (95%-conf.int. 0.11619 - 0.16098) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.29281 (95%-conf.int. 0.26190 - 0.32381) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.18309 (95%-conf.int. 0.16509 - 0.21449) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.59812 (95%-conf.int. 0.55956 - 0.64706) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.31817 (95%-conf.int. 0.30303 - 0.33333) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.41483 (95%-conf.int. 0.40000 - 0.43333) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.23488 (95%-conf.int. 0.20641 - 0.26237) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.11905 (95%-conf.int. 0.11111 - 0.12698) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.15779 (95%-conf.int. 0.14447 - 0.17112) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.33160 (95%-conf.int. 0.29837 - 0.38240) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.10119 (95%-conf.int. 0.09193 - 0.11045) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.15463 (95%-conf.int. 0.14227 - 0.16703) --------------------------------------------- baseline ROUGE-1 Average_R: 0.34264 (95%-conf.int. 0.30588 - 0.39906) baseline ROUGE-1 Average_P: 0.06059 (95%-conf.int. 0.05556 - 0.06692) baseline ROUGE-1 Average_F: 0.10290 (95%-conf.int. 0.09483 - 0.11462) --------------------------------------------- baseline ROUGE-2 Average_R: 0.07071 (95%-conf.int. 0.05667 - 0.09113) baseline ROUGE-2 Average_P: 0.01153 (95%-conf.int. 0.01026 - 0.01410) baseline ROUGE-2 Average_F: 0.01981 (95%-conf.int. 0.01736 - 0.02442) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.12748 (95%-conf.int. 0.10864 - 0.15871) baseline ROUGE-SU* Average_P: 0.00441 (95%-conf.int. 0.00384 - 0.00505) baseline ROUGE-SU* Average_F: 0.00852 (95%-conf.int. 0.00745 - 0.00979) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.23265 (95%-conf.int. 0.19575 - 0.26953) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.49977 (95%-conf.int. 0.44792 - 0.54167) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.31623 (95%-conf.int. 0.27771 - 0.35478) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.05979 (95%-conf.int. 0.05346 - 0.06581) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.14271 (95%-conf.int. 0.10715 - 0.17857) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.08394 (95%-conf.int. 0.07124 - 0.09555) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.05156 (95%-conf.int. 0.03809 - 0.06501) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.25693 (95%-conf.int. 0.21667 - 0.29524) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.08506 (95%-conf.int. 0.06570 - 0.10430) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.23225 (95%-conf.int. 0.20484 - 0.26172) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.49989 (95%-conf.int. 0.46875 - 0.53125) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.31587 (95%-conf.int. 0.28994 - 0.34424) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.07513 (95%-conf.int. 0.06710 - 0.08231) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.17849 (95%-conf.int. 0.14286 - 0.21429) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.10529 (95%-conf.int. 0.09330 - 0.11835) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.05432 (95%-conf.int. 0.04363 - 0.06920) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.27135 (95%-conf.int. 0.25238 - 0.29524) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.08963 (95%-conf.int. 0.07532 - 0.10949) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.49175 (95%-conf.int. 0.45161 - 0.53679) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.02833 (95%-conf.int. 0.02722 - 0.02944) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.05353 (95%-conf.int. 0.05177 - 0.05529) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.13700 (95%-conf.int. 0.12964 - 0.14616) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.00752 (95%-conf.int. 0.00669 - 0.00836) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.01425 (95%-conf.int. 0.01276 - 0.01576) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.23101 (95%-conf.int. 0.19951 - 0.27674) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.00091 (95%-conf.int. 0.00084 - 0.00097) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.00181 (95%-conf.int. 0.00167 - 0.00194) --------------------------------------------- baseline ROUGE-1 Average_R: 0.40107 (95%-conf.int. 0.38355 - 0.41839) baseline ROUGE-1 Average_P: 0.05690 (95%-conf.int. 0.05013 - 0.06369) baseline ROUGE-1 Average_F: 0.09947 (95%-conf.int. 0.08904 - 0.10993) --------------------------------------------- baseline ROUGE-2 Average_R: 0.06041 (95%-conf.int. 0.05042 - 0.07040) baseline ROUGE-2 Average_P: 0.00820 (95%-conf.int. 0.00614 - 0.01025) baseline ROUGE-2 Average_F: 0.01441 (95%-conf.int. 0.01101 - 0.01789) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.15865 (95%-conf.int. 0.14768 - 0.16982) baseline ROUGE-SU* Average_P: 0.00380 (95%-conf.int. 0.00309 - 0.00452) baseline ROUGE-SU* Average_F: 0.00742 (95%-conf.int. 0.00605 - 0.00880) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.28582 (95%-conf.int. 0.27545 - 0.29654) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.39995 (95%-conf.int. 0.36875 - 0.43125) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.33276 (95%-conf.int. 0.31658 - 0.34614) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.05791 (95%-conf.int. 0.04241 - 0.07081) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.08560 (95%-conf.int. 0.05714 - 0.10714) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.06893 (95%-conf.int. 0.04868 - 0.08515) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.09545 (95%-conf.int. 0.08977 - 0.10237) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.20561 (95%-conf.int. 0.17714 - 0.23285) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.12982 (95%-conf.int. 0.11996 - 0.13963) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.28585 (95%-conf.int. 0.26171 - 0.30691) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.35555 (95%-conf.int. 0.31667 - 0.38333) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.31631 (95%-conf.int. 0.28688 - 0.33422) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.03863 (95%-conf.int. 0.01877 - 0.05175) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.04996 (95%-conf.int. 0.02500 - 0.06250) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.04347 (95%-conf.int. 0.02139 - 0.05650) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.10339 (95%-conf.int. 0.08469 - 0.11958) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.17725 (95%-conf.int. 0.13977 - 0.20114) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.13000 (95%-conf.int. 0.10500 - 0.14647) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.28694 (95%-conf.int. 0.27055 - 0.31486) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.24628 (95%-conf.int. 0.22692 - 0.26538) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.26454 (95%-conf.int. 0.24683 - 0.28701) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.03863 (95%-conf.int. 0.01877 - 0.05175) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.03331 (95%-conf.int. 0.01667 - 0.04167) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.03569 (95%-conf.int. 0.01762 - 0.04607) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.10176 (95%-conf.int. 0.08996 - 0.11996) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.08450 (95%-conf.int. 0.07167 - 0.09555) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.09186 (95%-conf.int. 0.07939 - 0.10486) --------------------------------------------- baseline ROUGE-1 Average_R: 0.56930 (95%-conf.int. 0.54555 - 0.59324) baseline ROUGE-1 Average_P: 0.06737 (95%-conf.int. 0.05947 - 0.07527) baseline ROUGE-1 Average_F: 0.12039 (95%-conf.int. 0.10725 - 0.13354) --------------------------------------------- baseline ROUGE-2 Average_R: 0.13688 (95%-conf.int. 0.10408 - 0.16894) baseline ROUGE-2 Average_P: 0.01491 (95%-conf.int. 0.01117 - 0.01809) baseline ROUGE-2 Average_F: 0.02687 (95%-conf.int. 0.02016 - 0.03252) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.36108 (95%-conf.int. 0.33945 - 0.38036) baseline ROUGE-SU* Average_P: 0.00601 (95%-conf.int. 0.00500 - 0.00702) baseline ROUGE-SU* Average_F: 0.01182 (95%-conf.int. 0.00986 - 0.01378) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.27209 (95%-conf.int. 0.26090 - 0.28337) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.59974 (95%-conf.int. 0.55000 - 0.65000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.37413 (95%-conf.int. 0.35377 - 0.39468) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.11974 (95%-conf.int. 0.10732 - 0.13006) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.28555 (95%-conf.int. 0.24286 - 0.32143) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.16863 (95%-conf.int. 0.14878 - 0.18518) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.08338 (95%-conf.int. 0.07805 - 0.08801) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.40540 (95%-conf.int. 0.35000 - 0.46286) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.13809 (95%-conf.int. 0.12764 - 0.14782) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.27209 (95%-conf.int. 0.26090 - 0.28337) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.59974 (95%-conf.int. 0.55000 - 0.65000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.37413 (95%-conf.int. 0.35377 - 0.39468) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.11974 (95%-conf.int. 0.10732 - 0.13006) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.28555 (95%-conf.int. 0.24286 - 0.32143) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.16863 (95%-conf.int. 0.14878 - 0.18518) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.08338 (95%-conf.int. 0.07805 - 0.08801) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.40540 (95%-conf.int. 0.35000 - 0.46286) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.13809 (95%-conf.int. 0.12764 - 0.14782) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.45475 (95%-conf.int. 0.42902 - 0.48243) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.44447 (95%-conf.int. 0.40833 - 0.46944) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.44925 (95%-conf.int. 0.41856 - 0.47499) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.19291 (95%-conf.int. 0.17950 - 0.20428) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.18831 (95%-conf.int. 0.17059 - 0.20294) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.19044 (95%-conf.int. 0.17680 - 0.20320) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.22055 (95%-conf.int. 0.19719 - 0.24516) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.21999 (95%-conf.int. 0.18471 - 0.24118) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.21971 (95%-conf.int. 0.18963 - 0.24246) --------------------------------------------- baseline ROUGE-1 Average_R: 0.43145 (95%-conf.int. 0.42228 - 0.44273) baseline ROUGE-1 Average_P: 0.06907 (95%-conf.int. 0.06454 - 0.07318) baseline ROUGE-1 Average_F: 0.11904 (95%-conf.int. 0.11194 - 0.12544) --------------------------------------------- baseline ROUGE-2 Average_R: 0.04774 (95%-conf.int. 0.03996 - 0.05557) baseline ROUGE-2 Average_P: 0.00733 (95%-conf.int. 0.00596 - 0.00871) baseline ROUGE-2 Average_F: 0.01270 (95%-conf.int. 0.01037 - 0.01506) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.15647 (95%-conf.int. 0.14574 - 0.16458) baseline ROUGE-SU* Average_P: 0.00435 (95%-conf.int. 0.00371 - 0.00480) baseline ROUGE-SU* Average_F: 0.00847 (95%-conf.int. 0.00723 - 0.00932) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.25826 (95%-conf.int. 0.21598 - 0.29858) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.50017 (95%-conf.int. 0.45238 - 0.54762) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.34019 (95%-conf.int. 0.29175 - 0.38644) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.07906 (95%-conf.int. 0.07197 - 0.08452) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.16667 (95%-conf.int. 0.16667 - 0.16667) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.10707 (95%-conf.int. 0.10035 - 0.11216) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.06799 (95%-conf.int. 0.05268 - 0.08310) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.25938 (95%-conf.int. 0.22531 - 0.29321) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.10748 (95%-conf.int. 0.08501 - 0.12918) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.25826 (95%-conf.int. 0.21598 - 0.29858) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.50017 (95%-conf.int. 0.45238 - 0.54762) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.34019 (95%-conf.int. 0.29175 - 0.38644) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.07906 (95%-conf.int. 0.07197 - 0.08452) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.16667 (95%-conf.int. 0.16667 - 0.16667) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.10707 (95%-conf.int. 0.10035 - 0.11216) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.06799 (95%-conf.int. 0.05268 - 0.08310) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.25938 (95%-conf.int. 0.22531 - 0.29321) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.10748 (95%-conf.int. 0.08501 - 0.12918) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.33116 (95%-conf.int. 0.28123 - 0.36369) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.40918 (95%-conf.int. 0.37879 - 0.42424) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.36552 (95%-conf.int. 0.32222 - 0.39163) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.07906 (95%-conf.int. 0.07197 - 0.08452) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.10000 (95%-conf.int. 0.10000 - 0.10000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.08814 (95%-conf.int. 0.08354 - 0.09161) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.10668 (95%-conf.int. 0.08090 - 0.12562) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.16932 (95%-conf.int. 0.14487 - 0.18590) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.13043 (95%-conf.int. 0.10280 - 0.14863) --------------------------------------------- baseline ROUGE-1 Average_R: 0.56596 (95%-conf.int. 0.53425 - 0.60783) baseline ROUGE-1 Average_P: 0.08804 (95%-conf.int. 0.08333 - 0.09280) baseline ROUGE-1 Average_F: 0.15225 (95%-conf.int. 0.14474 - 0.15986) --------------------------------------------- baseline ROUGE-2 Average_R: 0.15822 (95%-conf.int. 0.13762 - 0.18472) baseline ROUGE-2 Average_P: 0.02298 (95%-conf.int. 0.02012 - 0.02586) baseline ROUGE-2 Average_F: 0.04009 (95%-conf.int. 0.03529 - 0.04531) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.32041 (95%-conf.int. 0.29119 - 0.36705) baseline ROUGE-SU* Average_P: 0.00861 (95%-conf.int. 0.00775 - 0.00949) baseline ROUGE-SU* Average_F: 0.01677 (95%-conf.int. 0.01509 - 0.01846) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.33948 (95%-conf.int. 0.30672 - 0.38977) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.65629 (95%-conf.int. 0.60417 - 0.70833) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.44462 (95%-conf.int. 0.41184 - 0.48259) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.17496 (95%-conf.int. 0.14546 - 0.22028) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.35728 (95%-conf.int. 0.30952 - 0.38095) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.23308 (95%-conf.int. 0.19776 - 0.27701) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.10711 (95%-conf.int. 0.07809 - 0.15283) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.43582 (95%-conf.int. 0.37381 - 0.49524) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.16798 (95%-conf.int. 0.12907 - 0.22207) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.33788 (95%-conf.int. 0.31355 - 0.37319) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.65615 (95%-conf.int. 0.60417 - 0.70833) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.44333 (95%-conf.int. 0.42947 - 0.45760) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.19315 (95%-conf.int. 0.16303 - 0.24536) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.39294 (95%-conf.int. 0.35714 - 0.42857) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.25698 (95%-conf.int. 0.22367 - 0.30714) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.10351 (95%-conf.int. 0.08507 - 0.13756) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.42852 (95%-conf.int. 0.38095 - 0.47619) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.16312 (95%-conf.int. 0.14198 - 0.19964) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.45417 (95%-conf.int. 0.40192 - 0.51763) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.31813 (95%-conf.int. 0.30682 - 0.32954) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.37170 (95%-conf.int. 0.35978 - 0.38503) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.15742 (95%-conf.int. 0.13668 - 0.19397) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.10717 (95%-conf.int. 0.09921 - 0.11111) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.12650 (95%-conf.int. 0.11616 - 0.13981) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.18743 (95%-conf.int. 0.14583 - 0.25770) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.10612 (95%-conf.int. 0.09986 - 0.11408) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.13162 (95%-conf.int. 0.12386 - 0.14175) --------------------------------------------- baseline ROUGE-1 Average_R: 0.54339 (95%-conf.int. 0.52325 - 0.56932) baseline ROUGE-1 Average_P: 0.09546 (95%-conf.int. 0.08427 - 0.10486) baseline ROUGE-1 Average_F: 0.16188 (95%-conf.int. 0.14628 - 0.17502) --------------------------------------------- baseline ROUGE-2 Average_R: 0.15742 (95%-conf.int. 0.13668 - 0.19397) baseline ROUGE-2 Average_P: 0.02558 (95%-conf.int. 0.02368 - 0.02652) baseline ROUGE-2 Average_F: 0.04384 (95%-conf.int. 0.04050 - 0.04644) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.28307 (95%-conf.int. 0.26509 - 0.31650) baseline ROUGE-SU* Average_P: 0.01079 (95%-conf.int. 0.00834 - 0.01277) baseline ROUGE-SU* Average_F: 0.02072 (95%-conf.int. 0.01619 - 0.02438) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.19167 (95%-conf.int. 0.17725 - 0.20338) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.51412 (95%-conf.int. 0.48572 - 0.53571) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.27917 (95%-conf.int. 0.26072 - 0.29372) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.03377 (95%-conf.int. 0.02251 - 0.04279) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.09981 (95%-conf.int. 0.06667 - 0.12500) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.05046 (95%-conf.int. 0.03365 - 0.06374) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.04167 (95%-conf.int. 0.03524 - 0.04693) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.28869 (95%-conf.int. 0.25741 - 0.31296) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.07280 (95%-conf.int. 0.06193 - 0.08158) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.25550 (95%-conf.int. 0.24153 - 0.26985) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.59984 (95%-conf.int. 0.57500 - 0.62500) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.35828 (95%-conf.int. 0.34115 - 0.37686) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.04496 (95%-conf.int. 0.03335 - 0.05705) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.11410 (95%-conf.int. 0.08572 - 0.14286) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.06449 (95%-conf.int. 0.04801 - 0.08152) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.06723 (95%-conf.int. 0.05904 - 0.07587) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.35973 (95%-conf.int. 0.32571 - 0.39428) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.11322 (95%-conf.int. 0.09984 - 0.12700) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.37264 (95%-conf.int. 0.36180 - 0.38258) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.25000 (95%-conf.int. 0.25000 - 0.25000) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.29917 (95%-conf.int. 0.29568 - 0.30239) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.10128 (95%-conf.int. 0.09131 - 0.11211) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.06662 (95%-conf.int. 0.06111 - 0.07222) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.08035 (95%-conf.int. 0.07304 - 0.08782) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.14189 (95%-conf.int. 0.13346 - 0.15342) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.06566 (95%-conf.int. 0.06407 - 0.06778) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.08970 (95%-conf.int. 0.08736 - 0.09384) --------------------------------------------- baseline ROUGE-1 Average_R: 0.45752 (95%-conf.int. 0.44226 - 0.47372) baseline ROUGE-1 Average_P: 0.10750 (95%-conf.int. 0.10375 - 0.11063) baseline ROUGE-1 Average_F: 0.17407 (95%-conf.int. 0.16800 - 0.17888) --------------------------------------------- baseline ROUGE-2 Average_R: 0.06735 (95%-conf.int. 0.05859 - 0.07528) baseline ROUGE-2 Average_P: 0.01519 (95%-conf.int. 0.01329 - 0.01709) baseline ROUGE-2 Average_F: 0.02478 (95%-conf.int. 0.02167 - 0.02787) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.18834 (95%-conf.int. 0.17928 - 0.20022) baseline ROUGE-SU* Average_P: 0.01093 (95%-conf.int. 0.01030 - 0.01155) baseline ROUGE-SU* Average_F: 0.02066 (95%-conf.int. 0.01947 - 0.02178) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.16958 (95%-conf.int. 0.15991 - 0.18224) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.45727 (95%-conf.int. 0.43571 - 0.47857) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.24681 (95%-conf.int. 0.23756 - 0.25775) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.06661 (95%-conf.int. 0.06173 - 0.07183) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.20015 (95%-conf.int. 0.17500 - 0.22500) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.09971 (95%-conf.int. 0.09200 - 0.10891) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.03341 (95%-conf.int. 0.03013 - 0.03792) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.26671 (95%-conf.int. 0.23889 - 0.28889) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.05907 (95%-conf.int. 0.05456 - 0.06549) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.22262 (95%-conf.int. 0.20841 - 0.24019) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.52526 (95%-conf.int. 0.49375 - 0.55625) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.31191 (95%-conf.int. 0.29590 - 0.32763) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.06661 (95%-conf.int. 0.06173 - 0.07183) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.17156 (95%-conf.int. 0.15000 - 0.19286) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.09571 (95%-conf.int. 0.08829 - 0.10466) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.05303 (95%-conf.int. 0.04699 - 0.06201) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.32595 (95%-conf.int. 0.28714 - 0.36143) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.09061 (95%-conf.int. 0.08172 - 0.10286) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.33937 (95%-conf.int. 0.31971 - 0.36365) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.25600 (95%-conf.int. 0.24600 - 0.26400) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.29099 (95%-conf.int. 0.28828 - 0.29327) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.09981 (95%-conf.int. 0.09291 - 0.10765) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.07505 (95%-conf.int. 0.06458 - 0.08542) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.08541 (95%-conf.int. 0.07679 - 0.09487) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.11835 (95%-conf.int. 0.10663 - 0.13649) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.07840 (95%-conf.int. 0.07160 - 0.08395) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.09320 (95%-conf.int. 0.09195 - 0.09444) --------------------------------------------- baseline ROUGE-1 Average_R: 0.46409 (95%-conf.int. 0.43825 - 0.48909) baseline ROUGE-1 Average_P: 0.10609 (95%-conf.int. 0.09518 - 0.11627) baseline ROUGE-1 Average_F: 0.17240 (95%-conf.int. 0.15679 - 0.18677) --------------------------------------------- baseline ROUGE-2 Average_R: 0.10001 (95%-conf.int. 0.09073 - 0.10916) baseline ROUGE-2 Average_P: 0.02197 (95%-conf.int. 0.01890 - 0.02500) baseline ROUGE-2 Average_F: 0.03596 (95%-conf.int. 0.03127 - 0.04063) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.23243 (95%-conf.int. 0.21517 - 0.24924) baseline ROUGE-SU* Average_P: 0.01464 (95%-conf.int. 0.01208 - 0.01702) baseline ROUGE-SU* Average_F: 0.02749 (95%-conf.int. 0.02293 - 0.03175) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.20361 (95%-conf.int. 0.19580 - 0.21306) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.56667 (95%-conf.int. 0.52500 - 0.60000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.29865 (95%-conf.int. 0.29511 - 0.30321) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.06443 (95%-conf.int. 0.05762 - 0.07268) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.20000 (95%-conf.int. 0.20000 - 0.20000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.09713 (95%-conf.int. 0.08943 - 0.10648) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.04543 (95%-conf.int. 0.04126 - 0.05205) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.39998 (95%-conf.int. 0.35500 - 0.43750) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.08106 (95%-conf.int. 0.07531 - 0.09032) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.14426 (95%-conf.int. 0.13407 - 0.15674) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.47992 (95%-conf.int. 0.46000 - 0.50000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.22120 (95%-conf.int. 0.21027 - 0.23328) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.06443 (95%-conf.int. 0.05762 - 0.07268) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.25000 (95%-conf.int. 0.25000 - 0.25000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.10214 (95%-conf.int. 0.09362 - 0.11249) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.01724 (95%-conf.int. 0.01493 - 0.02102) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.21420 (95%-conf.int. 0.19286 - 0.23214) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.03174 (95%-conf.int. 0.02786 - 0.03789) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.14426 (95%-conf.int. 0.13407 - 0.15674) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.47992 (95%-conf.int. 0.46000 - 0.50000) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.22120 (95%-conf.int. 0.21027 - 0.23328) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.06443 (95%-conf.int. 0.05762 - 0.07268) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.25000 (95%-conf.int. 0.25000 - 0.25000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.10214 (95%-conf.int. 0.09362 - 0.11249) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.01724 (95%-conf.int. 0.01493 - 0.02102) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.21420 (95%-conf.int. 0.19286 - 0.23214) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.03174 (95%-conf.int. 0.02786 - 0.03789) --------------------------------------------- baseline ROUGE-1 Average_R: 0.53575 (95%-conf.int. 0.51221 - 0.55910) baseline ROUGE-1 Average_P: 0.10710 (95%-conf.int. 0.09524 - 0.11905) baseline ROUGE-1 Average_F: 0.17815 (95%-conf.int. 0.16104 - 0.19539) --------------------------------------------- baseline ROUGE-2 Average_R: 0.16668 (95%-conf.int. 0.14637 - 0.19428) baseline ROUGE-2 Average_P: 0.03132 (95%-conf.int. 0.02771 - 0.03433) baseline ROUGE-2 Average_F: 0.05259 (95%-conf.int. 0.04644 - 0.05773) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.25892 (95%-conf.int. 0.23808 - 0.27825) baseline ROUGE-SU* Average_P: 0.01305 (95%-conf.int. 0.01069 - 0.01534) baseline ROUGE-SU* Average_F: 0.02479 (95%-conf.int. 0.02054 - 0.02897) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.24423 (95%-conf.int. 0.23728 - 0.25320) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.45810 (95%-conf.int. 0.41667 - 0.50000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.31802 (95%-conf.int. 0.30278 - 0.33344) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.07113 (95%-conf.int. 0.05090 - 0.09099) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.14972 (95%-conf.int. 0.10000 - 0.20000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.09630 (95%-conf.int. 0.06720 - 0.12505) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.06509 (95%-conf.int. 0.06057 - 0.06780) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.23730 (95%-conf.int. 0.17917 - 0.27500) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.10159 (95%-conf.int. 0.09015 - 0.10838) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.28667 (95%-conf.int. 0.25536 - 0.30925) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.40597 (95%-conf.int. 0.33333 - 0.45833) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.33546 (95%-conf.int. 0.28828 - 0.36677) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.07113 (95%-conf.int. 0.05090 - 0.09099) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.10694 (95%-conf.int. 0.07143 - 0.14286) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.08530 (95%-conf.int. 0.05919 - 0.11116) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.10086 (95%-conf.int. 0.08113 - 0.11295) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.21403 (95%-conf.int. 0.14286 - 0.26190) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.13630 (95%-conf.int. 0.10270 - 0.15740) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.37827 (95%-conf.int. 0.35910 - 0.39483) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.42487 (95%-conf.int. 0.38334 - 0.45833) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.39944 (95%-conf.int. 0.38047 - 0.42009) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.11987 (95%-conf.int. 0.08995 - 0.14660) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.13874 (95%-conf.int. 0.09259 - 0.17593) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.12839 (95%-conf.int. 0.09108 - 0.15991) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.13465 (95%-conf.int. 0.12990 - 0.13941) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.18043 (95%-conf.int. 0.14352 - 0.20370) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.15298 (95%-conf.int. 0.13509 - 0.16435) --------------------------------------------- baseline ROUGE-1 Average_R: 0.44545 (95%-conf.int. 0.41834 - 0.46501) baseline ROUGE-1 Average_P: 0.07691 (95%-conf.int. 0.07051 - 0.08205) baseline ROUGE-1 Average_F: 0.13105 (95%-conf.int. 0.12184 - 0.13944) --------------------------------------------- baseline ROUGE-2 Average_R: 0.02320 (95%-conf.int. 0.00781 - 0.03175) baseline ROUGE-2 Average_P: 0.00390 (95%-conf.int. 0.00130 - 0.00521) baseline ROUGE-2 Average_F: 0.00668 (95%-conf.int. 0.00223 - 0.00895) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.19056 (95%-conf.int. 0.18181 - 0.19832) baseline ROUGE-SU* Average_P: 0.00641 (95%-conf.int. 0.00513 - 0.00711) baseline ROUGE-SU* Average_F: 0.01240 (95%-conf.int. 0.00999 - 0.01372) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.23270 (95%-conf.int. 0.21795 - 0.24695) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.32505 (95%-conf.int. 0.30833 - 0.33333) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.27113 (95%-conf.int. 0.25535 - 0.28370) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.03910 (95%-conf.int. 0.02470 - 0.05334) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.05567 (95%-conf.int. 0.03704 - 0.07407) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.04592 (95%-conf.int. 0.02963 - 0.06202) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.05293 (95%-conf.int. 0.04441 - 0.06145) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.10190 (95%-conf.int. 0.09105 - 0.11265) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.06959 (95%-conf.int. 0.06002 - 0.07943) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.16116 (95%-conf.int. 0.14733 - 0.17286) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.25006 (95%-conf.int. 0.23148 - 0.25926) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.19593 (95%-conf.int. 0.18004 - 0.20742) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.03597 (95%-conf.int. 0.03033 - 0.04085) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.08526 (95%-conf.int. 0.07386 - 0.09091) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.05054 (95%-conf.int. 0.04299 - 0.05636) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.32246 (95%-conf.int. 0.28885 - 0.36052) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.25005 (95%-conf.int. 0.23148 - 0.26852) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.28157 (95%-conf.int. 0.25699 - 0.30778) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.03910 (95%-conf.int. 0.02470 - 0.05334) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.02947 (95%-conf.int. 0.01961 - 0.03922) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.03360 (95%-conf.int. 0.02186 - 0.04520) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.09610 (95%-conf.int. 0.07764 - 0.11758) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.05884 (95%-conf.int. 0.05098 - 0.06667) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.07290 (95%-conf.int. 0.06154 - 0.08508) --------------------------------------------- baseline ROUGE-1 Average_R: 0.44571 (95%-conf.int. 0.42922 - 0.46771) baseline ROUGE-1 Average_P: 0.04110 (95%-conf.int. 0.03837 - 0.04441) baseline ROUGE-1 Average_F: 0.07526 (95%-conf.int. 0.07045 - 0.08111) --------------------------------------------- baseline ROUGE-2 Average_R: 0.03910 (95%-conf.int. 0.02470 - 0.05334) baseline ROUGE-2 Average_P: 0.00332 (95%-conf.int. 0.00221 - 0.00442) baseline ROUGE-2 Average_F: 0.00612 (95%-conf.int. 0.00405 - 0.00817) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.19184 (95%-conf.int. 0.18305 - 0.20305) baseline ROUGE-SU* Average_P: 0.00174 (95%-conf.int. 0.00155 - 0.00196) baseline ROUGE-SU* Average_F: 0.00345 (95%-conf.int. 0.00309 - 0.00388) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.10231 (95%-conf.int. 0.09378 - 0.10889) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.59986 (95%-conf.int. 0.57500 - 0.62500) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.17468 (95%-conf.int. 0.16147 - 0.18473) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.00864 (95%-conf.int. 0.00417 - 0.01128) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.06650 (95%-conf.int. 0.03333 - 0.08333) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.01529 (95%-conf.int. 0.00741 - 0.01986) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.01103 (95%-conf.int. 0.00940 - 0.01254) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.37773 (95%-conf.int. 0.35000 - 0.40556) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.02142 (95%-conf.int. 0.01830 - 0.02426) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.09314 (95%-conf.int. 0.08612 - 0.09957) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.43959 (95%-conf.int. 0.39000 - 0.48000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.15359 (95%-conf.int. 0.14152 - 0.16427) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.01729 (95%-conf.int. 0.00833 - 0.02256) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.09975 (95%-conf.int. 0.05000 - 0.12500) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.02945 (95%-conf.int. 0.01428 - 0.03822) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.01138 (95%-conf.int. 0.00976 - 0.01272) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.25680 (95%-conf.int. 0.20357 - 0.28929) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.02179 (95%-conf.int. 0.01863 - 0.02433) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.09314 (95%-conf.int. 0.08612 - 0.09957) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.43959 (95%-conf.int. 0.39000 - 0.48000) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.15359 (95%-conf.int. 0.14152 - 0.16427) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.01729 (95%-conf.int. 0.00833 - 0.02256) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.09975 (95%-conf.int. 0.05000 - 0.12500) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.02945 (95%-conf.int. 0.01428 - 0.03822) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.01138 (95%-conf.int. 0.00976 - 0.01272) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.25680 (95%-conf.int. 0.20357 - 0.28929) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.02179 (95%-conf.int. 0.01863 - 0.02433) --------------------------------------------- baseline ROUGE-1 Average_R: 0.54357 (95%-conf.int. 0.53041 - 0.55777) baseline ROUGE-1 Average_P: 0.13909 (95%-conf.int. 0.13369 - 0.14511) baseline ROUGE-1 Average_F: 0.22129 (95%-conf.int. 0.21511 - 0.22797) --------------------------------------------- baseline ROUGE-2 Average_R: 0.09693 (95%-conf.int. 0.09026 - 0.10238) baseline ROUGE-2 Average_P: 0.02415 (95%-conf.int. 0.02143 - 0.02692) baseline ROUGE-2 Average_F: 0.03863 (95%-conf.int. 0.03459 - 0.04256) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.25503 (95%-conf.int. 0.24104 - 0.27562) baseline ROUGE-SU* Average_P: 0.01860 (95%-conf.int. 0.01743 - 0.01991) baseline ROUGE-SU* Average_F: 0.03464 (95%-conf.int. 0.03261 - 0.03680) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.37133 (95%-conf.int. 0.33196 - 0.42266) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.40000 (95%-conf.int. 0.36666 - 0.42778) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.38363 (95%-conf.int. 0.35248 - 0.41993) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.23009 (95%-conf.int. 0.18399 - 0.28308) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.24992 (95%-conf.int. 0.20000 - 0.28125) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.23834 (95%-conf.int. 0.19150 - 0.27730) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.12237 (95%-conf.int. 0.08946 - 0.16398) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.16817 (95%-conf.int. 0.13068 - 0.19318) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.13954 (95%-conf.int. 0.10689 - 0.17004) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.33045 (95%-conf.int. 0.30098 - 0.37195) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.53338 (95%-conf.int. 0.50833 - 0.56666) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.40660 (95%-conf.int. 0.37964 - 0.43961) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.16162 (95%-conf.int. 0.14459 - 0.18872) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.27996 (95%-conf.int. 0.26000 - 0.30000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.20397 (95%-conf.int. 0.18691 - 0.22774) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.09560 (95%-conf.int. 0.08084 - 0.11889) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.29011 (95%-conf.int. 0.25750 - 0.32250) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.14221 (95%-conf.int. 0.12380 - 0.16643) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.39136 (95%-conf.int. 0.35840 - 0.43400) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.38011 (95%-conf.int. 0.35500 - 0.41000) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.38418 (95%-conf.int. 0.36165 - 0.40694) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.20763 (95%-conf.int. 0.17940 - 0.24782) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.19996 (95%-conf.int. 0.17778 - 0.22222) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.20267 (95%-conf.int. 0.17820 - 0.22803) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.13557 (95%-conf.int. 0.11112 - 0.17300) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.15189 (95%-conf.int. 0.13148 - 0.17223) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.14117 (95%-conf.int. 0.12167 - 0.16129) --------------------------------------------- baseline ROUGE-1 Average_R: 0.51412 (95%-conf.int. 0.48618 - 0.55460) baseline ROUGE-1 Average_P: 0.06175 (95%-conf.int. 0.05864 - 0.06543) baseline ROUGE-1 Average_F: 0.11010 (95%-conf.int. 0.10544 - 0.11535) --------------------------------------------- baseline ROUGE-2 Average_R: 0.23121 (95%-conf.int. 0.20496 - 0.27739) baseline ROUGE-2 Average_P: 0.02499 (95%-conf.int. 0.02250 - 0.02750) baseline ROUGE-2 Average_F: 0.04502 (95%-conf.int. 0.04057 - 0.04952) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.24857 (95%-conf.int. 0.22002 - 0.29479) baseline ROUGE-SU* Average_P: 0.00458 (95%-conf.int. 0.00416 - 0.00505) baseline ROUGE-SU* Average_F: 0.00899 (95%-conf.int. 0.00819 - 0.00989) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.26179 (95%-conf.int. 0.24551 - 0.28302) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.55539 (95%-conf.int. 0.53334 - 0.57778) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.35536 (95%-conf.int. 0.33731 - 0.37435) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.08872 (95%-conf.int. 0.07592 - 0.10236) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.19993 (95%-conf.int. 0.17500 - 0.21875) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.12270 (95%-conf.int. 0.10462 - 0.13790) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.07020 (95%-conf.int. 0.05968 - 0.08490) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.32257 (95%-conf.int. 0.29886 - 0.34659) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.11465 (95%-conf.int. 0.09937 - 0.13373) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.29341 (95%-conf.int. 0.26570 - 0.32289) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.62200 (95%-conf.int. 0.58333 - 0.66111) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.39817 (95%-conf.int. 0.36508 - 0.42698) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.09989 (95%-conf.int. 0.07996 - 0.11738) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.22492 (95%-conf.int. 0.18750 - 0.25000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.13811 (95%-conf.int. 0.11210 - 0.15815) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.08639 (95%-conf.int. 0.06878 - 0.10822) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.39526 (95%-conf.int. 0.34204 - 0.43750) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.14096 (95%-conf.int. 0.11521 - 0.17041) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.31472 (95%-conf.int. 0.29054 - 0.34874) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.49990 (95%-conf.int. 0.47500 - 0.52083) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.38567 (95%-conf.int. 0.35938 - 0.41428) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.09989 (95%-conf.int. 0.07996 - 0.11738) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.16358 (95%-conf.int. 0.13637 - 0.18182) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.12382 (95%-conf.int. 0.10080 - 0.14094) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.09349 (95%-conf.int. 0.07566 - 0.11750) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.24405 (95%-conf.int. 0.21494 - 0.26753) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.13413 (95%-conf.int. 0.11269 - 0.15962) --------------------------------------------- baseline ROUGE-1 Average_R: 0.49934 (95%-conf.int. 0.47452 - 0.52469) baseline ROUGE-1 Average_P: 0.11558 (95%-conf.int. 0.10542 - 0.12591) baseline ROUGE-1 Average_F: 0.18754 (95%-conf.int. 0.17275 - 0.20259) --------------------------------------------- baseline ROUGE-2 Average_R: 0.14164 (95%-conf.int. 0.12557 - 0.15481) baseline ROUGE-2 Average_P: 0.03168 (95%-conf.int. 0.02622 - 0.03598) baseline ROUGE-2 Average_F: 0.05174 (95%-conf.int. 0.04349 - 0.05839) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.25573 (95%-conf.int. 0.23196 - 0.27702) baseline ROUGE-SU* Average_P: 0.01519 (95%-conf.int. 0.01310 - 0.01731) baseline ROUGE-SU* Average_F: 0.02863 (95%-conf.int. 0.02484 - 0.03249) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.22861 (95%-conf.int. 0.20432 - 0.26293) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.60013 (95%-conf.int. 0.56875 - 0.64375) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.33072 (95%-conf.int. 0.30144 - 0.37304) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.13010 (95%-conf.int. 0.11655 - 0.15030) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.37147 (95%-conf.int. 0.35714 - 0.40000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.19247 (95%-conf.int. 0.17567 - 0.21828) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.05988 (95%-conf.int. 0.04895 - 0.07620) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.41727 (95%-conf.int. 0.38571 - 0.45857) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.10444 (95%-conf.int. 0.08704 - 0.13059) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.21880 (95%-conf.int. 0.20130 - 0.24946) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.65742 (95%-conf.int. 0.61428 - 0.70714) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.32797 (95%-conf.int. 0.30421 - 0.36830) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.14007 (95%-conf.int. 0.12524 - 0.16307) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.46671 (95%-conf.int. 0.43333 - 0.50833) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.21523 (95%-conf.int. 0.19524 - 0.24664) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.04908 (95%-conf.int. 0.04169 - 0.06212) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.44466 (95%-conf.int. 0.40556 - 0.48704) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.08818 (95%-conf.int. 0.07592 - 0.10997) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.27586 (95%-conf.int. 0.25366 - 0.31277) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.64461 (95%-conf.int. 0.61667 - 0.68333) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.38591 (95%-conf.int. 0.36208 - 0.42838) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.15022 (95%-conf.int. 0.13148 - 0.17624) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.37497 (95%-conf.int. 0.35000 - 0.41250) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.21422 (95%-conf.int. 0.19135 - 0.24568) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.07443 (95%-conf.int. 0.06248 - 0.09351) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.41371 (95%-conf.int. 0.38523 - 0.45341) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.12574 (95%-conf.int. 0.10818 - 0.15412) --------------------------------------------- baseline ROUGE-1 Average_R: 0.44590 (95%-conf.int. 0.41905 - 0.49145) baseline ROUGE-1 Average_P: 0.11064 (95%-conf.int. 0.10529 - 0.11589) baseline ROUGE-1 Average_F: 0.17713 (95%-conf.int. 0.16828 - 0.18578) --------------------------------------------- baseline ROUGE-2 Average_R: 0.10949 (95%-conf.int. 0.10286 - 0.11873) baseline ROUGE-2 Average_P: 0.02621 (95%-conf.int. 0.02500 - 0.02679) baseline ROUGE-2 Average_F: 0.04225 (95%-conf.int. 0.04032 - 0.04359) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.19443 (95%-conf.int. 0.17225 - 0.23348) baseline ROUGE-SU* Average_P: 0.01315 (95%-conf.int. 0.01210 - 0.01412) baseline ROUGE-SU* Average_F: 0.02459 (95%-conf.int. 0.02270 - 0.02637) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.40570 (95%-conf.int. 0.38112 - 0.43043) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.41664 (95%-conf.int. 0.40417 - 0.42917) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.41021 (95%-conf.int. 0.40010 - 0.42127) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.12618 (95%-conf.int. 0.09213 - 0.15715) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.12736 (95%-conf.int. 0.10455 - 0.15000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.12651 (95%-conf.int. 0.09786 - 0.15349) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.16003 (95%-conf.int. 0.14626 - 0.17418) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.19219 (95%-conf.int. 0.18182 - 0.20325) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.17385 (95%-conf.int. 0.16666 - 0.18117) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.32531 (95%-conf.int. 0.29662 - 0.34783) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.50000 (95%-conf.int. 0.50000 - 0.50000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.39338 (95%-conf.int. 0.37157 - 0.41026) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.10870 (95%-conf.int. 0.07857 - 0.13810) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.17142 (95%-conf.int. 0.13571 - 0.20715) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.13280 (95%-conf.int. 0.09924 - 0.16572) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.09578 (95%-conf.int. 0.08368 - 0.10631) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.25143 (95%-conf.int. 0.25000 - 0.25428) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.13824 (95%-conf.int. 0.12499 - 0.14969) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.43810 (95%-conf.int. 0.40858 - 0.46956) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.13844 (95%-conf.int. 0.13206 - 0.14359) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.21005 (95%-conf.int. 0.20199 - 0.21726) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.19623 (95%-conf.int. 0.15595 - 0.23334) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.05785 (95%-conf.int. 0.04868 - 0.06447) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.08920 (95%-conf.int. 0.07409 - 0.10103) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.19667 (95%-conf.int. 0.17543 - 0.21862) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.02335 (95%-conf.int. 0.02093 - 0.02516) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.04167 (95%-conf.int. 0.03762 - 0.04479) --------------------------------------------- baseline ROUGE-1 Average_R: 0.61513 (95%-conf.int. 0.59134 - 0.63913) baseline ROUGE-1 Average_P: 0.05508 (95%-conf.int. 0.05290 - 0.05833) baseline ROUGE-1 Average_F: 0.10104 (95%-conf.int. 0.09747 - 0.10635) --------------------------------------------- baseline ROUGE-2 Average_R: 0.14075 (95%-conf.int. 0.11892 - 0.16297) baseline ROUGE-2 Average_P: 0.01166 (95%-conf.int. 0.00985 - 0.01350) baseline ROUGE-2 Average_F: 0.02152 (95%-conf.int. 0.01819 - 0.02486) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.36169 (95%-conf.int. 0.34050 - 0.38460) baseline ROUGE-SU* Average_P: 0.00350 (95%-conf.int. 0.00326 - 0.00382) baseline ROUGE-SU* Average_F: 0.00694 (95%-conf.int. 0.00646 - 0.00756) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.31133 (95%-conf.int. 0.27559 - 0.36349) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.39995 (95%-conf.int. 0.39091 - 0.40909) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.34838 (95%-conf.int. 0.32430 - 0.38135) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.09301 (95%-conf.int. 0.07349 - 0.12264) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.11984 (95%-conf.int. 0.10500 - 0.13500) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.10405 (95%-conf.int. 0.08535 - 0.12782) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.08817 (95%-conf.int. 0.06633 - 0.12628) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.16611 (95%-conf.int. 0.15308 - 0.17769) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.11176 (95%-conf.int. 0.09247 - 0.14024) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.24045 (95%-conf.int. 0.21701 - 0.27960) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.42489 (95%-conf.int. 0.41250 - 0.43750) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.30564 (95%-conf.int. 0.28572 - 0.33721) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.06226 (95%-conf.int. 0.04469 - 0.08211) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.11415 (95%-conf.int. 0.09286 - 0.13572) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.08009 (95%-conf.int. 0.05991 - 0.10022) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.05373 (95%-conf.int. 0.04119 - 0.07621) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.18853 (95%-conf.int. 0.18000 - 0.19572) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.08164 (95%-conf.int. 0.06741 - 0.10630) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.42575 (95%-conf.int. 0.37891 - 0.51043) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.18740 (95%-conf.int. 0.17813 - 0.19687) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.25912 (95%-conf.int. 0.24324 - 0.28193) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.09330 (95%-conf.int. 0.07184 - 0.12198) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.03867 (95%-conf.int. 0.03387 - 0.04355) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.05439 (95%-conf.int. 0.04603 - 0.06385) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.17103 (95%-conf.int. 0.12525 - 0.25668) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.03905 (95%-conf.int. 0.03510 - 0.04250) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.06241 (95%-conf.int. 0.05492 - 0.07152) --------------------------------------------- baseline ROUGE-1 Average_R: 0.37623 (95%-conf.int. 0.35717 - 0.38911) baseline ROUGE-1 Average_P: 0.07614 (95%-conf.int. 0.06761 - 0.08169) baseline ROUGE-1 Average_F: 0.12636 (95%-conf.int. 0.11408 - 0.13479) --------------------------------------------- baseline ROUGE-2 Average_R: 0.09216 (95%-conf.int. 0.06810 - 0.11274) baseline ROUGE-2 Average_P: 0.01714 (95%-conf.int. 0.01357 - 0.02000) baseline ROUGE-2 Average_F: 0.02883 (95%-conf.int. 0.02274 - 0.03355) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.10685 (95%-conf.int. 0.09782 - 0.11789) baseline ROUGE-SU* Average_P: 0.00557 (95%-conf.int. 0.00446 - 0.00622) baseline ROUGE-SU* Average_F: 0.01055 (95%-conf.int. 0.00853 - 0.01175) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.26250 (95%-conf.int. 0.24646 - 0.27638) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.40024 (95%-conf.int. 0.36875 - 0.43125) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.31666 (95%-conf.int. 0.29661 - 0.33658) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.14267 (95%-conf.int. 0.13175 - 0.15217) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.22882 (95%-conf.int. 0.20000 - 0.25000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.17551 (95%-conf.int. 0.16033 - 0.18919) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.08453 (95%-conf.int. 0.07589 - 0.09030) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.20597 (95%-conf.int. 0.17428 - 0.23285) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.11917 (95%-conf.int. 0.10795 - 0.12962) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.26250 (95%-conf.int. 0.24646 - 0.27638) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.40024 (95%-conf.int. 0.36875 - 0.43125) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.31666 (95%-conf.int. 0.29661 - 0.33658) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.14267 (95%-conf.int. 0.13175 - 0.15217) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.22882 (95%-conf.int. 0.20000 - 0.25000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.17551 (95%-conf.int. 0.16033 - 0.18919) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.08453 (95%-conf.int. 0.07589 - 0.09030) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.20597 (95%-conf.int. 0.17428 - 0.23285) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.11917 (95%-conf.int. 0.10795 - 0.12962) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.31157 (95%-conf.int. 0.29708 - 0.32800) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.27160 (95%-conf.int. 0.25000 - 0.28929) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.28984 (95%-conf.int. 0.27384 - 0.30838) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.14343 (95%-conf.int. 0.13158 - 0.15446) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.12309 (95%-conf.int. 0.11538 - 0.13077) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.13227 (95%-conf.int. 0.12364 - 0.14095) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.10764 (95%-conf.int. 0.10180 - 0.11350) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.08859 (95%-conf.int. 0.07452 - 0.09856) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.09657 (95%-conf.int. 0.08717 - 0.10424) --------------------------------------------- baseline ROUGE-1 Average_R: 0.50625 (95%-conf.int. 0.46130 - 0.54000) baseline ROUGE-1 Average_P: 0.05970 (95%-conf.int. 0.05192 - 0.06490) baseline ROUGE-1 Average_F: 0.10675 (95%-conf.int. 0.09351 - 0.11587) --------------------------------------------- baseline ROUGE-2 Average_R: 0.17727 (95%-conf.int. 0.15263 - 0.19565) baseline ROUGE-2 Average_P: 0.01945 (95%-conf.int. 0.01602 - 0.02184) baseline ROUGE-2 Average_F: 0.03504 (95%-conf.int. 0.02905 - 0.03929) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.31296 (95%-conf.int. 0.26296 - 0.35096) baseline ROUGE-SU* Average_P: 0.00503 (95%-conf.int. 0.00367 - 0.00584) baseline ROUGE-SU* Average_F: 0.00991 (95%-conf.int. 0.00724 - 0.01148) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.12846 (95%-conf.int. 0.11475 - 0.13725) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.33322 (95%-conf.int. 0.22222 - 0.38889) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.18225 (95%-conf.int. 0.16666 - 0.20289) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.01093 (95%-conf.int. 0.00935 - 0.01204) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.09086 (95%-conf.int. 0.03409 - 0.12500) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.01878 (95%-conf.int. 0.01710 - 0.02196) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.05255 (95%-conf.int. 0.03333 - 0.06557) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.12112 (95%-conf.int. 0.04545 - 0.18182) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.07230 (95%-conf.int. 0.03846 - 0.09638) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.00429 (95%-conf.int. 0.00380 - 0.00468) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.02562 (95%-conf.int. 0.00769 - 0.03846) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.00703 (95%-conf.int. 0.00509 - 0.00834) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.05255 (95%-conf.int. 0.03333 - 0.06557) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.12112 (95%-conf.int. 0.04545 - 0.18182) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.07230 (95%-conf.int. 0.03846 - 0.09638) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.00429 (95%-conf.int. 0.00380 - 0.00468) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.02562 (95%-conf.int. 0.00769 - 0.03846) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.00703 (95%-conf.int. 0.00509 - 0.00834) --------------------------------------------- baseline ROUGE-1 Average_R: 0.47570 (95%-conf.int. 0.44262 - 0.53333) baseline ROUGE-1 Average_P: 0.16917 (95%-conf.int. 0.12308 - 0.20769) baseline ROUGE-1 Average_F: 0.24556 (95%-conf.int. 0.20000 - 0.28272) --------------------------------------------- baseline ROUGE-2 Average_R: 0.07875 (95%-conf.int. 0.06122 - 0.10714) baseline ROUGE-2 Average_P: 0.02604 (95%-conf.int. 0.02344 - 0.03125) baseline ROUGE-2 Average_F: 0.03838 (95%-conf.int. 0.03390 - 0.04278) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.19509 (95%-conf.int. 0.16838 - 0.24715) baseline ROUGE-SU* Average_P: 0.03108 (95%-conf.int. 0.01516 - 0.04198) baseline ROUGE-SU* Average_F: 0.05176 (95%-conf.int. 0.02857 - 0.06720) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.24122 (95%-conf.int. 0.20628 - 0.26866) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.39973 (95%-conf.int. 0.34000 - 0.45000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.30075 (95%-conf.int. 0.25364 - 0.33645) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.11523 (95%-conf.int. 0.08248 - 0.14286) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.19974 (95%-conf.int. 0.14444 - 0.25000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.14609 (95%-conf.int. 0.10493 - 0.18182) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.08346 (95%-conf.int. 0.06596 - 0.09641) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.22939 (95%-conf.int. 0.17593 - 0.27315) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.12220 (95%-conf.int. 0.09669 - 0.14252) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.24122 (95%-conf.int. 0.20628 - 0.26866) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.39973 (95%-conf.int. 0.34000 - 0.45000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.30075 (95%-conf.int. 0.25364 - 0.33645) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.11523 (95%-conf.int. 0.08248 - 0.14286) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.19974 (95%-conf.int. 0.14444 - 0.25000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.14609 (95%-conf.int. 0.10493 - 0.18182) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.08346 (95%-conf.int. 0.06596 - 0.09641) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.22939 (95%-conf.int. 0.17593 - 0.27315) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.12220 (95%-conf.int. 0.09669 - 0.14252) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.24122 (95%-conf.int. 0.20628 - 0.26866) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.26649 (95%-conf.int. 0.22667 - 0.30000) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.25312 (95%-conf.int. 0.21375 - 0.28347) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.07620 (95%-conf.int. 0.05222 - 0.09524) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.08562 (95%-conf.int. 0.05714 - 0.10714) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.08061 (95%-conf.int. 0.05452 - 0.10084) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.09038 (95%-conf.int. 0.06923 - 0.10458) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.11248 (95%-conf.int. 0.08404 - 0.13445) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.10003 (95%-conf.int. 0.07803 - 0.11765) --------------------------------------------- baseline ROUGE-1 Average_R: 0.39759 (95%-conf.int. 0.39071 - 0.40299) baseline ROUGE-1 Average_P: 0.08919 (95%-conf.int. 0.08514 - 0.09122) baseline ROUGE-1 Average_F: 0.14565 (95%-conf.int. 0.13993 - 0.14877) --------------------------------------------- baseline ROUGE-2 Average_R: 0.06366 (95%-conf.int. 0.04602 - 0.07937) baseline ROUGE-2 Average_P: 0.01368 (95%-conf.int. 0.00959 - 0.01712) baseline ROUGE-2 Average_F: 0.02251 (95%-conf.int. 0.01586 - 0.02816) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.14591 (95%-conf.int. 0.13813 - 0.15196) baseline ROUGE-SU* Average_P: 0.00786 (95%-conf.int. 0.00703 - 0.00838) baseline ROUGE-SU* Average_F: 0.01490 (95%-conf.int. 0.01338 - 0.01588) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.26449 (95%-conf.int. 0.23932 - 0.29625) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.48899 (95%-conf.int. 0.46111 - 0.51667) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.34253 (95%-conf.int. 0.31516 - 0.37356) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.07768 (95%-conf.int. 0.05980 - 0.09504) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.15002 (95%-conf.int. 0.11875 - 0.17500) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.10211 (95%-conf.int. 0.07996 - 0.12291) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.07435 (95%-conf.int. 0.06117 - 0.08947) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.28189 (95%-conf.int. 0.25000 - 0.30909) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.11721 (95%-conf.int. 0.09886 - 0.13825) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.25226 (95%-conf.int. 0.23110 - 0.28000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.60016 (95%-conf.int. 0.57143 - 0.62857) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.35449 (95%-conf.int. 0.33177 - 0.38459) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.07768 (95%-conf.int. 0.05980 - 0.09504) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.20003 (95%-conf.int. 0.15833 - 0.23333) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.11166 (95%-conf.int. 0.08731 - 0.13480) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.06333 (95%-conf.int. 0.05492 - 0.07575) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.39282 (95%-conf.int. 0.36111 - 0.42408) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.10876 (95%-conf.int. 0.09534 - 0.12769) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.28766 (95%-conf.int. 0.26950 - 0.31250) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.40011 (95%-conf.int. 0.37917 - 0.42083) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.33389 (95%-conf.int. 0.32069 - 0.35317) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.07768 (95%-conf.int. 0.05980 - 0.09504) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.10911 (95%-conf.int. 0.08636 - 0.12727) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.09051 (95%-conf.int. 0.07098 - 0.10856) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.08208 (95%-conf.int. 0.07285 - 0.09577) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.17932 (95%-conf.int. 0.16364 - 0.19481) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.11202 (95%-conf.int. 0.10249 - 0.12608) --------------------------------------------- baseline ROUGE-1 Average_R: 0.53678 (95%-conf.int. 0.51826 - 0.56290) baseline ROUGE-1 Average_P: 0.08413 (95%-conf.int. 0.07804 - 0.09159) baseline ROUGE-1 Average_F: 0.14529 (95%-conf.int. 0.13622 - 0.15614) --------------------------------------------- baseline ROUGE-2 Average_R: 0.11597 (95%-conf.int. 0.09147 - 0.14428) baseline ROUGE-2 Average_P: 0.01700 (95%-conf.int. 0.01368 - 0.01981) baseline ROUGE-2 Average_F: 0.02962 (95%-conf.int. 0.02378 - 0.03484) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.27384 (95%-conf.int. 0.25667 - 0.30092) baseline ROUGE-SU* Average_P: 0.00807 (95%-conf.int. 0.00712 - 0.00920) baseline ROUGE-SU* Average_F: 0.01566 (95%-conf.int. 0.01388 - 0.01777) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.42382 (95%-conf.int. 0.38169 - 0.48148) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.34290 (95%-conf.int. 0.31071 - 0.37143) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.37860 (95%-conf.int. 0.34540 - 0.41765) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.23304 (95%-conf.int. 0.19192 - 0.28165) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.18468 (95%-conf.int. 0.15770 - 0.21154) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.20573 (95%-conf.int. 0.17290 - 0.23888) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.21278 (95%-conf.int. 0.17485 - 0.26460) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.15000 (95%-conf.int. 0.12356 - 0.16971) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.17509 (95%-conf.int. 0.14376 - 0.20522) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.40594 (95%-conf.int. 0.36439 - 0.46017) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.51107 (95%-conf.int. 0.46111 - 0.55555) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.45190 (95%-conf.int. 0.41091 - 0.50256) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.25225 (95%-conf.int. 0.20122 - 0.30575) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.32489 (95%-conf.int. 0.26875 - 0.38125) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.28356 (95%-conf.int. 0.22985 - 0.33739) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.20161 (95%-conf.int. 0.16431 - 0.25225) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.33628 (95%-conf.int. 0.27500 - 0.38182) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.25087 (95%-conf.int. 0.20547 - 0.30204) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.44182 (95%-conf.int. 0.39525 - 0.50195) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.26317 (95%-conf.int. 0.23947 - 0.28684) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.32945 (95%-conf.int. 0.29996 - 0.36215) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.17473 (95%-conf.int. 0.14067 - 0.20924) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.09994 (95%-conf.int. 0.08055 - 0.11667) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.12697 (95%-conf.int. 0.10454 - 0.14953) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.23252 (95%-conf.int. 0.19012 - 0.28903) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.08994 (95%-conf.int. 0.07592 - 0.10186) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.12918 (95%-conf.int. 0.10937 - 0.14958) --------------------------------------------- baseline ROUGE-1 Average_R: 0.58004 (95%-conf.int. 0.54101 - 0.62202) baseline ROUGE-1 Average_P: 0.08353 (95%-conf.int. 0.07658 - 0.08798) baseline ROUGE-1 Average_F: 0.14595 (95%-conf.int. 0.13404 - 0.15316) --------------------------------------------- baseline ROUGE-2 Average_R: 0.21225 (95%-conf.int. 0.17795 - 0.24588) baseline ROUGE-2 Average_P: 0.02819 (95%-conf.int. 0.02372 - 0.03141) baseline ROUGE-2 Average_F: 0.04974 (95%-conf.int. 0.04184 - 0.05560) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.33741 (95%-conf.int. 0.27785 - 0.40135) baseline ROUGE-SU* Average_P: 0.00791 (95%-conf.int. 0.00647 - 0.00873) baseline ROUGE-SU* Average_F: 0.01545 (95%-conf.int. 0.01265 - 0.01704) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.32207 (95%-conf.int. 0.30492 - 0.33740) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.49989 (95%-conf.int. 0.48148 - 0.51852) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.39150 (95%-conf.int. 0.37323 - 0.40731) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.15519 (95%-conf.int. 0.13093 - 0.18241) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.25002 (95%-conf.int. 0.21875 - 0.28125) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.19136 (95%-conf.int. 0.16376 - 0.22115) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.10694 (95%-conf.int. 0.09541 - 0.11763) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.25561 (95%-conf.int. 0.24243 - 0.26894) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.15047 (95%-conf.int. 0.13683 - 0.16139) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.21522 (95%-conf.int. 0.19318 - 0.23734) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.37489 (95%-conf.int. 0.34375 - 0.40625) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.27329 (95%-conf.int. 0.24790 - 0.29695) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.11683 (95%-conf.int. 0.08536 - 0.14839) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.21417 (95%-conf.int. 0.16667 - 0.26190) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.15108 (95%-conf.int. 0.11290 - 0.18941) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.05733 (95%-conf.int. 0.04741 - 0.06728) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.17135 (95%-conf.int. 0.15000 - 0.19286) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.08575 (95%-conf.int. 0.07284 - 0.09871) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.35842 (95%-conf.int. 0.31133 - 0.41449) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.45428 (95%-conf.int. 0.40909 - 0.51515) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.40045 (95%-conf.int. 0.35065 - 0.46208) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.13642 (95%-conf.int. 0.09654 - 0.17544) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.17488 (95%-conf.int. 0.12500 - 0.21667) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.15317 (95%-conf.int. 0.10884 - 0.19385) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.14558 (95%-conf.int. 0.11265 - 0.17664) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.23440 (95%-conf.int. 0.19231 - 0.27821) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.17920 (95%-conf.int. 0.13986 - 0.21785) --------------------------------------------- baseline ROUGE-1 Average_R: 0.41076 (95%-conf.int. 0.40909 - 0.41325) baseline ROUGE-1 Average_P: 0.07985 (95%-conf.int. 0.07638 - 0.08333) baseline ROUGE-1 Average_F: 0.13366 (95%-conf.int. 0.12873 - 0.13846) --------------------------------------------- baseline ROUGE-2 Average_R: 0.09628 (95%-conf.int. 0.07606 - 0.11653) baseline ROUGE-2 Average_P: 0.01760 (95%-conf.int. 0.01408 - 0.02113) baseline ROUGE-2 Average_F: 0.02974 (95%-conf.int. 0.02376 - 0.03574) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.17201 (95%-conf.int. 0.16804 - 0.17719) baseline ROUGE-SU* Average_P: 0.00695 (95%-conf.int. 0.00631 - 0.00742) baseline ROUGE-SU* Average_F: 0.01335 (95%-conf.int. 0.01217 - 0.01422) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.08942 (95%-conf.int. 0.07691 - 0.10065) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.22479 (95%-conf.int. 0.18750 - 0.25625) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.12783 (95%-conf.int. 0.10896 - 0.14448) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.02083 (95%-conf.int. 0.01596 - 0.02572) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.05711 (95%-conf.int. 0.04285 - 0.07143) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.03050 (95%-conf.int. 0.02323 - 0.03780) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.01069 (95%-conf.int. 0.00924 - 0.01184) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.06851 (95%-conf.int. 0.05429 - 0.08000) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.01847 (95%-conf.int. 0.01576 - 0.02053) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.05967 (95%-conf.int. 0.04802 - 0.07143) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.17124 (95%-conf.int. 0.13571 - 0.20715) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.08844 (95%-conf.int. 0.06991 - 0.10581) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.01036 (95%-conf.int. 0.00512 - 0.01346) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.03333 (95%-conf.int. 0.01667 - 0.04167) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.01579 (95%-conf.int. 0.00783 - 0.02034) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.00889 (95%-conf.int. 0.00687 - 0.01089) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.07400 (95%-conf.int. 0.05370 - 0.09074) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.01587 (95%-conf.int. 0.01217 - 0.01935) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.05967 (95%-conf.int. 0.04802 - 0.07143) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.17124 (95%-conf.int. 0.13571 - 0.20715) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.08844 (95%-conf.int. 0.06991 - 0.10581) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.01036 (95%-conf.int. 0.00512 - 0.01346) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.03333 (95%-conf.int. 0.01667 - 0.04167) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.01579 (95%-conf.int. 0.00783 - 0.02034) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.00889 (95%-conf.int. 0.00687 - 0.01089) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.07400 (95%-conf.int. 0.05370 - 0.09074) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.01587 (95%-conf.int. 0.01217 - 0.01935) --------------------------------------------- baseline ROUGE-1 Average_R: 0.43970 (95%-conf.int. 0.42052 - 0.45412) baseline ROUGE-1 Average_P: 0.06026 (95%-conf.int. 0.05582 - 0.06404) baseline ROUGE-1 Average_F: 0.10595 (95%-conf.int. 0.09851 - 0.11206) --------------------------------------------- baseline ROUGE-2 Average_R: 0.06279 (95%-conf.int. 0.05096 - 0.07456) baseline ROUGE-2 Average_P: 0.00828 (95%-conf.int. 0.00655 - 0.01000) baseline ROUGE-2 Average_F: 0.01462 (95%-conf.int. 0.01160 - 0.01760) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.18185 (95%-conf.int. 0.15895 - 0.20014) baseline ROUGE-SU* Average_P: 0.00378 (95%-conf.int. 0.00312 - 0.00424) baseline ROUGE-SU* Average_F: 0.00741 (95%-conf.int. 0.00611 - 0.00828) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.33428 (95%-conf.int. 0.32121 - 0.35135) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.51399 (95%-conf.int. 0.50000 - 0.54286) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.40452 (95%-conf.int. 0.39111 - 0.41812) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.10341 (95%-conf.int. 0.08555 - 0.12642) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.16661 (95%-conf.int. 0.14167 - 0.19167) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.12737 (95%-conf.int. 0.10660 - 0.15198) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.11676 (95%-conf.int. 0.10597 - 0.13421) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.28864 (95%-conf.int. 0.27408 - 0.31111) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.16541 (95%-conf.int. 0.15310 - 0.18295) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.33428 (95%-conf.int. 0.32121 - 0.35135) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.51399 (95%-conf.int. 0.50000 - 0.54286) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.40452 (95%-conf.int. 0.39111 - 0.41812) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.14478 (95%-conf.int. 0.12632 - 0.17580) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.23321 (95%-conf.int. 0.20833 - 0.26667) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.17830 (95%-conf.int. 0.15726 - 0.21137) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.11676 (95%-conf.int. 0.10597 - 0.13421) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.28864 (95%-conf.int. 0.27408 - 0.31111) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.16541 (95%-conf.int. 0.15310 - 0.18295) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.33428 (95%-conf.int. 0.32121 - 0.35135) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.51399 (95%-conf.int. 0.50000 - 0.54286) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.40452 (95%-conf.int. 0.39111 - 0.41812) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.14478 (95%-conf.int. 0.12632 - 0.17580) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.23321 (95%-conf.int. 0.20833 - 0.26667) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.17830 (95%-conf.int. 0.15726 - 0.21137) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.11676 (95%-conf.int. 0.10597 - 0.13421) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.28864 (95%-conf.int. 0.27408 - 0.31111) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.16541 (95%-conf.int. 0.15310 - 0.18295) --------------------------------------------- baseline ROUGE-1 Average_R: 0.57373 (95%-conf.int. 0.55215 - 0.59555) baseline ROUGE-1 Average_P: 0.06734 (95%-conf.int. 0.06250 - 0.07391) baseline ROUGE-1 Average_F: 0.12046 (95%-conf.int. 0.11271 - 0.13134) --------------------------------------------- baseline ROUGE-2 Average_R: 0.10341 (95%-conf.int. 0.08555 - 0.12642) baseline ROUGE-2 Average_P: 0.01099 (95%-conf.int. 0.00934 - 0.01264) baseline ROUGE-2 Average_F: 0.01985 (95%-conf.int. 0.01683 - 0.02296) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.32462 (95%-conf.int. 0.30743 - 0.34186) baseline ROUGE-SU* Average_P: 0.00514 (95%-conf.int. 0.00452 - 0.00582) baseline ROUGE-SU* Average_F: 0.01011 (95%-conf.int. 0.00892 - 0.01143) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-1 Average_R: 0.18955 (95%-conf.int. 0.16514 - 0.21373) G5S2R2_NOCOLLAPSE ROUGE-1 Average_P: 0.66705 (95%-conf.int. 0.62500 - 0.70000) G5S2R2_NOCOLLAPSE ROUGE-1 Average_F: 0.29475 (95%-conf.int. 0.26196 - 0.32716) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-2 Average_R: 0.07983 (95%-conf.int. 0.06599 - 0.09351) G5S2R2_NOCOLLAPSE ROUGE-2 Average_P: 0.32032 (95%-conf.int. 0.28000 - 0.35000) G5S2R2_NOCOLLAPSE ROUGE-2 Average_F: 0.12760 (95%-conf.int. 0.10752 - 0.14742) --------------------------------------------- G5S2R2_NOCOLLAPSE ROUGE-SU* Average_R: 0.03571 (95%-conf.int. 0.02755 - 0.04378) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_P: 0.46054 (95%-conf.int. 0.40250 - 0.49750) G5S2R2_NOCOLLAPSE ROUGE-SU* Average_F: 0.06613 (95%-conf.int. 0.05172 - 0.08037) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_R: 0.17992 (95%-conf.int. 0.15866 - 0.20103) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_P: 0.63354 (95%-conf.int. 0.60833 - 0.65834) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-1 Average_F: 0.27981 (95%-conf.int. 0.25168 - 0.30772) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_R: 0.07979 (95%-conf.int. 0.06490 - 0.09351) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_P: 0.32001 (95%-conf.int. 0.28000 - 0.35000) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-2 Average_F: 0.12752 (95%-conf.int. 0.10451 - 0.14742) --------------------------------------------- G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_R: 0.03093 (95%-conf.int. 0.02463 - 0.03719) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_P: 0.40022 (95%-conf.int. 0.37250 - 0.42250) G5S2R2_NOCOLLAPSE_NODUPELIM ROUGE-SU* Average_F: 0.05729 (95%-conf.int. 0.04623 - 0.06827) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-1 Average_R: 0.17992 (95%-conf.int. 0.15866 - 0.20103) G5S2R2_NODUPELIM ROUGE-1 Average_P: 0.63354 (95%-conf.int. 0.60833 - 0.65834) G5S2R2_NODUPELIM ROUGE-1 Average_F: 0.27981 (95%-conf.int. 0.25168 - 0.30772) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-2 Average_R: 0.07979 (95%-conf.int. 0.06490 - 0.09351) G5S2R2_NODUPELIM ROUGE-2 Average_P: 0.32001 (95%-conf.int. 0.28000 - 0.35000) G5S2R2_NODUPELIM ROUGE-2 Average_F: 0.12752 (95%-conf.int. 0.10451 - 0.14742) --------------------------------------------- G5S2R2_NODUPELIM ROUGE-SU* Average_R: 0.03093 (95%-conf.int. 0.02463 - 0.03719) G5S2R2_NODUPELIM ROUGE-SU* Average_P: 0.40022 (95%-conf.int. 0.37250 - 0.42250) G5S2R2_NODUPELIM ROUGE-SU* Average_F: 0.05729 (95%-conf.int. 0.04623 - 0.06827) --------------------------------------------- baseline ROUGE-1 Average_R: 0.44941 (95%-conf.int. 0.43993 - 0.45853) baseline ROUGE-1 Average_P: 0.10103 (95%-conf.int. 0.09474 - 0.10736) baseline ROUGE-1 Average_F: 0.16477 (95%-conf.int. 0.15671 - 0.17288) --------------------------------------------- baseline ROUGE-2 Average_R: 0.06966 (95%-conf.int. 0.05455 - 0.08046) baseline ROUGE-2 Average_P: 0.01489 (95%-conf.int. 0.01277 - 0.01596) baseline ROUGE-2 Average_F: 0.02449 (95%-conf.int. 0.02069 - 0.02662) --------------------------------------------- baseline ROUGE-SU* Average_R: 0.20240 (95%-conf.int. 0.19513 - 0.20949) baseline ROUGE-SU* Average_P: 0.01189 (95%-conf.int. 0.01052 - 0.01326) baseline ROUGE-SU* Average_F: 0.02242 (95%-conf.int. 0.02001 - 0.02485)