MOPRD: A Multidisciplinary Open Peer Review Dataset

Open peer review is a growing trend in academic publications. Public access to peer review data can benefit both the academic and publishing communities. It also serves as a great support to studies on review comment generation and further to the realization of automated scholarly paper review. However, most of the existing peer review datasets do not provide data that cover the whole peer review process. Apart from this, their data are not diversified enough as the data are mainly collected from the field of computer science. These two drawbacks of the currently available peer review datasets need to be addressed to unlock more opportunities for related studies. In response, we construct MOPRD, a multidisciplinary open peer review dataset. This dataset consists of paper metadata, multiple version manuscripts, review comments, meta-reviews, author's rebuttal letters, and editorial decisions. Moreover, we propose a modular guided review comment generation method based on MOPRD. Experiments show that our method delivers better performance as indicated by both automatic metrics and human evaluation. We also explore other potential applications of MOPRD, including meta-review generation, editorial decision prediction, author rebuttal generation, and scientometric analysis. MOPRD is a strong endorsement for further studies in peer review-related research and other applications.

Disciplines of papers of MOPRD. One paper may fall into more than one discipline.


Download from Baidu Pan

If our resources are used, please cite:


@article{lin-moprd-2023, author = {Lin, Jialiang and Song, Jiaxin and Zhou, Zhangping and Chen, Yidong and Shi, Xiaodong}, title = {{MOPRD: A multidisciplinary open peer review dataset}}, journal = {Neural Computing and Applications}, volume = {35}, number = {34}, pages = {24191-24206}, DOI = {10.1007/s00521-023-08891-5}, year = {2023}, type = {Journal Article} }

If you have any questions or comments, please free feel to send emails to the first author Jialiang Lin through me at linjialiang dot net .

Copyright (c) 2022-2024, Natural Language Processing Lab at Xiamen University
All rights reserved.