Browsed by
Tag: Dataset

Micropinion Generation Dataset

Micropinion Generation Dataset

This dataset is contains 330 user reviews from CNET. The reviews are on products from various categories like tv, cell phones, gps etc. You will find two versions of the dataset :- “raw” and “pre-processed”.  The `raw` folder has the original reviews from CNET (full review text, pros and cons) without any pre-processing. The `pre-processed` folder contains sentences from the full review section of the reviews. All the pros and cons from the original reviews are omitted in this version.

Opinosis Dataset – Topic related review sentences

Opinosis Dataset – Topic related review sentences

This dataset contains sentences extracted from user reviews on a given topic. Example topics are “performance of Toyota Camry” and “sound quality of ipod nano”, etc. In total there are 51 such topics  with each topic having approximately 100 sentences (on average). The reviews were obtained from various sources – Tripadvisor (hotels), Edmunds.com (cars) and Amazon.com (various electronics).  This dataset was used for the following automatic text summarization project .