https://hdl.handle.net/21.11129/0000-000D-FE97-B (persistent URL to this page)
The corpus contains 1,030 online communication messages drawn from Network News Transfer Protocol (NNTP) newsgroups and from the bug tracking systems Bugzilla and GitHub. The NNTP articles and the Bugzilla and GitHub comments were selected at random so that the sample exhibits characteristics similar to those of the population as a whole. Each message was manually annotated as a request or a non-request. The corpus was created as part of the work presented in the current paper and is described in section 3. We intend to make the corpus freely available.