Country Sites Products & Services Careers Reuters.com

You are here >

HOME > PUBLICATIONS


What is available

Naming and versioning scheme

How to apply

Publications

Statistics
Invisible Placement Image
Publications

Please note that Section 3.3 of the Agreement requires all corpus users to provide to Reuters a copy of any paper published using data from the Corpus. Additions and updates to this page should be sent to: research.corpus@reuters.com.

Publications received so far include:

  1. J. Mayfield, P. McNamee, C. Costello, C. Piatko and A. Banerjee,
    "JHU/APL at TREC 2001: Experiments in Filtering and in Arabic, Video, and Web Retrieval".
    TE. M. Voorhees and D. K. Harman (eds.), Proceedings of the Tenth Text Rtrieval Conference (TREC 2001), NIST Special Publication, 2002.
  2. T.G. Rose, M. Stevenson and M. Whitehead,
    "The Reuters Corpus Volume 1 - from Yesterday's News to Tomorrow's Language Resources" [245k PDF].
    In Proceedings of the Third International Conference on Language Resources and Evaluation, Las Palmas de Gran Canaria, 29-31 May 2002.
  3. M. Weeks, Victoria J. Hodge, and Jim Austin,
    "A Hardware Accelerated Novel IR System."
    In Proceedings of PDP-2002, 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing, Las Palmas de Gran Canaria, Canary Islands. 9-11 January 2002.
  4. Y. Zhang, and J. Callan,
    "The Bias Problem in Language Models and Adaptive Filtering".
    E. M. Voorhees and D. K. Harman (eds.), Proceedings of the Tenth Text Retrieval Conference (TREC 2001), NIST Special Publication, 2002.
  5. Wermter S., and Hung C.
    "Selforganizing Classification on the New Reuters News Corpus".
    Proceedings of the International Conference on Computational Linguistics, Taipeh, Taiwan, August 2002.
  6. David D. Lewis
    "Applying Support Vector Machines to the {TREC-2001} Batch Filtering and Routing Tasks" in E. M. Voorhees and D. K. Harman (eds.), Proceedings of the Tenth Text REtrieval Conference (TREC 2001), NIST Special Publication, 2002.
  7. Avi Arampatzis.
    "Unbiased S-D Threshold Optimization, Initial Query Degradation, Decay, and Incrementality, for Adaptive Document Filtering." in E. M. Voorhees and D. K. Harman (eds.), Proceedings of the Tenth Text REtrieval Conference (TREC 2001), NIST Special Publication, 2002.
  8. James Curran and Miles Osborne.
    "A very very large corpus doesn't always yield reliable estimates" Joint CoNLL02 - Workshop on Very Large Corpora, Taipei, Taiwan. 2002.
  9. James Curran and Marc Moens.
    "Scaling Context Space" Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, University of Philadelphia, Pennsylvania. 2002.
  10. Clark P, Harrison P, Thompson J.
    "A Knowledge-Driven Approach to Text Meaning Processing", in Proceedings HLT Workshop on Text Meaning, May 31st 2003, Association of Computational Linguistics.
  11. Erik F. Tjong Kim Sang and Fien De Meulder.
    "Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition" in Proceedings of CoNLL-2003, Edmonton, Canada, 2003, pp. 142-147.
  12. Khmelev D., Teahan W.
    "A repetition based measure for verification of text collections and for text categorization." SIGIR'2003, July 28-August 1, 2003, Toronto, Canada.
  13. Stephen Robertson and Ian Soboroff
    "The TREC 2002 Filtering Track Report." In Proceedings of the Eleventh Text Retrieval Conference (TREC 2002), Gaithersburg, MD, November 2002.
  14. Ian Soboroff and Stephen Robertson
    "Building a Filtering Collection for TREC 2002." In Proceedings of the Twenty-Sixth ACM Conference on Research and Development in Information Retrieval (SIGIR 2003), Toronto, Ontario, July 2003.
  15. Oren Glickman and Ido Dagan,
    "Identifying Lexical Paraphrases From a Single Corpus: A Case Study for Verbs." Proceedings of Recent Advantages in Natural Language Processing (RANLP '03), September 10-12, 2003.
  16. Filip Ginter, Jorma Boberg, Journi Järvinen and Tapio Salakoski,
    "New Techniques for Disambiguation in Natural Language and Their Application to Biological Text." Journal of Machine Learning Research,5:605-621, 2004.