
What is available

Naming and versioning scheme

How to apply

 |
Publications |

Statistics
|
 |
Please note that Section 3.3 of the Agreement
requires all corpus users to provide to Reuters a copy of any paper published
using data from the Corpus. Additions and updates to this page should be
sent to: research.corpus@reuters.com.
Publications received so far include:
- J. Mayfield, P. McNamee, C. Costello, C. Piatko and A. Banerjee,
"JHU/APL at TREC 2001: Experiments in Filtering and in Arabic,
Video, and Web Retrieval".
TE. M. Voorhees and D. K. Harman (eds.), Proceedings of
the Tenth Text Rtrieval Conference (TREC 2001), NIST Special Publication,
2002.
- T.G. Rose, M. Stevenson and M. Whitehead,
"The
Reuters Corpus Volume 1 - from Yesterday's News to Tomorrow's Language
Resources" [245k PDF].
In Proceedings of the Third International Conference on Language Resources
and Evaluation, Las Palmas de Gran Canaria, 29-31 May 2002.
- M. Weeks, Victoria J. Hodge, and Jim Austin,
"A Hardware Accelerated Novel IR System."
In Proceedings of PDP-2002, 10th Euromicro Workshop on Parallel, Distributed
and Network-based Processing, Las Palmas de Gran Canaria, Canary Islands.
9-11 January 2002.
- Y. Zhang, and J. Callan,
"The Bias Problem in Language Models and Adaptive Filtering".
E. M. Voorhees and D. K. Harman (eds.), Proceedings of
the Tenth Text Retrieval Conference (TREC 2001), NIST Special Publication,
2002.
- Wermter S., and Hung C.
"Selforganizing Classification on the New Reuters News
Corpus".
Proceedings of the International Conference on Computational
Linguistics, Taipeh, Taiwan, August 2002.
- David D. Lewis
"Applying Support Vector Machines to the {TREC-2001} Batch
Filtering and Routing Tasks" in
E. M. Voorhees and D. K. Harman (eds.), Proceedings of the
Tenth Text REtrieval Conference (TREC 2001), NIST Special Publication, 2002.
- Avi Arampatzis.
"Unbiased S-D Threshold Optimization, Initial Query Degradation,
Decay, and Incrementality, for Adaptive Document Filtering." in
E. M. Voorhees and D. K. Harman (eds.), Proceedings of the Tenth
Text REtrieval Conference (TREC 2001), NIST Special Publication, 2002.
- James Curran and Miles Osborne.
"A very very large corpus doesn't always yield reliable
estimates" Joint CoNLL02 - Workshop on Very Large Corpora, Taipei, Taiwan. 2002.
- James Curran and Marc Moens.
"Scaling Context Space" Proceedings of the 40th Annual Meeting of
the Association for Computational Linguistics,
University of Philadelphia, Pennsylvania. 2002.
- Clark P, Harrison P, Thompson J.
"A Knowledge-Driven Approach to Text Meaning Processing", in Proceedings
HLT Workshop on Text Meaning, May 31st 2003, Association of Computational Linguistics.
- Erik F. Tjong Kim Sang and Fien De Meulder.
"Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity
Recognition" in Proceedings of CoNLL-2003, Edmonton, Canada, 2003, pp. 142-147.
- Khmelev D., Teahan W.
"A repetition based measure for verification of text collections and
for text categorization." SIGIR'2003, July 28-August 1, 2003, Toronto, Canada.
- Stephen Robertson and Ian Soboroff
"The TREC 2002 Filtering Track Report." In Proceedings of the Eleventh
Text Retrieval Conference (TREC 2002),
Gaithersburg, MD, November 2002.
- Ian Soboroff and Stephen Robertson
"Building a Filtering Collection for TREC 2002." In Proceedings
of the Twenty-Sixth ACM Conference on Research
and Development in Information Retrieval (SIGIR 2003), Toronto, Ontario, July 2003.
- Oren Glickman and Ido Dagan,
"Identifying Lexical Paraphrases From a Single Corpus: A Case Study for Verbs."
Proceedings of Recent Advantages in Natural Language Processing (RANLP '03), September 10-12, 2003.
- Filip Ginter, Jorma Boberg, Journi Järvinen and Tapio Salakoski,
"New Techniques for Disambiguation in Natural Language and Their Application to Biological Text."
Journal of Machine Learning Research,5:605-621, 2004.
|