|
|
Re: Newspaper article corpora: msg#00068
science.linguistics.corpora
|
Subject: |
Re: Newspaper article corpora |
There is also a free Reuters one put together by Lewis:
http://www.daviddlewis.com/resources/testcollections/reuters21578/
Chris
Mark Davies wrote:
Does anyone know of corpora devoted to English language
newspaper articles (possibly available online)? I saw that
Reuters has developed one, but there are a few hurdles to
navigate in order to access it.
The BNC has about 10-11 million words from newspapers, and is available
as part of the VIEW site (http://view.byu.edu).
This site also allows you to search just one part of the 100 million
word corpus (i.e. just the newspapers), so this may be a useful feature
for you. Finally, you can even zero in on just sub-registers of
newspapers (broadsheet, tabloid, sports reporting, etc).
Best,
Mark Davies
=================================================
Mark Davies
Assoc. Prof., Linguistics
Brigham Young University
(phone) 801-422-9168 / (fax) 801-422-0906
http://davies-linguistics.byu.edu
** Corpus design and use // Linguistic databases **
** Historical linguistics // Language variation **
** English, Spanish, and Portuguese **
=================================================
|
|