From I studied and read about “Using
corpus analysis software to analyze specialized texts” I can summarize the
knowledge as follow:
π¬ What is a corpus? π¬
A corpus can be generally defined as a collection
of naturally-occurring texts in a computer-readable format which can be
retrieved and analyzed using corpus analysis software.
π Sources of language corpora π
We can subscribe to a large corpus provider such as
the British National Corpus (BNC) or use web concordancing (for instance http://corpus.leeds.ac.uk, http://corpus.byu.edu/) or compile own
corpora and analyse data using corpus analysis software like ‘Antconc’ , ‘Wordsmith’
or ‘Paraconc’.
π Designing a specialized corpus π
We have to designing a specialized corpus as
follow:
- Corpus size
- Text extracts vs. full texts
- Number of texts
- Medium
- Subject and text type
- Other considerations
π₯Example: my specialized corpus profile
| 
Size | 
50072 words | 
| 
Source of corpus data | |
| 
Number of texts | 
148 texts | 
| 
Medium | 
Written | 
| 
Subject | 
England country and travel in England | 
| 
Text type | 
New article | 
| 
Authorship | |
| 
Language | 
Texts written in English mostly by
  native speaker | 
| 
Publication date | 
Recent texts (Retrieved in August
  2018) | 
π Analyzing Specialized
Corpus π
We have to analyzing specialized corpus from these topics as follow:
- Terminologies and collocation
- Local grammar
- Style
- Content knowledge
 
 
No comments:
Post a Comment