From I studied and read about “Using
corpus analysis software to analyze specialized texts” I can summarize the
knowledge as follow:
π¬ What is a corpus? π¬
A corpus can be generally defined as a collection
of naturally-occurring texts in a computer-readable format which can be
retrieved and analyzed using corpus analysis software.
π Sources of language corpora π
We can subscribe to a large corpus provider such as
the British National Corpus (BNC) or use web concordancing (for instance http://corpus.leeds.ac.uk, http://corpus.byu.edu/) or compile own
corpora and analyse data using corpus analysis software like ‘Antconc’ , ‘Wordsmith’
or ‘Paraconc’.
π Designing a specialized corpus π
We have to designing a specialized corpus as
follow:
- Corpus size
- Text extracts vs. full texts
- Number of texts
- Medium
- Subject and text type
- Other considerations
π₯Example: my specialized corpus profile
Size
|
50072 words
|
Source of corpus data
|
|
Number of texts
|
148 texts
|
Medium
|
Written
|
Subject
|
England country and travel in England
|
Text type
|
New article
|
Authorship
|
|
Language
|
Texts written in English mostly by
native speaker
|
Publication date
|
Recent texts (Retrieved in August
2018)
|
π Analyzing Specialized
Corpus π
We have to analyzing specialized corpus from these topics as follow:
- Terminologies and collocation
- Local grammar
- Style
- Content knowledge
No comments:
Post a Comment