Wednesday, September 19, 2018

(Week 5) Using corpus analysis software to analyze specialized texts


From what I studied and read about “Using corpus analysis software to analyze specialized texts”, I can summarize the knowledge as follows:

💬 What is a corpus? 💬
A corpus can be generally defined as a collection of naturally-occurring texts in a computer-readable format which can be retrieved and analyzed using corpus analysis software.

👉 Sources of language corpora 👈
We can subscribe to a large corpus such as the British National Corpus (BNC), use web concordancing (for instance http://corpus.leeds.ac.uk, http://corpus.byu.edu/), or compile our own corpora and analyse the data using corpus analysis software such as AntConc, WordSmith Tools or ParaConc.
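
To give a feel for what a concordancer does with a corpus, here is a minimal Python sketch of a keyword-in-context (KWIC) search. It is only an illustration and not how AntConc or the other tools are implemented; the function name `kwic`, the `width` parameter and the sample sentence are my own assumptions.

```python
import re

def kwic(text, keyword, width=3):
    """Return (left context, node word, right context) for every hit of keyword.

    Hypothetical helper for illustration: tokens are lowercased words,
    and `width` is the number of context words kept on each side.
    """
    tokens = re.findall(r"\w+", text.lower())
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, tok, right))
    return lines

# Made-up sample text, not taken from a real corpus.
sample = "England is famous for tea. Many tourists visit England every summer."
concordance = kwic(sample, "england", width=2)

# Print the hits with the node word aligned in the middle.
for left, node, right in concordance:
    print(f"{left:>20} [{node}] {right}")
```

Real concordancers add sorting, wildcard search and file handling on top of this basic idea.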

πŸ’ Designing a specialized corpus πŸ’
We have to designing a specialized corpus as follow:
- Corpus size
- Text extracts vs. full texts
- Number of texts
- Medium
- Subject and text type
- Other considerations
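
Some of the points above (corpus size and number of texts) can be checked with a few lines of Python once the texts are collected. The sketch below is a rough illustration under my own assumptions: the names `tokenize` and `corpus_profile` are hypothetical, the tokenizer is very simple, and the two sample texts are made up.

```python
import re

def tokenize(text):
    """Lowercase a text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9]+(?:'[a-z]+)?", text.lower())

def corpus_profile(texts):
    """Summarize a list of texts: number of texts, size in words, distinct words."""
    tokens_per_text = [len(tokenize(t)) for t in texts]
    distinct = {tok for t in texts for tok in tokenize(t)}
    total = sum(tokens_per_text)
    return {
        "number_of_texts": len(texts),
        "size_in_words": total,
        "distinct_words": len(distinct),
        "mean_words_per_text": total / len(texts) if texts else 0.0,
    }

# Made-up sample texts standing in for the files of a real corpus.
sample_texts = [
    "London is the capital of England.",
    "Travel in England is easy by train.",
]
profile = corpus_profile(sample_texts)
print(profile)
```

Word counts from corpus analysis software may differ slightly from this sketch, because each tool has its own rules for what counts as a word.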

💥 Example: my specialized corpus profile

- Size: 50,072 words
- Source of corpus data:
- Number of texts: 148 texts
- Medium: Written
- Subject: England as a country and travel in England
- Text type: News article
- Authorship:
- Language: Texts written in English, mostly by native speakers
- Publication date: Recent texts (retrieved in August 2018)

👊 Analyzing a specialized corpus 👊
We can analyze a specialized corpus in terms of the following:
- Terminology and collocations
- Local grammar
- Style
- Content knowledge
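
For the first point in the list above, terminology and collocations, a word frequency list and a count of the words that appear near a search word are the usual starting points. The Python sketch below shows the idea in a simplified form. The function names `tokenize` and `collocates`, the `span` parameter and the sample text are my own assumptions, and the context window ignores sentence boundaries, which real tools may handle differently.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase a text and keep only alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())

def collocates(tokens, node, span=2):
    """Count the words found within `span` tokens to the left or right of `node`."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == node:
            window = tokens[max(0, i - span):i] + tokens[i + 1:i + 1 + span]
            counts.update(window)
    return counts

# Made-up sample text, not taken from a real corpus.
sample = "Visit the Lake District. The Lake District is a national park. The lake is calm."
tokens = tokenize(sample)

frequency = Counter(tokens)                           # word frequency list
lake_collocates = collocates(tokens, "lake", span=1)  # immediate neighbours of "lake"

print(frequency.most_common(3))
print(lake_collocates.most_common())
```

In a specialized corpus, words that are both frequent and repeatedly found next to each other (like "lake" and "district" in this toy text) are good candidates for terms and collocations to study further.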
