The following scripts are used to process the raw texts in the corpus of Royal Society journals.
The Perl code consists of four scripts, numbered 0, 1, 2 and 3. Run them in numerical order to create twenty subcorpora that contain only the full-length articles from the Royal Society corpus.
The zipped folders are used to calculate relative entropy and linguistic concreteness. Each zipped folder handles one of three functions (concreteness, lemma, and POS trigram).
This part includes a code script and texts. Because the full set of PTRS texts is very large, only a few samples are uploaded here. The scripts are written in Linux shell and R. Running them produces the corresponding texts and data.
Run the four Python scripts in the zipped folder "concrentess" to obtain the results on relative entropy and concreteness.
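
For readers unfamiliar with the measure, here is a minimal sketch of a relative entropy calculation between two token lists. It assumes that relative entropy means Kullback-Leibler divergence over unigram frequencies with additive smoothing; the function name `relative_entropy`, the `alpha` parameter, and the sample tokens are hypothetical and are not taken from the scripts in this repository, which may compute the measure differently.

```python
import math
from collections import Counter


def relative_entropy(tokens_a, tokens_b, alpha=1.0):
    """Return the KL divergence D(A || B) in bits.

    Hypothetical helper for illustration only. Additive smoothing
    (alpha) keeps every probability non-zero, so the logarithm is
    defined for items that occur in only one of the two samples.
    """
    counts_a = Counter(tokens_a)
    counts_b = Counter(tokens_b)
    vocab = set(counts_a) | set(counts_b)

    # Smoothed totals: observed tokens plus alpha for each vocabulary item.
    total_a = sum(counts_a.values()) + alpha * len(vocab)
    total_b = sum(counts_b.values()) + alpha * len(vocab)

    kld = 0.0
    for item in vocab:
        p = (counts_a[item] + alpha) / total_a
        q = (counts_b[item] + alpha) / total_b
        kld += p * math.log2(p / q)
    return kld


if __name__ == "__main__":
    # Made-up sample tokens standing in for two time periods.
    early = "the experiment was made by the author".split()
    late = "the results of the analysis are shown".split()
    print(relative_entropy(early, late))
```

The same function can be applied to lemmas or POS trigrams by passing lists of those units instead of word tokens. Note that the measure is asymmetric, so swapping the two arguments generally gives a different value.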
@article{sun2021evolutionary,
title={The evolutionary pattern of language in scientific writings: A case study of Philosophical Transactions of Royal Society (1665--1869)},
author={Sun, Kun and Liu, Haitao and Xiong, Wenxin},
journal={Scientometrics},
volume={126},
number={2},
pages={1695--1724},
year={2021},
publisher={Springer}
}