Supplementary materials, including data and R Markdown Notebook, for a paper titled The lexicalisation of HAPPINESS in the Malayic varieties of Indonesia
Gede Primahadi Wijaya Rajeg & I Made Rajeg
Universitas Udayana, Indonesia
This repository is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
If you use the data or other materials here in your research and/or teaching, please cite the OSF repository (Rajeg and Rajeg 2021) as follows (formatted according to the Unified Style Sheet for Linguistics):
Rajeg, Gede Primahadi Wijaya & I Made Rajeg. 2021. Supplementary materials for The lexicalisation of HAPPINESS in the Malayic varieties of Indonesia. Open Science Framework (OSF). https://doi.org/10.17605/OSF.IO/Y42F6.
Alternatively, cite the Zenodo version of the repository:
Rajeg, Gede Primahadi Wijaya & I Made Rajeg. 2021. Supplementary materials for The lexicalisation of HAPPINESS in the Malayic varieties of Indonesia. Zenodo. https://doi.org/10.5281/zenodo.5166425.
This repository provides supplementary materials for our paper titled
The lexicalisation of HAPPINESS in the Malayic varieties of Indonesia,
presented at the International Seminar on Austronesian Languages and
Literature IX (10 September 2021) (conference website). The materials
include (i) the data; (ii) the R Markdown Notebook, which interleaves
the text of the paper with the R code used to write the whole paper and
to run the statistical analyses and visualisations; and (iii) the
figures included in the paper (see the figures folder). The study is
based on the large, open-access corpora of naturalistic colloquial
Malay/Indonesian published by the Max Planck Institute for Evolutionary
Anthropology (MPI EVA) Jakarta Field Station (JFS) (Gil et al. 2015).
The data folder holds the data used in this paper:
- indo-prov-latlong.csv provides latitude and longitude data for all provinces in Indonesia
- malayic_happy_freq_long_lat.tsv provides the original latitude and longitude data and those manually culled from Google Maps
- malayic_happy.tsv contains the original raw data for the HAPPINESS lexicalisation
- malayic_LIKE_df.tsv contains the distribution of morphs glossed as ‘to like’ in all regions
- malayic_LIKE_df_WK_ENT.tsv contains the distribution of morphs glossed as ‘to like’ in the West Kalimantan and East Nusa Tenggara regions
- non_acquisition_malayic_sessions_dataset_project.tsv contains the metadata for the Malayic subset of the MPI EVA JFS corpora, including the session names, regions, languoid, word count per session, genre, and mode, among others
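The files in the data folder are plain tab-separated (or comma-separated) text, so they can be inspected before running the Notebook. The following is only a minimal sketch in base R, not the code used in the Notebook itself (which may read the files differently, for instance with readr). The folder name "data", the working directory (assumed to be the repository root), and the helper name read_supp_tsv are assumptions made for illustration.

```r
# Assumption: the working directory is the repository root and the files
# sit in a folder named "data"; adjust data_dir if that is not the case.
data_dir <- "data"
happy_path <- file.path(data_dir, "malayic_happy.tsv")

# Hypothetical helper: read a tab-separated file into a data frame,
# keeping text columns as character vectors.
read_supp_tsv <- function(path) {
  utils::read.delim(path, stringsAsFactors = FALSE, fileEncoding = "UTF-8")
}

if (file.exists(happy_path)) {
  happy <- read_supp_tsv(happy_path)
  str(happy)  # inspect column names and types before any analysis
} else {
  message("File not found: ", happy_path)
}
```

Calling str() first is a cautious way to learn the column names, since they are not documented in this README.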
The following R packages are used for the data processing, statistical
analyses, visualisation, and knitting of the R Markdown Notebook file
(austronesian-paper-2021-gpwrajeg.Rmd) into MS Word format. Please make
sure that they are installed in R before running the code in the R
Notebook to reproduce the results.
- tidyverse collection of packages (Wickham et al. 2019; Wickham 2021b) – to conduct the data manipulation, processing, and visualisation, especially the functions from the following packages: dplyr (Wickham et al. 2021), ggplot2 (Wickham 2016; Wickham et al. 2020), readr (Wickham and Hester 2020), stringr (Wickham 2019), tibble (Müller and Wickham 2021), and tidyr (Wickham 2021a)
- bookdown (Xie 2021, 2016) and knitr (Xie 2015, 2020) – to print the tables and knit the R Markdown Notebook into an MS Word document
- rmarkdown (Allaire et al. 2021; Xie, Allaire, and Grolemund 2018; Xie, Dervieux, and Riederer 2020) – to write the paper, combining the R code and regular text
- maps (Brownrigg 2018) and mapdata (Becker, Wilks, and Brownrigg 2018) – to generate the Indonesian map
- ggthemes (Arnold 2021) – to customise the theme for the map visualisation
- ggrepel (Slowikowski 2020) – to make automatic, non-overlapping text labels
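Before knitting, it may help to check which of the packages listed above are missing from the local R library. The sketch below uses base R plus rmarkdown::render(); it is an illustration rather than part of the original Notebook. It assumes that the .Rmd file sits in the working directory and that its YAML header already declares the MS Word output format.

```r
# Packages named in the list above
required_pkgs <- c("tidyverse", "bookdown", "knitr", "rmarkdown",
                   "maps", "mapdata", "ggthemes", "ggrepel")

# TRUE for each package whose namespace can be loaded from the library paths
is_installed <- vapply(required_pkgs, requireNamespace, logical(1),
                       quietly = TRUE)
missing_pkgs <- required_pkgs[!is_installed]

if (length(missing_pkgs) > 0) {
  message("Missing packages: ", paste(missing_pkgs, collapse = ", "))
  # install.packages(missing_pkgs)  # uncomment to install from CRAN
} else {
  message("All listed packages are installed.")
}

# Knit the Notebook only when nothing is missing and the file is present.
# Assumption: the output format is taken from the YAML header of the .Rmd.
rmd_file <- "austronesian-paper-2021-gpwrajeg.Rmd"
if (length(missing_pkgs) == 0 && file.exists(rmd_file)) {
  rmarkdown::render(rmd_file)
}
```

Package versions newer than those in the session info may produce slightly different output, so the versions recorded there remain the reference point for reproduction.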
The R Session info sub-section below shows the R version (R Core Team 2021) and operating system used for this project.
devtools::session_info()
#> ─ Session info ───────────────────────────────────────────────────────────────
#> setting value
#> version R version 4.0.5 (2021-03-31)
#> os macOS Big Sur 10.16
#> system x86_64, darwin17.0
#> ui X11
#> language (EN)
#> collate en_US.UTF-8
#> ctype en_US.UTF-8
#> tz Asia/Makassar
#> date 2021-08-10
#>
#> ─ Packages ───────────────────────────────────────────────────────────────────
#> package * version date lib source
#> assertthat 0.2.1 2019-03-21 [1] CRAN (R 4.0.0)
#> backports 1.1.7 2020-05-13 [1] CRAN (R 4.0.0)
#> cachem 1.0.5 2021-05-15 [1] CRAN (R 4.0.2)
#> callr 3.6.0 2021-03-28 [1] CRAN (R 4.0.2)
#> cli 2.4.0 2021-04-05 [1] CRAN (R 4.0.2)
#> crayon 1.4.1 2021-02-08 [1] CRAN (R 4.0.2)
#> desc 1.2.0 2018-05-01 [1] CRAN (R 4.0.0)
#> devtools 2.3.0 2020-04-10 [1] CRAN (R 4.0.0)
#> digest 0.6.25 2020-02-23 [1] CRAN (R 4.0.0)
#> ellipsis 0.3.1 2020-05-15 [1] CRAN (R 4.0.0)
#> evaluate 0.14 2019-05-28 [1] CRAN (R 4.0.0)
#> fastmap 1.0.1 2019-10-08 [1] CRAN (R 4.0.0)
#> fs 1.4.1 2020-04-04 [1] CRAN (R 4.0.0)
#> glue 1.4.1 2020-05-13 [1] CRAN (R 4.0.0)
#> htmltools 0.4.0 2019-10-04 [1] CRAN (R 4.0.0)
#> knitr 1.30 2020-09-22 [1] CRAN (R 4.0.2)
#> magrittr 2.0.1 2020-11-17 [1] CRAN (R 4.0.2)
#> memoise 2.0.0 2021-01-26 [1] CRAN (R 4.0.2)
#> pkgbuild 1.0.8 2020-05-07 [1] CRAN (R 4.0.0)
#> pkgload 1.1.0 2020-05-29 [1] CRAN (R 4.0.0)
#> prettyunits 1.1.1 2020-01-24 [1] CRAN (R 4.0.0)
#> processx 3.5.1 2021-04-04 [1] CRAN (R 4.0.2)
#> ps 1.6.0 2021-02-28 [1] CRAN (R 4.0.2)
#> R6 2.4.1 2019-11-12 [1] CRAN (R 4.0.0)
#> Rcpp 1.0.7 2021-07-07 [1] CRAN (R 4.0.2)
#> remotes 2.1.1 2020-02-15 [1] CRAN (R 4.0.0)
#> rlang 0.4.11 2021-04-30 [1] CRAN (R 4.0.2)
#> rmarkdown 2.7 2021-02-19 [1] CRAN (R 4.0.2)
#> rprojroot 1.3-2 2018-01-03 [1] CRAN (R 4.0.0)
#> sessioninfo 1.1.1 2018-11-05 [1] CRAN (R 4.0.0)
#> stringi 1.5.3 2020-09-09 [1] CRAN (R 4.0.2)
#> stringr 1.4.0 2019-02-10 [1] CRAN (R 4.0.0)
#> testthat 3.0.2 2021-02-14 [1] CRAN (R 4.0.2)
#> usethis 1.6.1 2020-04-29 [1] CRAN (R 4.0.0)
#> withr 2.4.1 2021-01-26 [1] CRAN (R 4.0.2)
#> xfun 0.22 2021-03-11 [1] CRAN (R 4.0.2)
#> yaml 2.2.1 2020-02-01 [1] CRAN (R 4.0.0)
#>
#> [1] /Users/Primahadi/Rlibs
#> [2] /Library/Frameworks/R.framework/Versions/4.0/Resources/library
Allaire, JJ, Yihui Xie, Jonathan McPherson, Javier Luraschi, Kevin Ushey, Aron Atkins, Hadley Wickham, Joe Cheng, Winston Chang, and Richard Iannone. 2021. Rmarkdown: Dynamic Documents for R. https://CRAN.R-project.org/package=rmarkdown.
Arnold, Jeffrey B. 2021. Ggthemes: Extra Themes, Scales and Geoms for Ggplot2. https://github.com/jrnold/ggthemes.
Brownrigg, Ray. 2018. Maps: Draw Geographical Maps. https://CRAN.R-project.org/package=maps.
Gil, David, Uri Tadmor, John Bowden, and Bradley Taylor. 2015. “Data from the Jakarta Field Station, Department of Linguistics, Max Planck Institute for Evolutionary Anthropology, 1999-2015.” https://lingweb.eva.mpg.de/archive/jakarta/data.php.html.
Müller, Kirill, and Hadley Wickham. 2021. Tibble: Simple Data Frames. https://CRAN.R-project.org/package=tibble.
R Core Team. 2021. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.
Rajeg, Gede Primahadi Wijaya, and I Made Rajeg. 2021. “Supplementary Materials for The Lexicalisation of HAPPINESS in the Malayic Varieties of Indonesia.” Open Science Framework (OSF). https://doi.org/10.17605/OSF.IO/Y42F6.
Becker, Richard A., Allan R. Wilks, and Ray Brownrigg. 2018. Mapdata: Extra Map Databases. Original S code by Richard A. Becker and Allan R. Wilks; R version by Ray Brownrigg. https://CRAN.R-project.org/package=mapdata.
Slowikowski, Kamil. 2020. Ggrepel: Automatically Position Non-Overlapping Text Labels with Ggplot2. http://github.com/slowkow/ggrepel.
Wickham, Hadley. 2016. Ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. https://ggplot2.tidyverse.org.
———. 2019. Stringr: Simple, Consistent Wrappers for Common String Operations. https://CRAN.R-project.org/package=stringr.
———. 2021a. Tidyr: Tidy Messy Data. https://CRAN.R-project.org/package=tidyr.
———. 2021b. Tidyverse: Easily Install and Load the Tidyverse. https://CRAN.R-project.org/package=tidyverse.
Wickham, Hadley, Mara Averick, Jennifer Bryan, Winston Chang, Lucy D’Agostino McGowan, Romain François, Garrett Grolemund, et al. 2019. “Welcome to the tidyverse.” Journal of Open Source Software 4 (43): 1686. https://doi.org/10.21105/joss.01686.
Wickham, Hadley, Winston Chang, Lionel Henry, Thomas Lin Pedersen, Kohske Takahashi, Claus Wilke, Kara Woo, Hiroaki Yutani, and Dewey Dunnington. 2020. Ggplot2: Create Elegant Data Visualisations Using the Grammar of Graphics. https://CRAN.R-project.org/package=ggplot2.
Wickham, Hadley, Romain François, Lionel Henry, and Kirill Müller. 2021. Dplyr: A Grammar of Data Manipulation. https://CRAN.R-project.org/package=dplyr.
Wickham, Hadley, and Jim Hester. 2020. Readr: Read Rectangular Text Data. https://CRAN.R-project.org/package=readr.
Xie, Yihui. 2015. Dynamic Documents with R and Knitr. 2nd ed. Boca Raton, Florida: Chapman and Hall/CRC. https://yihui.org/knitr/.
———. 2016. Bookdown: Authoring Books and Technical Documents with R Markdown. Boca Raton, Florida: Chapman and Hall/CRC. https://bookdown.org/yihui/bookdown.
———. 2020. Knitr: A General-Purpose Package for Dynamic Report Generation in R. https://yihui.org/knitr/.
———. 2021. Bookdown: Authoring Books and Technical Documents with R Markdown. https://CRAN.R-project.org/package=bookdown.
Xie, Yihui, J. J. Allaire, and Garrett Grolemund. 2018. R Markdown: The Definitive Guide. Boca Raton, Florida: Chapman and Hall/CRC. https://bookdown.org/yihui/rmarkdown.
Xie, Yihui, Christophe Dervieux, and Emily Riederer. 2020. R Markdown Cookbook. Boca Raton, Florida: Chapman and Hall/CRC. https://bookdown.org/yihui/rmarkdown-cookbook.