This repository contains text data and code for analysis of language used on Open Access websites. The project owners are Lauren B. Collister (lbcollister@pitt.edu) and Melissa Cantrell (melissa.cantrell@colorado.edu).
Results from this study have been published in the following outlets:
Cantrell, Melissa H.; Collister, Lauren B. 2019. The status quo bias and the uptake of open access. First Monday, 24:7. DOI: https://doi.org/10.5210/fm.v24i7.10089. (The data for this paper are found in the "OAComplexity" folder of this repository.)
Collister, Lauren B. and Cantrell, Melissa H. 2021. From “Patchy Endorsements” to Intentional Advocacy: Deconstructing Bias in the Language of Open Access. Journal of Librarianship and Scholarly Communication, 9(1), p.eP2395. DOI: https://doi.org/10.7710/2162-3309.2395. (The data for this paper are found in the "PatchyEndorsements" folder of this repository.)
The text files in this repository were pulled directly from source websites, and therefore copyright for those texts belong to the source indicated. We are using these texts under Fair Use, particularly a transformative use as they were used for textual analysis.
All code and text beyond the source texts are owned by Melissa Cantrell and Lauren Collister, and licensed for your re-use under a Creative Commons - Attribution 4.0 license.
Inside the OAComplexity folder, the files found in the "texts" folder contain HTML from the main content pane of several websites.
The code for the analysis of the texts is included in the Jupyter Notebook file "OAComplexity.ipynb". The code and output can be viewed in GitHub or run on your own machine after cloning the repository.
The raw numbers for our analysis are available in the spreadsheet titled "analysis_numbers.csv". There is also an Excel .xlsx file with the below information included as comments, and the publisher names in a different order to match other data collected for this project.
Two spreadsheets containing data for our qualitative analyses of Open Access definitions, choice language, and costs of Open Access publishing are included on the spreadsheets in the "qualitative analysis" folder.
Please see the README file in this folder for more information.
This folder contains spreadsheets containing the sources and analyses. Please see the README file in this folder for more details.
This folder contains preliminary analyses for an upcoming project. Please see the README file in this folder for more information as it is added.
For any questions about this project please contact Lauren B. Collister (lbcollister@pitt.edu) and Melissa Cantrell (melissa.cantrell@colorado.edu).