py-bats

While this code works and is all well and good, it is very specific to our workflow. I am creating a manuscript preparation package at manus

Introduction

This is a Python implementation of a workflow engine used at Northwestern University Libraries for the Bulletin of Applied Transgender Studies. The workflow is simple: it first converts the docx files to md files, then finds the references section of each md file. A separate script then calls the GPT API to generate a list of BibTeX references. Finally, the script creates a JATS XML file from the generated markdown file that contains the BibTeX references.
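The step that finds the references section could be sketched roughly as follows. This is not the repository's own code: the function name `split_references`, the set of accepted heading titles, and the one-reference-per-line layout are all assumptions made for illustration.

```python
import re

# Assumption: the references section starts at a markdown heading
# (one to six '#' characters) whose text is one of these titles.
HEADING = re.compile(r"^#{1,6}\s+(.+?)\s*$")
REFERENCE_TITLES = {"references", "bibliography", "works cited"}


def split_references(markdown_text):
    """Return (body_lines, reference_lines) for a markdown document."""
    lines = markdown_text.splitlines()
    for index, line in enumerate(lines):
        match = HEADING.match(line)
        if match and match.group(1).lower() in REFERENCE_TITLES:
            # Assumption: every non-empty line after the heading is one reference.
            references = [ref.strip() for ref in lines[index + 1:] if ref.strip()]
            return lines[:index], references
    # No references heading found: return the document unchanged.
    return lines, []
```

The returned list of references could then be written to a plain-text file, one entry per line, before being converted to BibTeX.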

Docx to Markdown

From the py_bats directory, run the following command:

python3 docx2md.py -i <input_file> -o <output_file>

This creates a markdown file from the docx file and saves it at the output_file location.
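For readers who want to see how a command line like the one above might be handled, here is a minimal sketch. Only the `-i` and `-o` flags come from this README; the long option names, the `check_paths` helper, and the accepted file extensions are hypothetical, and the conversion itself is deliberately left out because the README does not show how docx2md.py performs it.

```python
import argparse
from pathlib import Path


def parse_args(argv=None):
    """Parse the -i and -o flags used in the command above."""
    parser = argparse.ArgumentParser(description="Convert a docx file to markdown.")
    # The long names --input and --output are assumptions for this sketch.
    parser.add_argument("-i", "--input", required=True, type=Path,
                        help="path to the .docx file")
    parser.add_argument("-o", "--output", required=True, type=Path,
                        help="path for the markdown file")
    return parser.parse_args(argv)


def check_paths(input_path, output_path):
    """Return a list of problems; an empty list means the paths look usable."""
    problems = []
    if input_path.suffix.lower() != ".docx":
        problems.append(f"input is not a .docx file: {input_path}")
    if output_path.suffix.lower() not in {".md", ".markdown"}:
        problems.append(f"output is not a markdown file: {output_path}")
    return problems
```

Checking the extensions before converting gives a clearer error than a failed conversion would when the two arguments are swapped by mistake.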

Markdown Extractor

You will also find a .txt file that contains the references in plain text. Convert all of the citations to BibLaTeX format using Copilot, GPT, or Zotero, as you prefer. Make sure you change the file extension to .bib when you are done.
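Once the references have been converted, the rename and a quick sanity check could look something like this. The helper names `rename_to_bib` and `count_entries` are hypothetical, and the entry count is only a rough regular-expression check, not a full BibLaTeX parser.

```python
import re
from pathlib import Path

# Matches the start of an entry such as "@article{doe2020," on its own line.
ENTRY_START = re.compile(r"^\s*@(\w+)\s*\{", re.MULTILINE)
# Assumption: these block types are not bibliography entries and are skipped.
NON_ENTRIES = {"comment", "string", "preamble"}


def count_entries(bib_text):
    """Count bibliography entries in BibTeX/BibLaTeX source text."""
    kinds = ENTRY_START.findall(bib_text)
    return sum(1 for kind in kinds if kind.lower() not in NON_ENTRIES)


def rename_to_bib(txt_path):
    """Rename a converted .txt file to .bib and return the new path."""
    txt_path = Path(txt_path)
    bib_path = txt_path.with_suffix(".bib")
    txt_path.rename(bib_path)
    return bib_path
```

Comparing the result of `count_entries` with the number of references in the original plain-text list is a cheap way to notice entries that were dropped or merged during conversion.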

What is left to do in this project

Automate the docx > markdown conversion
Add metadata to the markdown file
Split references into a txt file
Call GPT to convert the references to BibTeX
Add the BibTeX references to the markdown file
Produce a JATS XML file from the markdown file
Parse author information and add to metadata section
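As an illustration of the metadata items in the list above, the following sketch prepends a metadata block to a markdown file. It assumes the metadata section is a YAML block delimited by `---` lines at the top of the file, which this README does not confirm; the function name `add_front_matter` and the example keys are hypothetical.

```python
import json


def add_front_matter(markdown_text, metadata):
    """Prepend a YAML metadata block to markdown text.

    Values are serialized with json.dumps; JSON strings and lists
    can be read as YAML flow values, so no YAML library is needed here.
    """
    # Assumption: a document that already starts with '---' has metadata.
    if markdown_text.startswith("---"):
        return markdown_text
    lines = ["---"]
    for key, value in metadata.items():
        lines.append(f"{key}: {json.dumps(value, ensure_ascii=False)}")
    lines.append("---")
    return "\n".join(lines) + "\n\n" + markdown_text
```

A call such as `add_front_matter(text, {"title": "A Study", "author": ["J. Doe", "R. Roe"]})` would place the title and the author list above the first heading of the document.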
