conda create -y -n copyright-llm python=3.10
conda activate copyright-llm
pip install torch --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
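After installation, a quick sanity check (a minimal sketch, not part of the repository) can confirm that PyTorch picked up the CUDA 11.8 build and can see a GPU:

# sanity_check.py -- minimal environment check, not part of the repository
import torch

print(f"PyTorch version: {torch.__version__}")          # should report a +cu118 build
print(f"CUDA available:  {torch.cuda.is_available()}")  # True if a GPU is visible
if torch.cuda.is_available():
    print(f"Device:          {torch.cuda.get_device_name(0)}")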
See the available model configurations in llm_wrapper.py under MODEL_CONFIGS, and the available runtime parameters in run.py under RunConfig.
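For orientation, the sketch below illustrates roughly what such a registry and config could look like. The field names, defaults, and checkpoint identifiers here are illustrative assumptions, not the repository's actual definitions; consult llm_wrapper.py and run.py for the real ones.

# Illustrative sketch only -- names and values are assumptions, not the repo's actual code.
from dataclasses import dataclass

# A MODEL_CONFIGS-style registry might map a short model name to a default checkpoint.
MODEL_CONFIGS = {
    "opt": "facebook/opt-6.7b",    # hypothetical default checkpoint
    "falcon": "tiiuae/falcon-7b",  # hypothetical default checkpoint
}

@dataclass
class RunConfig:
    # Hypothetical runtime parameters mirroring the command-line overrides shown below.
    model_name: str = "opt"
    dataset_name: str = "books"
    dataset_path: str = "./books"
    save_strategy: str = "LOCAL"
    save_path: str = "./outputs"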
Example to sequentially run the OPT (default 6.7B) and Falcon (default 7B) models and save outputs to a custom path:
python run.py \
--multirun \
model_name=opt,falcon \
dataset_name=books \
dataset_path=./books \
save_path=./hf-outputs
Example to run GPT-3.5-Turbo and save results locally to a custom path:
export OPENAI_API_KEY="my-key-123"
python run.py \
model_name=gpt-3.5 \
dataset_name=books \
dataset_path=./books \
save_strategy=LOCAL \
save_path=./gpt-outputs
The codebase also supports saving results to WandB by adding save_strategy=WANDB
as an argument. Before running, you also need to export your WandB API key or be logged in to WandB.
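For example, one way to authenticate programmatically (a sketch that assumes the WANDB_API_KEY environment variable is set) is:

# Sketch: authenticate with WandB before launching run.py with save_strategy=WANDB.
import os
import wandb

# wandb.login() also reads WANDB_API_KEY from the environment if no key is passed.
wandb.login(key=os.environ.get("WANDB_API_KEY"))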
@inproceedings{karamolegkou-etal-2023-copyright,
title = "Copyright Violations and Large Language Models",
author = "Karamolegkou, Antonia and
Li, Jiaang and
Zhou, Li and
S{\o}gaard, Anders",
editor = "Bouamor, Houda and
Pino, Juan and
Bali, Kalika",
booktitle = "Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing",
month = dec,
year = "2023",
address = "Singapore",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.emnlp-main.458",
doi = "10.18653/v1/2023.emnlp-main.458",
pages = "7403--7412",
}