Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Updated .gitignore to remove cache-based items * Updated snakemake installation * Updated documentation to include snakemake generic cluster commands * `ruff` formatting changes * Updated cpu request command * Ignore vscode * Use `threads` instead of `resources.threads` * Remove dependency of external FTP servers for fastq contaminant genomes * Update zip_url for get_contaminant_genomes rule * Format using `snakefmt` * Remove `groups`, add additional details on what changing settings will do * Include information on why tissue name was left blank * Do not provide a tissue name for default resources. Require all rules to define one, or set it empty (`tissue_name=""`) * Perform `prefetch` and `dump fastq` by default * Format using ruff * Use inline if-else statements for (i think?) cleaner/clearer requirements for `rule all` * Fix formatting changes from `snakefmt` because I don't like them. Change the get_contaminant_genomes url because I made a mistake making the initial zip archive by excluding the `Adapters` component * Mark root output as "directory" in rule `get_contaminant_genomes` * Properly indent `zip` and closing parenthesis * Fix comment * Reduce complexity of prefetch, also fix downloading to scratch directory * Expand nested list to create a single list * Ignore any control files * Change profile to a default of `cluster` * Ignore `master_control.csv` * Added `ruff.toml` to modify settings for ruff * Updated environment requirements for genome generation * Added functions to generate the required genome-related files automatically * Moved genome generation to its own rule, modified star genome indexing to a rule with a better name Fixed rules that relied on `rule generate_genome`, as this rule is different than it was previously * Updated config values based on auto-generating genome-related files * Update documentation to install latest version of python * Remove master_control.csv from git tracking * Re-add master_control.csv * Add blank master control * re-ignore master control * Added environment for genome generation * Use delimiter directly * Rename genome values * Remove genome validation because the genome is created automatically now * Remove genome validation, it is no longer required * Hide ruff.toml so it is not shown to users * Ignore all control files * Custom cache for getting latest ensembl release as the traditional `@cache` method requires function arguments * Fix genome sizes filename * Fix missing commas * Reorder output files * Install python in the `generate_genome` environment * Use private URL variable names * Fix comment * Fix snakemake's expected output location of genome-related files * Add `rich` to environment * Remove un-necessary imports
- Loading branch information