Skip to content

generate RNA molecules from haploid genome #11

@yihangs

Description

@yihangs

Hi,

Thank you for developing such great software! I have one sample with pair-end reads (1.fastq, 2.fastq), and I want to simulate molecules based on this sample and the haploid reference genome without the need to phase diploid genomes. How can I achieve that?

I have the following observations:

  1. Contradictory to the paper saying "BEAGLE requires at least two samples to perform its phasing operation", I use one sample as the input and camparee can still output two parental genomes.
  2. The config file says that if we set the parameter "pooled" to be True, camparee will skip the phasing step. However, I tried it but still got two parental genomes.
  3. I manually changed ploidy.txt file and set all the chromosomes to be haploid. The software failed at GenomeBuilderStep.

By the way, in the output file molecule_file.txt, each transcript id has the following form: 1_ENST00000564768.1_1/1_ENST00000564768.1_2, does "1_" represent the index of the sample, and "_1"/"_2" in the last represent the index of parental genome?

Also, there are some bugs in camparee_controller.py line 253, 257, "input_sample['filename']", "input_sample['filenames']". There is no parameter "filename" or "filenames" in our current config file.

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions