Skip to content

Reference gene count variation #14

@dhooper1

Description

@dhooper1

Hi TOGA2 team,

Thanks first for a great resource! I was wondering if you could help explain why the number of reference genes might differ between query assemblies run against the same reference. In the summary output files generated by TOG2, two examples attached here, the total number of reference genes considered present plus those considered lost/missing is different across queries.

summary.apus_apus.txt

summary.hemiprocne_comata.txt

I am trying to compare results between queries but am having trouble proceeding when the denominator (i.e., the total number of reference genes) differs between queries. In the examples shared above there were 16,071 reference genes when Apus_apus was used as query and 16,139 when Hemiprocne_comata was used as query. Both TOGA2 runs used the same reference genome, annotation, and isoform files and both query assemblies were aligned to the same reference assembly.

Many thanks for your time and guidance!

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions