You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Oct 30, 2021. It is now read-only.
Since "the same" variant can occur on multiple records in our tables (see example records shown by query below), it breaks the experiment we are trying to do with table gvcf_variants_expanded.
In VCF format, the same variant can be encoded in multiple ways and we're not trying to solve that deeper issue here. Instead its just a simplistic grouping of data -> when the location in the genome, reference bases, and alternate bases match exactly, group those together in the same single record in the table.
SELECT
contig_name,
start_pos,
reference_bases,
alt,
numsamples,
dataset
FROM (
SELECT
contig_name,
start_pos,
reference_bases,
GROUP_CONCAT(alternate_bases) WITHIN RECORD AS alt,
COUNT(call.callset_name) WITHIN RECORD AS numsamples,
'gvcf' AS dataset,
FROM
[google.com:biggene:test.pgp_gvcf_variants]
WHERE
contig_name = '2'
AND start_pos=222324034
AND reference_bases='CTA'),
(
SELECT
contig_name,
start_pos,
reference_bases,
GROUP_CONCAT(alternate_bases) WITHIN RECORD AS alt,
COUNT(call.callset_name) WITHIN RECORD AS numsamples,
'exp' AS dataset,
FROM
[google.com:biggene:test.pgp_gvcf_variants_expanded]
WHERE
contig_name = '2'
AND start_pos=222324034
AND reference_bases='CTA')
ORDER BY
dataset,
alt
The text was updated successfully, but these errors were encountered:
Since "the same" variant can occur on multiple records in our tables (see example records shown by query below), it breaks the experiment we are trying to do with table
gvcf_variants_expanded
.In VCF format, the same variant can be encoded in multiple ways and we're not trying to solve that deeper issue here. Instead its just a simplistic grouping of data -> when the location in the genome, reference bases, and alternate bases match exactly, group those together in the same single record in the table.
The text was updated successfully, but these errors were encountered: