Enhance standard templates #75

aaverbec · 2019-02-18T00:41:30Z

No description provided.

Enhance templates to convert bigint to timestamp. Also add split-by logic to the sqoop statement.

afoerster · 2019-02-20T14:05:52Z

Hey @aaverbec, it looks like this has some conflicts that need to be resolved before the tests can be run.

aaverbec · 2019-02-20T18:18:47Z

I tried resolving the conflicts, but it wouldn't let me commit the changes back, so I just copied my resolved code back to the branch. The remaining conflicts should take my changes. I'm not sure how to make GitHub accept my resolved conflicts. (I fix all of the conflicts and then click the merge option, and it just spins for over an hour saying it is trying to merge.)

aaverbec · 2019-02-20T18:19:36Z

If you'd prefer, I can re-fork my repository from the current repo, re-make my changes, and submit a new PR.

afoerster · 2019-02-20T18:35:52Z

templates/sqoop-parquet-full-load/test-rowcount.sh

@@ -16,8 +16,7 @@ set -e
 # Check parquet table
 AVRO=$({{ conf.impala_cmd }} avro-table-rowcount.sql -B 2> /dev/null)
 PARQUET=$({{ conf.impala_cmd }} report-table-rowcount.sql -B 2> /dev/null)
-SOURCE=$(cat sourceCount.txt)
-
+SOURCE=$({{ conf.source_database.cmd }} source-table-rowcount.sql -s -r -N -B 2> /dev/null)


Why is stderr redirected to /dev/null?
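
A minimal sketch of one alternative, under stated assumptions: capture only stdout as the row count, but append stderr to a log file so a failing query still leaves a trace. `fake_query` and `ERR_LOG` are hypothetical names used for illustration; `fake_query` stands in for the real `{{ conf.source_database.cmd }}` call, whose behavior is not shown in this thread.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical stand-in for the real query command: prints a row count
# on stdout and a diagnostic message on stderr.
fake_query() {
    echo "warning: using default connection" >&2
    echo "42"
}

# Assumed log location; a real script would pick a stable path.
ERR_LOG="$(mktemp)"

# Command substitution captures stdout only; stderr is appended to the
# log instead of being discarded.
SOURCE=$(fake_query 2>> "$ERR_LOG")

echo "count=$SOURCE"
echo "stderr was kept in $ERR_LOG"
```

With this shape the count variable stays clean for comparison, and the log can be inspected when the counts do not match.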

afoerster · 2019-02-20T18:36:36Z

templates/shared/parquet-table-rowcount.sql

@@ -14,6 +14,6 @@

 -- Query Parquet table in Impala
 USE {{ conf.staging_database.name }};
-INVALIDATE METADATA {{ table.destination.name }}_parquet;
-SELECT COUNT(*) FROM {{ table.destination.name }}_parquet;
+INVALIDATE METADATA {{ table.destination.name }}{% if conf.user_defined is defined and conf.user_defined.parquet_suffix is defined %}{{ conf.user_defined.parquet_suffix }}{% endif %};


Can you give an example of how this user_defined block is used and what it's for?

This raises a flag because it will break existing scripts that expect the _parquet suffix.

afoerster · 2019-02-20T18:38:20Z

templates/sqoop-parquet-full-load/avro-table-create.sql

 {% for column in table.columns %}
-`{{ column.name.replace('/','_') }}` {{ map_datatypes(column).avro }} COMMENT "{{ column.comment }}"
+`{{ cleanse_column(column.name) }}` {{ map_datatypes_v2(column).avro }} COMMENT "{{ column.comment }}"


Good use of cleanse_column.

afoerster · 2019-02-20T18:39:30Z

templates/sqoop-parquet-full-load/copy-avsc.sh

I'm not sure -p is the safe thing to do here. If the user has misconfigured something, the application will happily keep running. What was your thought process for adding this? Can you manually create the directories first?
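
To illustrate the concern, here is a rough sketch of a stricter alternative to -p: create the directory only when its parent already exists, so a mistyped base path fails loudly instead of being silently created. `make_dir_strict` is a hypothetical helper, and the sketch uses local `mkdir` purely for illustration; the actual command in copy-avsc.sh is not shown in this thread.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: create a directory only if its parent exists.
# Unlike `mkdir -p`, a misconfigured base path is reported as an error
# rather than being created on the fly.
make_dir_strict() {
    local target="$1"
    local parent
    parent="$(dirname "$target")"
    if [ ! -d "$parent" ]; then
        echo "error: parent directory '$parent' does not exist" >&2
        return 1
    fi
    # Accept an existing directory; otherwise create just the last level.
    [ -d "$target" ] || mkdir "$target"
}

# Demo against a throwaway base directory.
BASE="$(mktemp -d)"
make_dir_strict "$BASE/avsc"
echo "created $BASE/avsc"
```

The helper stays idempotent for the final path component, which is usually what -p was added for, without creating missing ancestors.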

afoerster · 2019-02-20T18:42:39Z

templates/sqoop-parquet-full-load/report-table-create.sql

+USE `{{ conf.staging_database.name }}`;
+CREATE EXTERNAL TABLE IF NOT EXISTS `{{ table.destination.name }}` (
+{%- for column in table.columns %}
+`{{ cleanse_column(column.name) }}` {{ map_datatypes_v2(column, 'parquet') }} COMMENT "{{ column.comment }}" {%- if not loop.last -%}, {% endif %}


You removed the decimal logic. Does map_datatypes_v2 handle the decimal type?

afoerster · 2019-02-20T18:47:18Z

I've never tried the GitHub tools for merging, so I don't know about that. I'd recommend merging on the command line or re-forking like you said.
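
For reference, a self-contained sketch of a command-line merge that keeps the feature branch's side of a conflict, which matches the "they should take my changes" case described earlier. It builds a throwaway repository so it can be run anywhere; the file name, branch name `feature`, and commit messages are made up for the demo and are not taken from this PR.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Throwaway repository so the sketch is self-contained.
REPO="$(mktemp -d)"
cd "$REPO"
git init -q
git config user.email "demo@example.com"
git config user.name "Demo"

# Base commit on the default branch.
echo "SELECT 1;" > query.sql
git add query.sql
git commit -q -m "base"
BASE_BRANCH="$(git rev-parse --abbrev-ref HEAD)"

# The feature branch changes the line...
git checkout -q -b feature
echo "SELECT 2;" > query.sql
git commit -q -am "feature change"

# ...and the base branch changes the same line, which sets up a conflict.
git checkout -q "$BASE_BRANCH"
echo "SELECT 3;" > query.sql
git commit -q -am "base change"

# Merge the base branch into the feature branch; the merge stops on the conflict.
git checkout -q feature
git merge "$BASE_BRANCH" > /dev/null 2>&1 || true

# Resolve by keeping the feature branch's version ("ours"), then conclude the merge.
git checkout --ours -- query.sql
git add query.sql
git commit -q -m "Merge $BASE_BRANCH into feature, keeping feature changes"

cat query.sql
```

In a real fork the base branch would come from the upstream repository after a fetch, and each conflicted file should be reviewed rather than resolved wholesale.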

aaverbec and others added 8 commits September 18, 2018 13:16

Updated Shared Templates

cbdd414

New Template

11b1918

Updates to Kudu Template

5608a63

Add Alter Table to make file

2002ee8

Enhance Templates

8c9c05d

Enhance templates to convert bigint to timestamp Also add split-by logic to sqoop statement

Additional parquet enhancements

a35f23e

Remove Product View

cba3416

Delete view

f599fe0

aaverbec and others added 2 commits February 20, 2019 11:35

Update tables-drop.sql

f21bd46

merge conflicts

894db3f

afoerster suggested changes Feb 20, 2019

fix v2

3880623
