updated models and load functions for almeida #18

sherrie9 · 2023-09-23T00:22:00Z

No description provided.

dfornika

Looking great @sherrie9. I've added a few notes, mostly just asking to remove lines of commented-out code if they're not needed.

dfornika · 2023-09-26T21:37:43Z

tb_db/parsers.py


    with open(speciation_path, 'r') as f:
        reader = csv.DictReader(f)
-        count = 0
+        #count = 0


Let's remove this count variable if it isn't going to be used rather than leaving it commented out

dfornika · 2023-09-26T21:38:26Z

tb_db/parsers.py

+                #species.append(speci)
+                #count+=1


Can we remove these lines?

dfornika · 2023-09-26T21:43:24Z

tb_db/parsers.py

    qcs = []
    with open(qc_path, 'r') as f:
        reader = csv.DictReader(f)
        for row in reader:
            qc = {
                'sample_id': row['sample_id'],
-                'sample_name': row['sample_id'],
+                #'sample_name': row['sample_id'],
+                'sequencing_run_id' : sequencing_run_id,
                'most_abundant_species_name':row['most_abundant_species_name'],
                'most_abundant_species_fraction_total_reads': float(row['most_abundant_species_fraction_total_reads']),


This is optional but I have a suggestion:

I try to wrap conversions to int and float in try / catch blocks like this:

try: total_bases = int(row['total_bases']) except ValueError as e: total_bases = None

If there are several fields that need to be converted then I'll do something like this:

float_fields = [ 'average_base_quality', 'percent_bases_above_q30', 'percent_gc', ] for field in float_fields: try: row[field] = float(row[field]) except ValueError as e: row[field] = None

dfornika · 2023-09-26T21:43:51Z

tb_db/parsers.py

@@ -185,7 +185,7 @@ def parse_cgmlst(cgmlst_path: str, uncalled='-'):
        reader = csv.DictReader(f)
        for row in reader:
            sample_id = row.pop('sample_id')
-            sample_id = sample_id[:6]
+            #sample_id = sample_id


Could we remove this line?

dfornika · 2023-09-26T21:44:41Z

tb_db/models.py

+    #accession = Column(String)
+    #collection_date = Column(Date)


Let's remove these if they aren't being used

dfornika · 2023-09-26T21:45:32Z

tb_db/models.py

@@ -71,9 +71,9 @@ class Library(Base):
    """

    sample_id = Column(Integer, ForeignKey("sample.id"), nullable=False)
-    sample_name = Column(String)
+    #sample_name = Column(String)


Could we remove this?

dfornika · 2023-09-26T21:45:41Z

tb_db/models.py

    sequencing_run_id = Column(String)
-    library_id = Column(String)
+    #library_id = Column(String)


Could we remove this?

dfornika · 2023-09-26T21:47:04Z

tests/test_crud.py

@@ -349,7 +349,7 @@ def create_cgmlst_profile(self, sample, profile,runid):
        assert(created_cgmlst_profile.library_id == created_libraries[0].id)


-
-TestSampleCrudMachine = SampleCrudMachine.TestCase
+#comment the below out as it says 'dict' object has no attribute 'startswith' that i am not sure how to fix, refer to line 305 in crud.py        


We'll have to take a look at the overall testing processes and make sure everything passes, but I'm not going to let it block merging this. We can address that later.

updated models and load functions for almeida

568534f

dfornika requested changes Sep 26, 2023

View reviewed changes

cleaned up

34a96b7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

updated models and load functions for almeida #18

updated models and load functions for almeida #18

sherrie9 commented Sep 23, 2023

dfornika left a comment

dfornika Sep 26, 2023

dfornika Sep 26, 2023

dfornika Sep 26, 2023

dfornika Sep 26, 2023

dfornika Sep 26, 2023

dfornika Sep 26, 2023

dfornika Sep 26, 2023

dfornika Sep 26, 2023

updated models and load functions for almeida #18

Are you sure you want to change the base?

updated models and load functions for almeida #18

Conversation

sherrie9 commented Sep 23, 2023

dfornika left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment