Dtype mismatch in cat: bfloat16 and float16 #812
Labels
nemo
Issues needed to support NVIDIA NeMo models.
program-coverage
Requests for model and program coverage
🚀 Model / language coverage
First, I applied this diff to thunder:
The diff was necessary to get the beginning of the output below, which conveys that a
cat
operator is what is at fault:Full log of the run
Instructions on how to run NeVA are in #343.
Pitch
This is for the NeVA model #343 .
Alternatives / Potential work-arounds
It seems like our
cat
checks are too stringent, in that torch allows mismatched dtypes here:I suppose torch semantics are to cast each type to the first type?
Note this is very similar to #750. It seems like the issue in #750 just appeared in
cat
even though the error was earlier, but now we are finding the issue incat
through some other code.Minimal Repro
cc @tfogal
The text was updated successfully, but these errors were encountered: