-
Notifications
You must be signed in to change notification settings - Fork 683
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Docs] MessagePack IDL, Pydantic Support, and Attribute Access (#6022)
* [Docs] MessagePack IDL, Pydantic Support and Attribute Access Signed-off-by: Future-Outlier <eric901201@gmail.com> * support Signed-off-by: Future-Outlier <eric901201@gmail.com> * update Signed-off-by: Future-Outlier <eric901201@gmail.com> * lint Signed-off-by: Future-Outlier <eric901201@gmail.com> * Trigger CI Signed-off-by: Future-Outlier <eric901201@gmail.com> * Trigger CI Signed-off-by: Future-Outlier <eric901201@gmail.com> * lint Signed-off-by: Future-Outlier <eric901201@gmail.com> * Update docs/user_guide/data_types_and_io/dataclass.md Co-authored-by: David Espejo <82604841+davidmirror-ops@users.noreply.github.com> Signed-off-by: Han-Ru Chen (Future-Outlier) <eric901201@gmail.com> * Update docs/user_guide/data_types_and_io/pydantic_basemodel.md Co-authored-by: David Espejo <82604841+davidmirror-ops@users.noreply.github.com> Signed-off-by: Han-Ru Chen (Future-Outlier) <eric901201@gmail.com> * Update docs/user_guide/data_types_and_io/pydantic_basemodel.md Co-authored-by: David Espejo <82604841+davidmirror-ops@users.noreply.github.com> Signed-off-by: Han-Ru Chen (Future-Outlier) <eric901201@gmail.com> * Update docs/user_guide/data_types_and_io/pydantic_basemodel.md Co-authored-by: David Espejo <82604841+davidmirror-ops@users.noreply.github.com> Signed-off-by: Han-Ru Chen (Future-Outlier) <eric901201@gmail.com> * Update docs/user_guide/data_types_and_io/pydantic_basemodel.md Co-authored-by: David Espejo <82604841+davidmirror-ops@users.noreply.github.com> Signed-off-by: Han-Ru Chen (Future-Outlier) <eric901201@gmail.com> * nit Signed-off-by: Future-Outlier <eric901201@gmail.com> * nit Signed-off-by: Future-Outlier <eric901201@gmail.com> * Update docs/user_guide/data_types_and_io/dataclass.md Co-authored-by: David Espejo <82604841+davidmirror-ops@users.noreply.github.com> Signed-off-by: Han-Ru Chen (Future-Outlier) <eric901201@gmail.com> * Update docs/user_guide/data_types_and_io/dataclass.md Co-authored-by: David Espejo <82604841+davidmirror-ops@users.noreply.github.com> Signed-off-by: Han-Ru Chen (Future-Outlier) <eric901201@gmail.com> * Update docs/user_guide/data_types_and_io/pydantic_basemodel.md Co-authored-by: David Espejo <82604841+davidmirror-ops@users.noreply.github.com> Signed-off-by: Han-Ru Chen (Future-Outlier) <eric901201@gmail.com> * Update docs/user_guide/data_types_and_io/pydantic_basemodel.md Co-authored-by: David Espejo <82604841+davidmirror-ops@users.noreply.github.com> Signed-off-by: Han-Ru Chen (Future-Outlier) <eric901201@gmail.com> * format Signed-off-by: Future-Outlier <eric901201@gmail.com> --------- Signed-off-by: Future-Outlier <eric901201@gmail.com> Signed-off-by: Han-Ru Chen (Future-Outlier) <eric901201@gmail.com> Co-authored-by: David Espejo <82604841+davidmirror-ops@users.noreply.github.com>
- Loading branch information
1 parent
09a6fb8
commit e13babb
Showing
4 changed files
with
132 additions
and
8 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
103 changes: 103 additions & 0 deletions
103
docs/user_guide/data_types_and_io/pydantic_basemodel.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,103 @@ | ||
(pydantic_basemodel)= | ||
|
||
# Pydantic BaseModel | ||
|
||
```{eval-rst} | ||
.. tags:: Basic | ||
``` | ||
|
||
`flytekit` version >=1.14 supports natively the `JSON` format that Pydantic `BaseModel` produces, enhancing the | ||
interoperability of Pydantic BaseModels with the Flyte type system. | ||
|
||
:::{important} | ||
Pydantic BaseModel V2 only works when you are using flytekit version >= v1.14.0. | ||
::: | ||
|
||
With the 1.14 release, `flytekit` adopted `MessagePack` as the serialization format for Pydantic `BaseModel`, | ||
overcoming a major limitation of serialization into a JSON string within a Protobuf `struct` datatype like the previous versions do: | ||
|
||
to store `int` types, Protobuf's `struct` converts them to `float`, forcing users to write boilerplate code to work around this issue. | ||
|
||
:::{important} | ||
By default, `flytekit >= 1.14` will produce `msgpack` bytes literals when serializing, preserving the types defined in your `BaseModel` class. | ||
If you're serializing `BaseModel` using `flytekit` version >= v1.14.0 and you want to produce Protobuf `struct` literal instead, you can set environment variable `FLYTE_USE_OLD_DC_FORMAT` to `true`. | ||
|
||
For more details, you can refer the MESSAGEPACK IDL RFC: https://github.com/flyteorg/flyte/blob/master/rfc/system/5741-binary-idl-with-message-pack.md | ||
::: | ||
|
||
```{note} | ||
You can put Dataclass and FlyteTypes (FlyteFile, FlyteDirectory, FlyteSchema, and StructuredDataset) in a pydantic BaseModel. | ||
``` | ||
|
||
```{note} | ||
To clone and run the example code on this page, see the [Flytesnacks repo][flytesnacks]. | ||
``` | ||
|
||
To begin, import the necessary dependencies: | ||
|
||
```{literalinclude} /examples/data_types_and_io/data_types_and_io/pydantic_basemodel.py | ||
:caption: data_types_and_io/pydantic_basemodel.py | ||
:lines: 1-9 | ||
``` | ||
|
||
Build your custom image with ImageSpec: | ||
```{literalinclude} /examples/data_types_and_io/data_types_and_io/pydantic_basemodel.py | ||
:caption: data_types_and_io/pydantic_basemodel.py | ||
:lines: 11-14 | ||
``` | ||
|
||
## Python types | ||
We define a `pydantic basemodel` with `int`, `str` and `dict` as the data types. | ||
|
||
```{literalinclude} /examples/data_types_and_io/data_types_and_io/pydantic_basemodel.py | ||
:caption: data_types_and_io/pydantic_basemodel.py | ||
:pyobject: Datum | ||
``` | ||
|
||
You can send a `pydantic basemodel` between different tasks written in various languages, and input it through the Flyte console as raw JSON. | ||
|
||
:::{note} | ||
All variables in a data class should be **annotated with their type**. Failure to do should will result in an error. | ||
::: | ||
|
||
Once declared, a dataclass can be returned as an output or accepted as an input. | ||
|
||
```{literalinclude} /examples/data_types_and_io/data_types_and_io/pydantic_basemodel.py | ||
:caption: data_types_and_io/pydantic_basemodel.py | ||
:lines: 26-41 | ||
``` | ||
|
||
## Flyte types | ||
We also define a data class that accepts {std:ref}`StructuredDataset <structured_dataset>`, | ||
{std:ref}`FlyteFile <files>` and {std:ref}`FlyteDirectory <folder>`. | ||
|
||
```{literalinclude} /examples/data_types_and_io/data_types_and_io/pydantic_basemodel.py | ||
:caption: data_types_and_io/pydantic_basemodel.py | ||
:lines: 45-86 | ||
``` | ||
|
||
A data class supports the usage of data associated with Python types, data classes, | ||
flyte file, flyte directory and structured dataset. | ||
|
||
We define a workflow that calls the tasks created above. | ||
|
||
```{literalinclude} /examples/data_types_and_io/data_types_and_io/pydantic_basemodel.py | ||
:caption: data_types_and_io/pydantic_basemodel.py | ||
:pyobject: basemodel_wf | ||
``` | ||
|
||
You can run the workflow locally as follows: | ||
|
||
```{literalinclude} /examples/data_types_and_io/data_types_and_io/pydantic_basemodel.py | ||
:caption: data_types_and_io/pydantic_basemodel.py | ||
:lines: 99-100 | ||
``` | ||
|
||
To trigger a task that accepts a dataclass as an input with `pyflyte run`, you can provide a JSON file as an input: | ||
``` | ||
pyflyte run \ | ||
https://raw.githubusercontent.com/flyteorg/flytesnacks/b71e01d45037cea883883f33d8d93f258b9a5023/examples/data_types_and_io/data_types_and_io/pydantic_basemodel.py \ | ||
basemodel_wf --x 1 --y 2 | ||
``` | ||
|
||
[flytesnacks]: https://github.com/flyteorg/flytesnacks/tree/master/examples/data_types_and_io/ |