Skip to content

Integrate Context-Aware Chunking and PDF Support #747

Integrate Context-Aware Chunking and PDF Support

Integrate Context-Aware Chunking and PDF Support #747

Triggered via pull request October 8, 2024 19:40
Status Failure
Total duration 2m 54s
Artifacts

lint.yml

on: pull_request
Matrix: lint
lint-workflow-complete
0s
lint-workflow-complete
Fit to window
Zoom out
Zoom in

Annotations

12 errors and 10 warnings
ruff
Process completed with exit code 3.
mypy
Process completed with exit code 1.
pylint: src/instructlab/sdg/utils/chunkers.py#L13
E0401: Unable to import 'docling.datamodel.base_models' (import-error)
pylint: src/instructlab/sdg/utils/chunkers.py#L14
E0401: Unable to import 'docling.datamodel.document' (import-error)
pylint: src/instructlab/sdg/utils/chunkers.py#L15
E0401: Unable to import 'docling.document_converter' (import-error)
pylint: src/instructlab/sdg/utils/chunkers.py#L34
E0211: Method 'chunk_documents' has no argument (no-method-argument)
pylint: src/instructlab/sdg/utils/chunkers.py#L137
E1121: Too many positional arguments for method call (too-many-function-args)
pylint: src/instructlab/sdg/utils/chunkers.py#L161
E0213: Method '_num_chars_from_tokens' should have "self" as first argument (no-self-argument)
pylint: src/instructlab/sdg/utils/chunkers.py#L210
E1121: Too many positional arguments for method call (too-many-function-args)
pylint: src/instructlab/sdg/utils/chunkers.py#L554
E0213: Method 'export_documents' should have "self" as first argument (no-self-argument)
pylint: src/instructlab/sdg/utils/taxonomy.py#L14
E0401: Unable to import 'docling_parse.docling_parse' (import-error)
pylint: src/instructlab/sdg/utils/taxonomy.py#L282
E0606: Possibly using variable 'document_contents' before assignment (possibly-used-before-assignment)
pylint: src/instructlab/sdg/utils/chunkers.py#L71
C0303: Trailing whitespace (trailing-whitespace)
pylint: src/instructlab/sdg/utils/chunkers.py#L435
C0303: Trailing whitespace (trailing-whitespace)
pylint: src/instructlab/sdg/utils/chunkers.py#L49
C0123: Use isinstance() rather than type() for a typecheck. (unidiomatic-typecheck)
pylint: src/instructlab/sdg/utils/chunkers.py#L55
W1309: Using an f-string that does not have any interpolated variables (f-string-without-interpolation)
pylint: src/instructlab/sdg/utils/chunkers.py#L108
W1309: Using an f-string that does not have any interpolated variables (f-string-without-interpolation)
pylint: src/instructlab/sdg/utils/chunkers.py#L114
W0221: Number of parameters was 0 in 'ChunkerBase.chunk_documents' and is now 1 in overriding 'TextSplitChunker.chunk_documents' method (arguments-differ)
pylint: src/instructlab/sdg/utils/chunkers.py#L165
R0902: Too many instance attributes (9/7) (too-many-instance-attributes)
pylint: src/instructlab/sdg/utils/chunkers.py#L194
W0221: Number of parameters was 0 in 'ChunkerBase.chunk_documents' and is now 1 in overriding 'SemanticChunker.chunk_documents' method (arguments-differ)
pylint: src/instructlab/sdg/utils/chunkers.py#L380
W0612: Unused variable 'i' (unused-variable)
pylint: src/instructlab/sdg/utils/chunkers.py#L382
W0612: Unused variable 'j' (unused-variable)