Skip to content

Implement Content-Code for binary data.  #89

@titusz

Description

@titusz

We could extract printable strings (with different encodings) from all kinds of binary data like executables or custom binary formats with https://github.com/getreu/stringsext ... and create a text similarity signature.

The question is if we still call this Content-ID-Text of if we create a custom Content-ID-Binary that signals that text was extracted from a binary format without any format-specific structured parsing.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions