Skip to content

Commit

Permalink
Merge pull request #123 from biscuit-auth/perf-doc
Browse files Browse the repository at this point in the history
first draft of advice for authorization perf optimization
  • Loading branch information
divarvel authored Nov 8, 2023
2 parents c5bdc72 + 71d23cf commit 4f3da38
Show file tree
Hide file tree
Showing 2 changed files with 58 additions and 0 deletions.
1 change: 1 addition & 0 deletions docs/src/SUMMARY.md
Original file line number Diff line number Diff line change
Expand Up @@ -29,6 +29,7 @@
- [Interoperability & Reusability](./recipes/interoperability-reusability.md)
- [Role Based Access Control](./recipes/role-based-access-control.md)
- [Per request attenuation](./recipes/per-request-attenuation.md)
- [Authorization performance](./recipes/authorization-performance.md)

## Reference

Expand Down
57 changes: 57 additions & 0 deletions docs/src/recipes/authorization-performance.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,57 @@
# Authorization performance

Authorization is likely part of your request handling hot path. As such, it is natural to try and make it as fast as possible.

## The first rule of performance optimization

> Don't do it
Benchmarks done with biscuit-rust show that the whole process (parsing, signatures verification and datalog generation and evaluation) usually clocks in at around one millisecond. In a lot of cases, that will not be a bottleneck and thus not where you should work on performance optimization. That being said, in some cases you will need to optimize the authorization process, so this page is there to help you do so.

## Authorization process breakdown

The authorization process can be broken down in 4 parts:

- parsing;
- signature verification;
- datalog generation;
- datalog evaluation.

Parsing is typically one of the fastest steps; it depends only on the token size. Signature verification is where most of the time is spent; it depends mostly on the number of blocks (and their size). Datalog generation and datalog evaluation happen in tandem. That's the part where you have the most leverage. Datalog generation purely depends on how your application is designed. In many cases it can be done statically and thus have a negligible contribution to the overall runtime. Datalog evaluation depends on the actual datalog code that is evaluated.

## Measure

When it comes to performance optimization, the first step is always to measure the execution time of each step. First to determine if optimization is even needed, then to quantify progress.
This part entirely depends on your tech stack. You can start with coarse-grained traces telling you how long the whole authorization process takes, and then only dig down if optimization is needed.

Datalog generation is not likely to be the bottleneck in simple cases, with static authorization rules. However, if your datalog generation involves database queries and complex generation logic, then you have optimization opportunities. Large or complex datalog rule sets can take time to evaluate, making datalog evaluation a good target for optimization.
There might be a balance between datalog generation and evaluation (i.e. making the datalog generation process more complex in order to simplify evaluation), so optimizations should always be considered over the whole authorization process.

## Datalog performance contributors

As stated above, there are a lot of external factors that contribute to the final time and resource costs of the authorization process.

Other things being equal, some elements in datalog code tend to have a disproportionate effect on performance. This section lists the most common ones, in order to help you find the source of slowdowns.

With biscuit-rust, you can see how much time was spent evaluating datalog in an authorizer with `Authorizer.execution_time()`. This does not replace performance measurements, but can give you a simple way to compare datalog snippets. Authorizer snapshots carry this information and can be inspected with biscuit-cli through `biscuit inspect-snapshot` or with the web inspector.

### Number of rules

The number of rules is a direct contributor to evaluation performance. The datalog engine tries to match every rule with every fact to produce new facts, and then tries again with new facts until no new facts are produced.

- authorization contexts with a lot of rules will take more time to compute
- rules generating facts matched by other rule will require more iterations before convergence

### Expression evaluation

_This part is implementation-dependent, advice applies primarily to the rust implementation._

Rules can contain expressions, that are evaluated after facts are matched. The biscuit specification describes an evaluation strategy based on a stack machine, which aims at providing a fast evaluation.

#### Expensive operations

Operations on booleans, integers and dates are really simple operations and thus quite fast. Same for string equality, thanks to string interning (comparing two strings for equality is turned into an equality test on integers). Other string operations like prefix / suffix / substring tests are a bit more costly. Regex matching tends to be the worst offender, especially when there are a lot of different regexes. Regex compilation is memoized, so the cost can be amortized when one regex is used to match against several strings. However, if several regexes are matched against a single string, then the regex compilation costs will not be amortized.

#### Splitting expressions

Expressions are tried in order. If an expression evaluates to false (or fails to evaluate), other expressions are not evaluated. Splitting simple conditions and placing them first allows rules to fail fast by only evaluating complex operations when needed.

0 comments on commit 4f3da38

Please sign in to comment.