-
Notifications
You must be signed in to change notification settings - Fork 10.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hybrid Cache API proposal #54647
Comments
Here we go: as usual I'll go back and forth between this and FusionCache to share my experience with it.
LGTM, one question though: can we expect all the "extra state" needed will be in the concrete impl (eg:
Perfect, no notes 👍
Sorry but I did not understand this part, can you elaborate more?
Makes sense, interfaces (at least as of today) are less evolvable, and the only other approach would be interface-per-featture, where new features will be defined in new interfaces that will be added over time and that consumers may check for support (like the opt-in buffered distributed cache), but that is not always possible and in the long run may lead to a lot of different interfaces. Watch out for people asking for interfaces nonetheless though, because testing etc (been there).
Imho it's worth exploring the idea, not so sure about how good it would work in practice over time: anybody knows of an example of a project that successfully used default interface members to evolve it over time without breakings?
In general looks good, but I don't see an overload of Also I don't see methods to specify the distributed cache to use: will it be picked automatically from DI (if one is registered)?
In FusionCache I solved this by having different methods on the builder, like If you are interested in some ideas, read here for more.
LGTM, but out of curiosity: why an interface here and not a class? Not that I necessarily would prefer one, but wouldn't the same rationale for Also, any plan in supporting the fluent builder approach without DI? I mean something like this; // OPTION 1
var builder = new HybridCacheBuilder()
.WithFoo()
.WithBar();
// OPTION 2 (like WebApplication.CreateBuilder)
var builder = HybridCache.CreateBuilder()
.WithFoo()
.WithBar();
var cache = builder.Build(); Currently FusionCache does not support it for... reasons... but I'm thinking about adding it and other libs have already done it.
One note about naming: since there are both
100% agree, there's a lot of value even just for that. After all that's the whole reason a library like Also, it seems the new memory cache thing (discussed here) will not have cache stampede protection, so it would be even more useful. Btw, about the L1: what will you use? The new memory cache mentioned above, the old
Agree, even though some people will end up asking for the sync version because sometimes that's the only thing you can do (luckily these places are fewer and fewer every day), just warning you.
Yes, good call, no absolute terms 👍
Does this mean that compression will be a cross-cutting concern implemented by the
Naming is hard. I would suggest a more specific
Naming is hard. I would suggest the use of "By" here:
Shouldn't
Good old Progressive Enhancement, this is the way.
One thing to note is that since the tags values will be a lot, that means high cardinality, which in the observability world (metrics in particular) can make systems easily explode.
Nice catch about being able to directly use One note though: does this design mean that if I provide my own serializer it will be used only for non-string/non-byte[] types or for any type?
Intuitive design 👍
Eh, this is delicate. I don't know if you have already thought about this, so I'll share my own exp on this. With this approach, since any reader can check the version signifier before proceeding, there will be no problems of corrupted entries: this is true. But the problem is that when upgrading a live system in the future (say from To share what has been my exp with FusionCache, what I did was use the version signifier as an additional prefix/suffix in the cache key used in the distributed cache: in this way More space consumed in the distributed cache? Yes, but only temporarily, and only IF the entire system is not updated at the same time (this will depend on each user's scenario). Of course I'm not necessarily saying this is a better design, but just exposing a different one for you to reason about (if you haven't already!).
Interesting! Haven't thought of this before.
So in the end you decided to go on with the invalidation by tag? How did you solve the problems highlighted previously. I'll re-quote here for brevity:
The example was for an hypothetical Basically, the gist of it was that it is basically impossible to do invalidation by tag(s) consistently by not really doing it but instead trying to simulate it by relying on a sort of "in-memory barrier" that will do additional checks before returning a value. That was because if the "invalidation dates" or similar would be stored in memory, and they would be wiped at the next restart of the app. Above you said "separately, the system maintains a cache of known tags and their last-invalidation-time" but still in memory? If so, the problems highlighted above still stands. Unless of course you came up with a different technique, in which case I'm really interested to discover what will be 😬
Interesting! It's something I thought about for some time but haven't ened up doing yet, so I was wodnering: what are you planning to do when the limits are crossed? Log it? Throw an exception? Skip/ignore?
Interesting, again. Never thought about this. |
Since the timestamp is stored in the cache itself, it will also be fetched from L2 and inserted to L1 on the first get after the app has been restarted. |
@andreaskromann I'm not following here, are you talking about each invalidation's timestamp? Where will it be stored? In a single cache entry for all the invalidations by tag(s)? One cache entry per tag? |
on tag expiration; I was deliberately deferring on that, but:
So yes, tag expiration will outlive process restart |
@jodydonetti I guess Marc explained it above, but yes one entry per tag. Regarding the API proposal, I was positively surprised so much of what we discussed made it into the proposal. It looks very promising. The auxiliary API for invalidations is still undefined, so it will be interesting to see. Another thing I noticed was that the concept of serving stale values with a background refresh (stale-while-revalidate) didn't make the cut. |
Stale with background refresh is highly desirable, and I'm confident it will get added later. One huge problem is the safety of the callback (vs escaping the execution path of the calling code), meaning we'll need this to be opt-in and contextual per usage. That's another reason for the |
I'll eagerly await to see which design will make it work reasonably, can't wait 😬
Mmmh... that's why I asked: it's not the first time I played with such approach (and others) but they never really worked, at least not in a reasonable way. Tags are stored inside each entry, sure, but when a user asks for an entry it does it by cache key, and that is the only thing we know upfront. So what happens is, with a concrete example for a get:
On top of this, should each "invalidation tag entry" go through the same cycle as any normal key? Eh, that's not necessarily so immediate to answer, and a fun one. Anyway, at this point I think it's better if I just stop here and wait for the design/api surface to get out, so I can reason on something concrete and don't waste your time in speculations or spoilers. |
|
|
Next step: address the feedback and put a clean copy of the API in a comment and/or PR. We can finalize over email/GH. |
merged: #55084 |
actually, I need to check the normal "who closes, when" - reopening |
@jodydonetti @joegoldman2 you both raised the |
API Approved (offline)! |
@mgravell and I know that you almost answered my notes some time ago but then the mobile app discarded your answer, and I see that this has been already closed, but you think you'll be able to find some time to type them again? ps: of course the parts related to tag invalidation are already answered in the other issue, which I'll answer to in a moment. |
cool and thanks for replay 👍 |
Hello, check the flags property of the entry options:
https://source.dot.net/#Microsoft.Extensions.Caching.Abstractions/Hybrid/HybridCacheEntryFlags.cs,0796b099317330da
SongOfYouth ***@***.***> ezt írta (időpont: 2024. okt. 31.,
Cs 2:38):
… Can we indicate only lcoal or distributed when set value? not all cache
need to be distributed even a IDistributedCache has been registered.
—
Reply to this email directly, view it on GitHub
<#54647 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAL7DERWVTIUYUGJBN67OSLZ6GC2DAVCNFSM6AAAAABE7YFXCKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDINBYHAZTEMBYGI>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
The key not be saved to garnet when registered builder.Services.AddHybridCache(options =>
{
options.MaximumPayloadBytes = 1024 * 1024;
options.MaximumKeyLength = 1024;
options.DefaultEntryOptions = new HybridCacheEntryOptions
{
Expiration = TimeSpan.FromMinutes(5),
LocalCacheExpiration = TimeSpan.FromMinutes(5)
};
}); builder.Services.AddStackExchangeRedisCache(options =>
{
options.Configuration =
builder.Configuration.GetConnectionString("RedisConnectionString");
}); when i call the |
@SongOfYouth Are you using the V9 (preview) version of the Microsoft.Extensions.Caching.StackExchangeRedis library? (the V8 version doesn't work with Garnet, but the V9 version does) |
You are right, i tried the pre version and i works. |
May be it is necessary to provide a cache.GetAsync(key, typeof(MyType)<,...>); because i don't know the real type when use it to AOP caching. |
I have a use case that i need to check for item availability in the cache, without adding an item in case of not found, so is it possible to expose the method that check for item existence to be public? |
@mgravell I'm trying the latest version from NuGet and there is no GetAsync. It's advertised above in the API definition but seems missing. I really need it in a project where only want to look up a value in the cache or return "still processing". To be precise I'd like to see a TryGetAsync which ist safe if the value is not available. Also I'd like to see a GetOrCreate overload that doesn't have to provide a value in the create phase. We have some tasks where we do the heavy work and only at the we know if this can be cached or not. |
You can mimic this functionality with the flags. But I absolutely agree
with you about those features being part of the API directly.
Steffen Forkmann ***@***.***> ezt írta (időpont: 2024. nov.
19., K 8:11):
… @mgravell <https://github.com/mgravell> I'm trying the latest version
from NuGet and there is no GetAsync. It's advertised above in the API
definition but seems missing.
I really need it in a project where only want to look up a value in the
cache or return "still processing". To be precise I'd like to see a
TryGetAsync which ist safe if the value is not available.
Also I'd like to see a GetOrCreate overload that doesn't have to provide a
value in the create phase. We have some tasks where we do the heavy work
and only at the we know if this can be cached or not.
—
Reply to this email directly, view it on GitHub
<#54647 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAL7DEQIYQLWH2CPV3S4MPT2BLQDZAVCNFSM6AAAAABE7YFXCKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDIOBUHA3DEOBUGM>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
I don't get the part |
Look at the Falgs |
The problem is: the flags are provided before the task runs. But I only know if we need to cache from within the task. |
In general I'd prefer not to link to other libs in other people's repo, even less so if the other lib is mine, but since this is a Microsoft repo and the lib is free/OSS and FusionCache has been named even in the DotNetConf presentation (thanks again Marc!) I'll make an exception. So anyway: in FusionCache you have both NOTE: I don't know what the result would be when skipping the factory. I suppose a Hope this helps. Hi @forki and @dmm-l-mediehus
I think they removed that method in the final bits of the abstraction, and I think (speculation) it's because it would have exposed the entry type (eg: the envelope, not the value type itself) which is better to keep internal-only, and FWIW I think they made the right call. Anyway you can get kinda the same result with the
See above, hope this helps. Hi @forki
Based on the public api surface area, I think this is not possible with HybridCache. Having said that in FusionCache it's possible, and it's called Adaptive Caching. Hope this helps. I'll now shamefully retreat in my corner 🫣 |
Does HybridCache have something like FusionCahce's backplane? It is important for synchronizing cache between nodes |
Not sure if this is the right place to post this, but I'm using the preview version (9.0.0-preview.9.24556.5) with only the local in memory cache I'm trying to use the RemoveByTag functionality, like so: But this does not remove entries with specified tag. Is this supposed to work or still work in progress? |
|
Thanks. Then I will patiently wait for it to be implemented. Tags for in memory cache is the main features that I am excited for. |
Not yet |
Not yet, this is a feature under development yet. |
@forki This is how I've implemented a public async ValueTask<TItem?> GetValueAsync<TItem>(string key, CancellationToken cancellationToken = default) =>
await cache.GetOrCreateAsync<TItem?>(key, factory: null!,
new HybridCacheEntryOptions()
{
Flags = HybridCacheEntryFlags.DisableUnderlyingData
},
cancellationToken: cancellationToken); To call the method in your code: var value = await _cache.GetValueAsync<Guid?>(key, cancellationToken);
if (value != null)
{
// The key was found in cache
} The crucial part that allows you to determine if the key exists in cache is passing a nullable type to |
The problem is that, when the type you need to work with is actually nullable (either a nullable value type like |
is this correct? Flags = HybridCacheEntryFlags.DisableLocalCacheWrite | HybridCacheEntryFlags.DisableDistributedCacheWrite ? |
Getting the following error when using these two flags
This is caused by HybridCache attempting to call the null callback that I've provided in my previous example. |
I think it should be the 3 of them together: Flags =
HybridCacheEntryFlags.DisableUnderlyingData
| HybridCacheEntryFlags.DisableLocalCacheWrite
| HybridCacheEntryFlags.DisableDistributedCacheWrite; One to skip the factory, one to skip the memory write and one to skip the distributed write. |
When using the
Do you plan to add a similar reload to the |
Please add the |
If I'm not wrong the api surface area went GA with .NET 9 in November, meaning it is now frozen: now only the implementation can be changed. |
Is there a way to configure an app to use two different |
Can this class be preloaded? |
This is the API proposal related to Epic: IDistributedCache updates in .NET 9
Hybrid Cache specification
Hybrid Cache is a new API designed to build on top of the existing
Microsoft.Extensions.Caching.Distributed.IDistributedCache
API, to fill multiple functional gaps in the usability of theIDistributedCache
API,including:
Overview
The primary API is a new
abstract
class,HybridCache
, in a newMicrosoft.Extensions.Caching.Distributed
package:This type acts as the primary API that users will interact with for caching using this feature, replacing
IDistributedCache
(which now becomes a backend API); the purpose ofHybridCache
is to encapsulate the state required to implement new functionality. This required additional state means that the feature cannot be implemented simply as extension methods
on top of
IDistributedCache
- for example for stampede protection we need to track a bucket of in-flight operations so that we can join existing backend operations. Every feature listed(except perhaps for the pass-thru API usage) requires some state or additional service.
Microsoft will provide a concrete implementation of
HybridCache
via dependency injection, but it is explicitly intended that the API can be implemented independently if desired.Why "Hybrid Cache"?
This name seems to capture the multiple roles being fulfilled by the cache implementation. A number of otions have been considered, including "read thru cache",
"advanced cache", "distributed cache 2"; this seems to work, though.
Why not
IHybridCache
?providing this at the definition level halves this aspect of the API surface for concrete implementations, providing a consistent experince
IHybridCache
it is harder to extend than with an abstract base class thatcan implement features with default implementations that implementors can
override
as desiredIt is noted that in both cases, "default interface methods", also serve this function; if provide a mechanism to achieve this same goal with an
IHybridCache
approach.If we feel that "default interface methods" are now fully greenlit for this scenario, we could indeed use an
IHybridCache
approach.Registering and configuring
HybridCache
Registering hybrid cache is performed by
HybridCacheServiceExtensions
:where
IHybridCacheBuilder
here functions purely as a wrapper (via.Services
) to provide contextual API services to configure related services such as serialization,for API discoverability, for example making it trivial to configure serialization, rather than having to magically know about the existence of specific services that
can be added to influence behaviour. The return value is the same input services collection, for chaining purposes.
The
HybridCacheOptions
provides additional global options for the cache, including payload max quota and a default cache configuration (primarily: lifetime).The user will often also wish to register an out-of-process
IDistributedCache
backend (Redis, SQL Server, etc) in the usual manner, asdiscussed here. Note that this is not required; it is anticipated that simply having
the L1 cache with stampede protection against the backend provides compelling value. Options specific to the chosen
IDistributedCache
backend willbe configured as part of that
IDistributedCache
registration, and are not considered here.Using
HybridCache
The
HybridCache
instance will be dependency-injected into code that requires them; from there, the primary API isGetOrCreateAsync
which providesa stateless and stateful overload pair:
It should be noted that all APIs are designed as
async
, withValueTask<T>
used to respect that values may be availablesynchronously (in the cache-hit case); however, the fact that we're caching means we can reasonably assume this operation
will be non-trivial, and possibly one or both of an an out-of-process backend store call (with non-trivial payload) and an underlying data fetch (with non-trivial total time);
async
is strongly desirable.The simplest use-case is the stateless option, typically used with a lambda callback using "captured" state, for example:
The
GetOrCreateAsync
name is chosen for parity withIMemoryCache
; ittakes a
string
key, and a callback that is used to fetch the underlying data if it is not available in any other cache. In some high throughput scenarios, it may bepreferable to avoid this capture overhead using a
static
callback and the stateful overload:Optionally, this API allows:
HybridCacheEntryOptions
, controlling the duration of the cache entry (see below)For the options, timeout is only described in relative terms:
The
Flags
also allow features such as specific caching tiers or compression to be electively disabled on a per-scenario basis. It will directed thatentry options should usually be shared (
static readonly
) and reused on a per-scenario basis. To this end, the type is immutable. If nooptions
is supplied,the default from
HybridCacheOptions
is used; this has an implied "reasonable" default timeout (low minutes, probably) in the eventuality that none is specified.In many cases,
GetOrCreateAsync
is the only API needed, but additionally,HybridCache
has auxiliary APIs:These APIs provide for explicit manual fetch/assignment, and for explicit invalidation at the
key
ortag
level.The
HybridCacheEntry
type is used only to encapsulate return state forGetAsync
; anull
response indicatesa cache-miss.
Backend services
To provide the enhanced capabilities, some new additional services are required;
IDistributedCache
has both performance and feature limitations that make it incomplete for this purpose. Forout-of-process caches, the
byte[]
nature ofIDistributedCache
makes for allocation concerns, so a new API is optionally supported, based on similar work for Output Cache; however, the system functions without demanding it and allpre-existing
IDistributedCache
implementations will continue to work. The system will type-test for the new capability:If the
IDistributedCache
service injected also implements this optional API, these buffer-based overloads will be used in preference to thebyte[]
API. We will absorb the workto implement this API efficiently in the Redis implementation, and advice on others.
This feature has been prototyped using a FASTER backend cache implementation; it works very well:
(the top half of the table uses
IDistributedCache
; the bottom half usesIBufferDistributedCache
, and assumes the caller will utilize pooling etc, which hybrid cache: will)Similarly, invalidation (at the
key
andtag
level) will be implemented via an optional auxiliary service; however this API is still in design and is not discussed here.It is anticipated that cache hit/miss/etc usage metrics will be reported via normal profiling APIs. By default this
will be global, but by enabling
HybridCacheOptions.ReportTagMetrics
, per-tag reporting will be enabled.Serializer configuration
By default, the system will "just work", with defaults:
string
will be treated as UTF-8 bytesbyte[]
will be treated as raw bytesSystem.Text.Json
, as a reasonable in-box experienceHowever, it is important to be able to configure other serializers. Towards this, two serialization APIs are proposed:
With this API, serializers can be configured at both granular and coarse levels using the
WithSerializer
andWithSerializerFactory
APIs at registration;for any
T
, if aIHybridCacheSerializer<T>
is known, it will be used as the serializer. Otherwise, the set ofIHybridCacheSerializerFactory
entrieswill be enumerated; the last (i.e. most recently added/overridden) factory that returns
true
and provides aserializer
: wins (this value may be cached),with that
serializer
being used. This allows, for example, a protobuf-net serializer implementation to detect types marked[ProtoContract]
, orthe use of
Newtonsoft.Json
to replaceSystem.Text.Json
.Binary payload implementation
The payload sent to
IDistributedCache
is not simply the raw buffer data; it also contains header metadata, to include:1
:All times are managed via
TimeProvider
. Upon fetching an entry from the cache, the expiration is compared using the current time;expired entries are discarded as though they had not been received (this avoids a problem with time skew between in-process
and out-of-process stores, although out-of-process stores are still free to actively expire items).
Separately, the system maintains a cache of known tags and their last-invalidation-time (in absolute terms); if a cache entry has any tag that has a
last-invalidation-time after the creation time of the cache entry, then it is discarded as though it had not been received. This
effectively implements "tag" expiration without requiring that a backend is itself capable of categorized/"tagged" deletes (this feature is
not efficient or effective to implement in Redis, for example).
Additional implementation notes and assumptions
null``, non-empty
string` valuesIDistributedCache
with mismatches logged and the entry discarded)
<T>
) what they are requesting; if this is incorrect forthe received data, an error may occur
IDistributedCache
registration, and the backend store is secure from tampering and exfiltration. Specifically: the data will not be additionally encrypted
foo
andFOO
are separate;a-b
anda%2Db
are separate, etc; if the data retrievedhas a non-matching
key
, it will be logged and discardedstring
comparer and will apply safe logic; it will not be possible to specify a custom comparerwhere n := number of chars in the key and m := number of bytes in the value
IDistributedCache
is not explicitly documented (it is an implementation detail), and should be treated as an opaque BLOB<Foo>
and<Bar>
(different types) with the same cache key: the behaviour is undefined (it may or may not error, depending on the serializer and type compatibility); likewise, if a type is heavily refactored (i.e. in a way that impacts serializer compatibility) without changing the cache key: the behaviour is undefinedThe text was updated successfully, but these errors were encountered: