An alternative to Riak Counters with idempotent writes within a client defined window.
Below are a few documents that are relevant to this gem, please read before considering using this gem for anything important.
Document Link | Description |
---|---|
[docs/riak_counter_drift.md] | Why Riak Counters may or may not work for your use case (Counter Drift). |
[docs/implementation.md] | Implementation details about this gem as well as some of the reasoning behind the approach. |
[docs/usage.md] | Suggested usage of this gem from your application, and implications of changing various settings. |
[docs/release_notes.md] | Information about what changed in each version |
Why shouldn't I use Riak Counters?
CRDT PNCounters (two plain GCounters) such as Riak Counters are non-idempotent and store nothing about a counter request_id other than the final value. This means that if an increment operation fails in any number of ways (500 response from server, process that made the call dies, network connection is interrupted, operation times out, etc), your application now has no idea whether or not the increment actually happened.
What is Counter Drift?
In the above situation of a failed increment operation, your application has two choices:
- Retry the operation: This could result in the operation occuring twice causing what is called positive counter drift
- Don't retry the operation: This could result in the operation never occuring at all causing negative counter drift
As such it doesn't make sense to use plain GCounters or PNCounters to store any counter that needs to be accurate.
More information about Riak Counters and Drift: [docs/riak_counter_drift.md]
The data type implemented is a PNCounter CRDT with an ordered array of request_ids for each GCounter actor. Request ids are stored with the GCounter, so operations against this counter are idempotent while the request_id remains in any actor's array.
High Level API
Function | Description |
---|---|
Riak::Ledger.new |
Creates a new Ledger instance |
Riak::Ledger.find! |
Finds an existing Ledger in Riak, merges it locally, and then writes the merged value back to Riak |
#credit! , #debit! , #update! |
Reads the existing state of the ledger from Riak, merges it locally, and adds a new request_id and positive or negative value |
Ledger Options
Name | Description |
---|---|
:retry_count [Integer] |
When a write to Riak is a "maybe" (500, timeout, or any other error condition), resubmit the request :retry_count number of times, and return false if it is still unsuccessful |
:history_length [Integer] |
Keep up to :history_length number of request_ids in each actor's section of the underlying GCounter. When the (:history_length + 1)th request_id is written then merged, add the oldest request_id's value to the actor's total |
More information about the implementation and how edge cases can be avoided: [docs/implementation.md]
Depending on your use case, you may want to tweak the configuration options :history_length
and :retry_count
.
The default :history_length
is 10. This means that if a request_id fails, but your application is unable to determine whether or not the counter was actually incremented, you have a buffer space or window of 9 additional request_ids on that counter before you can no longer retry the original failed request_id without assuming counter drift is happening.
The default :retry_count
is also 10. This means that if a request_id fails, the actor that attempted the request_id will continue trying 9 more times. If the request to change the counter still fails after the 10th try, the operation will return false
for failure. At this point your application can attempt to try the request_id again, or return a failure to the user with a note that the request_id will be retried in the future.
An example of a failure might look like the following:
-
request_id1 fails with actor1, and because of the nature of the failure, your application is unsure whether or not the counter was actually incremented.
- If your
:retry_count
is low, you can quickly determine in your application that something went wrong, and inform the user that the request_id was unsuccessful for now, but will be attempted later - If your
:retry_count
is high, the user will be kept waiting longer, but the odds of the request_id eventually working are higher
- If your
-
If after the initial retries, the request_id was still a failure, your application must decide what to do next
- If your
:history_length
is low, your options are limited. You must continue to retry that same failed request_id for that user (using any available actor) until it is successful. If you allow additional request_ids to take place on the same counter before retrying, you run a high risk of counter drift. - If your
:history_length
is medium-high, then you have an allowance of (:history_length
- 1) additional request_ids for that counter before you run the risk of counter drift.
- If your
Note
This gem cannot guarentee request_id idempotence of a counter for greater than :history_length
number of request_ids.
More information about configuration and implications of changing various settings: [docs/usage.md]
Document Link | Description |
---|---|
[http://hal.upmc.fr/docs/00/55/55/88/PDF/techreport.pdf] | CRDT paper from Shapiro et al. at INRIA |
[http://basho.com/counters-in-riak-1-4/] | Riak Counters |
[github.com/basho/riak_dt] | Other Riak Data Types |
Add this line to your application's Gemfile:
gem 'riak-ruby-ledger'
And then execute:
$ bundle
Or install it yourself as:
$ gem install riak-ruby-ledger
require 'riak' # riak-client gem
require 'ledger' # riak-ruby-ledger gem
# Name each of your threads
Thread.current["name"] = "ACTOR1"
# Create a Riak::Client instance
client = Riak::Client.new pb_port: 8087
# Get the Riak::Bucket instance
bucket = client["ledgers"]
# Set allow_mult to true
bucket.allow_mult = true unless bucket.allow_mult
# Default option values
options = {
:actor => Thread.current["name"], # Actor ID, one per thread or serialized writer
:history_length => 10, # Number of request_ids to store per actor per type (credit or debit)
:retry_count => 10 # Number of times to retry Riak requests if they fail
}
# Create the ledger object
ledger = Riak::Ledger.new(bucket, "player_1", options)
ledger.credit!(50, "request_id1")
ledger.value # 50
ledger.debit!(10, "request_id2")
ledger.value # 40
ledger.debit!(10, "request_id2")
ledger.value # 40
If no request id is given, a UUID will be generated. If a credit! or debit! call fails, the generated request id will be provided for retries
ledger.credit!(50)
ledger.value # 50
ledger.debit!(10)
ledger.value # 40
ledger.debit!(10)
ledger.value # 40
ledger = Riak::Ledger.find!(client["ledgers"], "player_1", options)
ledger.value # 40
If a call to #debit!
or #credit!
does not return false, then the request_id can be considered saved, because it would have retried otherwise. Still, for debugging, testing, or external failure policies, #has_request_id?
is also exposed
ledger.has_request_id? "request_id2" # true
ledger.has_request_id? "request_id1" # true
For this example, the :history_length
is lowered to 3 so it gets reached faster.
options = {:history_length => 3}
ledger = Riak::Ledger.new(client["ledgers"], "player_2", options)
ledger.credit!(10, "req1")
ledger.credit!(10, "req2")
ledger.credit!(10, "req3")
ledger.credit!(10, "req4")
ledger.credit!(10, "req5")
ledger.value #60
ledger.has_request_id? "req1" #false
ledger.has_request_id? "req2" #false
ledger.has_request_id? "req3" #true
# req3 is still in the history because the most previous write does not trigger a merge of the actor's total
# Performing a find! will trigger the merge however
ledger = Riak::Ledger.find!(client["ledgers"], "player_2", options)
ledger.has_request_id? "req3" #false
ledger.has_request_id? "req4" #true
ledger.has_request_id? "req5" #true
ledger.has_request_id? "req6" #true
ledger.delete()
- Fork it
- Create your feature branch (
git checkout -b my-new-feature
) - Commit your changes (
git commit -am 'Add some feature'
) - Push to the branch (
git push origin my-new-feature
) - Create new Pull Request