crdts for morta s why haven t offline first apps taken
play

CRDTs for Mortas Why havent offline-first apps taken off? Syncing - PowerPoint PPT Presentation

CRDTs for Mortas Why havent offline-first apps taken off? Syncing is hard You must admit that local apps are a distributed system Actual https://actualbudget.com available offline must be fast focus on privacy arbitrary


  1. CRDTs for Mortas

  2. Why haven’t “offline-first” apps taken off?

  3. Syncing is hard

  4. You must admit that local apps are a distributed system

  5. Actual — https://actualbudget.com

  6. available offline must be fast focus on privacy arbitrary queries … a local app!

  7. a mobile app! uh oh, syncing?

  8. simple data (510 MB database) syncing on top of sqlite

  9. unreliable ordering conflicts also… needs to work 100% of the time

  10. unreliable ordering conflicts

  11. { x: 2 } { y: 4 } { x: 3 } { z: 10 }

  12. State X State X eventual consistency

  13. how do we solve unreliable ordering?

  14. we need to assign timestamps

  15. “Time is relative; its only worth depends upon what we do as it is passing.”  Albert Einstein

  16. 1 3 2 4

  17. • vector clock • hybrid logical clock (HLC * • per-device • assigns “timestamps” to changes * https://cse.buffalo.edu/tech-reports/201404.pdf

  18. { x: 3, timestamp: “2019-11-05T15:29:40.273Z-0001-eede1195b7d94dd5" } { x: 5, timestamp: “2019-11-04T15:35:40.273Z-0001-85b8c0d2bbb57d99" }

  19. “Simplicity is a prerequisite for reliability.”  Edsger W. Dijkstra

  20. class MutableTimestamp extends Timestamp { // clock.js setMillis(n) { let _clock = null ; this ._state.millis = n; } function setClock(clock) { // Calculate the next logical time and counter. _clock = clock; setCounter(n) { // Ensure that the logical time never goes backward; } this ._state.counter = n; // * if all logical clocks are equal, increment the max counter, } // * if max = old > message, increment local counter, function getClock() { // * if max = messsage > old, increment message counter, return _clock; setNode(n) { // * otherwise, clocks are monotonic, reset counter } this ._state.node = n; var lNew = Math.max(Math.max(lOld, phys), lMsg); } var cNew = function makeClock(timestamp, merkle = {}) { } lNew === lOld && lNew === lMsg return { timestamp: MutableTimestamp. from (timestamp), merkle }; ? Math.max(cOld, cMsg) + 1 } MutableTimestamp. from = timestamp => { : lNew === lOld return new MutableTimestamp( ? cOld + 1 function serializeClock(clock) { timestamp.millis(), : lNew === lMsg return JSON.stringify({ timestamp.counter(), ? cMsg + 1 timestamp: clock.timestamp.toString(), timestamp.node() : 0; merkle: clock.merkle ); }); }; // Check the result for drift and counter overflow } if (lNew - phys > config.maxDrift) { // Timestamp generator initialization throw new Timestamp.ClockDriftError(); function deserializeClock(clock) { // * sets the node ID to an arbitrary value } const data = JSON.parse(clock); // * useful for mocking/unit testing if (cNew > 65535) { return { Timestamp.init = function (options = {}) { throw new Timestamp.OverflowError(); timestamp: Timestamp. from (Timestamp.parse(data.timestamp)), if (options.maxDrift) { } merkle: data.merkle config.maxDrift = options.maxDrift; }; } // Repack the logical time/counter } }; clock.timestamp.setMillis(lNew); clock.timestamp.setCounter(cNew); function makeClientId() { /** return uuidv4() * Timestamp send. Generates a unique, monotonic timestamp suitable return new Timestamp( .replace(/-/g, '') * for transmission to another system in string format clock.timestamp.millis(), .slice(-16); */ clock.timestamp.counter(), } Timestamp.send = function (clock) { clock.timestamp.node() // Retrieve the local wall time ); var phys = Date.now(); }; // timestamp.js var config = { // Unpack the clock.timestamp logical time and counter /** // Maximum physical clock drift allowed, in ms var lOld = clock.timestamp.millis(); * Converts a fixed-length string timestamp to the structured value maxDrift: 60000 var cOld = clock.timestamp.counter(); */ }; Timestamp.parse = function (timestamp) { // Calculate the next logical time and counter if ( typeof timestamp === 'string') { class Timestamp { // * ensure that the logical time never goes backward var parts = timestamp.split('-'); constructor (millis, counter, node) { // * increment the counter if phys time does not advance if (parts && parts.length === 5) { this ._state = { var lNew = Math.max(lOld, phys); var millis = Date.parse(parts.slice(0, 3).join('-')).valueOf(); millis: millis, var cNew = lOld === lNew ? cOld + 1 : 0; var counter = parseInt(parts[3], 16); counter: counter, var node = parts[4]; node: node // Check the result for drift and counter overflow if (!isNaN(millis) && !isNaN(counter)) }; if (lNew - phys > config.maxDrift) { return new Timestamp(millis, counter, node); } throw new Timestamp.ClockDriftError(lNew, phys, config.maxDrift); } } } valueOf() { if (cNew > 65535) { return null ; return this .toString(); throw new Timestamp.OverflowError(); }; } } Timestamp.since = isoString => { toString() { // Repack the logical time/counter return isoString + '-0000-0000000000000000'; return [ clock.timestamp.setMillis(lNew); }; new Date( this .millis()).toISOString(), clock.timestamp.setCounter(cNew); ( Timestamp.DuplicateNodeError = class extends Error { '0000' + return new Timestamp( constructor (node) { this .counter() clock.timestamp.millis(), super (); .toString(16) clock.timestamp.counter(), this .type = 'DuplicateNodeError'; .toUpperCase() clock.timestamp.node() this .message = 'duplicate node identifier ' + node; ).slice(-4), ); } ('0000000000000000' + this .node()).slice(-16) }; }; ].join('-'); } // Timestamp receive. Parses and merges a timestamp from a remote Timestamp.ClockDriftError = class extends Error { // system with the local timeglobal uniqueness and monotonicity are constructor (...args) { millis() { // preserved super (); return this ._state.millis; Timestamp.recv = function (clock, msg) { this .type = 'ClockDriftError'; } var phys = Date.now(); this .message = ['maximum clock drift exceeded'].concat(args).join(' '); } counter() { // Unpack the message wall time/counter }; return this ._state.counter; var lMsg = msg.millis(); } var cMsg = msg.counter(); Timestamp.OverflowError = class extends Error { constructor () { node() { // Assert the node id and remote clock drift super (); return this ._state.node; if (msg.node() === clock.timestamp.node()) { this .type = 'OverflowError'; } throw new Timestamp.DuplicateNodeError(clock.timestamp.node()); this .message = 'timestamp counter overflow'; } } hash() { if (lMsg - phys > config.maxDrift) { }; return murmurhash.v3( this .toString()); throw new Timestamp.ClockDriftError(); } } setClock(makeClock( new Timestamp(0, 0, makeClientId()))); } // Unpack the clock.timestamp logical time and counter var lOld = clock.timestamp.millis(); var cOld = clock.timestamp.counter();

  21. unreliable ordering conflicts

  22. manual conflict resolution

  23. manual conflict resolution

  24. CRDTs

  25. partially ordered monoid in the category of endofunctors with a least upper bound

  26. conflict-free replicated data types GCounter LWWElement-Set PNCounter ORSet GSet ORSWOT ??? 2PSet and more…

  27. conflict-free replicated data types commutative idempotent 2  3  3  2  5 f(x) f(x) f(x) f(x) f(x) f(x)

  28. { x: 300, timestamp: “2019-11-05T15:29:40.273Z-0000-eede1195b7d94dd5” } { y: 73, timestamp: “2019-11-02T15:35:32.743Z-0000-85b8c0d2bbb57d99" } { x: 8, timestamp: “2019-11-02T15:35:32.743Z-0001-85b8c0d2bbb57d99" } { z: 114, timestamp: “2019-11-02T15:35:32.743Z-0002-85b8c0d2bbb57d99" } { x: 300, y: 73, z: 114 } { x: 300, y: 73 } { x: 300 } {}

  29. { x: 300, timestamp: “2019-11-05T15:29:40.273Z-0000-eede1195b7d94dd5” } { y: 73, timestamp: “2019-11-02T15:35:32.743Z-0000-85b8c0d2bbb57d99" } { x: 8, timestamp: “2019-11-02T15:35:32.743Z-0001-85b8c0d2bbb57d99" } { z: 114, timestamp: “2019-11-02T15:35:32.743Z-0002-85b8c0d2bbb57d99" } LWWMap { x: 300, y: 73, z: 114 } { x: 300, y: 73 } { x: 8, y: 73 } { x: 8 } {}

  30. Map → Last-Write-Wins-Map (LWWMap) Set → Grow-Only Set (GSet)

  31. { id: “0aead5b3-203e-475f-b3f5-1ab9ace69620”, timestamp: “2019-11-05T15:29:40.273Z-0000-eede1195b7d94dd5” } { id: “0aead5b3-203e-475f-b3f5-1ab9ace69620”, timestamp: “2019-11-02T15:35:32.743Z-0000-85b8c0d2bbb57d99" } { id: “e5b4c695-a632-4cec-a646-d61b32b2351f”, timestamp: “2019-11-02T15:35:32.743Z-0001-85b8c0d2bbb57d99" } (“0aead5b3-203e-475f-b3f5-1ab9ace69620”, GSet “e5b4c695-a632-4cec-a646-d61b32b2351f”)

  32. How to take basic relational data and turn it into CRDTs?

  33. SQLite table GSet of LWWMaps

  34. A new table: messages_crdt

  35. update("transactions", { id: "30127b2e-f74c-4a19-af65-debfb7a6a55b", name: "Kroger", amount: 450 }) // becomes { dataset: "transactions", row: "30127b2e-f74c-4a19-af65-debfb7a6a55b", column: "name", value: "Kroger", timestamp: "2019-11-02T15:35:32.743Z-0000-85b8c0d2bbb57d99" } { dataset: "transactions", row: "30127b2e-f74c-4a19-af65-debfb7a6a55b", column: "amount", value: 450, timestamp: "2019-11-02T15:35:32.743Z-0001-85b8c0d2bbb57d99" }

Recommend


More recommend