Fundamentals

Atomicity: the entire transaction happens at once, or doesn't happen at all
Consistency: the database must be in a valid state before and after each transaction (all rules hold, no constraint violations)
Isolation: multiple transactions can run concurrently without interfering with each other
Durability: changes made by a committed transaction persist even if the system fails afterwards
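
A minimal sketch of atomicity and consistency, using Python's built-in `sqlite3` module and an in-memory database. The `accounts` table, the `CHECK` constraint, and the helper names `make_db`, `transfer`, and `balances` are assumptions made up for illustration. The credit is applied first on purpose, so that a failing debit shows the earlier statement being rolled back too.

```python
import sqlite3


def make_db():
    """Create an in-memory database with two hypothetical accounts."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE accounts ("
        "name TEXT PRIMARY KEY, "
        "balance INTEGER NOT NULL CHECK (balance >= 0))"
    )
    with conn:
        conn.executemany(
            "INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 50)]
        )
    return conn


def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst; return True if the transaction committed."""
    try:
        # `with conn:` commits on success and rolls back if an exception escapes.
        with conn:
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            # Violates the CHECK constraint if src cannot cover the amount.
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        return False


def balances(conn):
    """Return {name: balance} for all accounts."""
    return dict(conn.execute("SELECT name, balance FROM accounts ORDER BY name"))
```

If the debit fails, the credit that already ran inside the same transaction is undone as well, so a reader never sees a half-applied transfer.
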
Isolation Levels
- Dirty Read: a transaction reads a value written by another transaction that hasn't been committed yet
- Non-repeatable Read: within one transaction, reading the same row twice returns different values because another transaction changed (and committed) it in between
- Phantom Read: within one transaction, two identical queries return different sets of rows (e.g. another transaction has just inserted a new row)
- MySQL (InnoDB) defaults to repeatable read; PostgreSQL defaults to read committed
- https://www.youtube.com/watch?v=GAe5oB742dw

When reads are expensive (as in an LSM tree), a Bloom filter can rule out many non-existent keys up front: it answers "no, the key is not there" without touching storage, which saves read time.
It may produce false positives (it says a key is there when it actually is not) because different keys can hash to the same bits.
How to choose an appropriate size for a Bloom filter?
- for a chosen expected number of keys and an acceptable false positive rate, use this formula:
  - m = -n * ln(p) / (ln(2))^2
  - n = expected number of keys
  - p = acceptable false positive rate
  - m = size of the Bloom filter in bits
- optimal number of hash functions (k):
  - k = (m / n) * ln(2)
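
The sizing formulas above can be sketched in Python as follows. This is an illustrative toy, not a production filter: the `BloomFilter` class, the `bloom_parameters` helper, and the choice of deriving the k positions from one SHA-256 digest via double hashing are all assumptions of this sketch.

```python
import hashlib
import math


def bloom_parameters(n, p):
    """Return (m, k): bits and hash count for n keys at false positive rate p."""
    m = math.ceil(-n * math.log(p) / (math.log(2) ** 2))
    k = max(1, round((m / n) * math.log(2)))
    return m, k


class BloomFilter:
    def __init__(self, n, p):
        self.m, self.k = bloom_parameters(n, p)
        self.bits = bytearray((self.m + 7) // 8)

    def _positions(self, key):
        # Double hashing: combine two 64-bit slices of one digest into k positions.
        digest = hashlib.sha256(key.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force odd so steps differ
        return [(h1 + i * h2) % self.m for i in range(self.k)]

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        # False means "definitely absent"; True means "possibly present".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(key)
        )
```

For example, `bloom_parameters(1000, 0.01)` asks for roughly 9.6 bits per key and 7 hash functions. A caller would check `might_contain(key)` first and only go to disk when it returns True.
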

The C10K problem: the challenge of designing and implementing a server that can efficiently handle a large number of concurrent client connections, specifically 10,000.
Solutions:
- asynchronous I/O
- non-blocking I/O
- efficient data structures
- connection pooling
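
As a rough illustration of the asynchronous, non-blocking approach, here is a small echo server built on Python's `asyncio` streams. One event loop serves every connection instead of one thread per client. The function names `handle_client` and `run_demo`, the loopback address, and the client count are assumptions chosen for the demo; a real C10K setup would also need OS-level tuning (file descriptor limits, listen backlog) that this sketch does not cover.

```python
import asyncio


async def handle_client(reader, writer):
    """Echo each line back; every await hands control to other connections."""
    try:
        while line := await reader.readline():
            writer.write(line)
            await writer.drain()
    finally:
        writer.close()
        await writer.wait_closed()


async def run_demo(num_clients=100):
    """Start the server on a free port and drive it with concurrent clients."""
    server = await asyncio.start_server(handle_client, "127.0.0.1", 0)
    port = server.sockets[0].getsockname()[1]

    async def client(i):
        reader, writer = await asyncio.open_connection("127.0.0.1", port)
        writer.write(f"hello {i}\n".encode())
        await writer.drain()
        reply = await reader.readline()
        writer.close()
        await writer.wait_closed()
        return reply.decode().strip()

    try:
        # All clients are in flight at once on a single thread.
        return await asyncio.gather(*(client(i) for i in range(num_clients)))
    finally:
        server.close()
        await server.wait_closed()
```

Running `asyncio.run(run_demo(100))` returns the echoed lines in client order.
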

Change data capture (CDC) captures changes made within a database.
Implementations:
- Query based
- Trigger based
- Log based
- Proprietary (provided by the database vendor)
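
A minimal sketch of the query-based approach, assuming a hypothetical `users` table whose `version` column is bumped on every write. The consumer polls for rows above the last version it has seen. All table, column, and function names here are made up for illustration, and the sketch uses `sqlite3` only to stay self-contained. Note that polling like this cannot observe hard deletes, which is one reason log-based capture is often preferred.

```python
import sqlite3


def make_db():
    """Create an in-memory database with a hypothetical versioned table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE users ("
        "id INTEGER PRIMARY KEY, name TEXT NOT NULL, version INTEGER NOT NULL)"
    )
    return conn


def upsert_user(conn, user_id, name):
    """Write a row and stamp it with the next version number."""
    with conn:
        (next_version,) = conn.execute(
            "SELECT COALESCE(MAX(version), 0) + 1 FROM users"
        ).fetchone()
        conn.execute(
            "INSERT OR REPLACE INTO users (id, name, version) VALUES (?, ?, ?)",
            (user_id, name, next_version),
        )


def poll_changes(conn, last_version):
    """Return (rows changed since last_version, new high-water mark)."""
    rows = conn.execute(
        "SELECT id, name, version FROM users WHERE version > ? ORDER BY version",
        (last_version,),
    ).fetchall()
    new_mark = rows[-1][2] if rows else last_version
    return rows, new_mark
```

A consumer would call `poll_changes` on a timer, forward the returned rows downstream, and store the new mark for the next poll.
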

The thundering herd problem occurs when a large number of requests try to access the same resource within a short time frame, for reasons such as cache eviction or a hardware restart.
The flooded backend may then cause further system failures.
To mitigate this issue:
- randomized cache expiration time
- rate limiting
- backend sharding
- asynchronous cache updates
- backoff time on client side
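
Two of the mitigations above, randomized expiration and client-side backoff, can be sketched as small helpers. The function names, default values, and the "full jitter" backoff variant are assumptions of this sketch rather than anything prescribed by the notes.

```python
import random


def jittered_ttl(base_ttl, jitter_fraction=0.1, rng=random):
    """Spread expirations over base_ttl +/- jitter_fraction so keys don't expire together."""
    spread = base_ttl * jitter_fraction
    return base_ttl + rng.uniform(-spread, spread)


def backoff_delay(attempt, base=0.1, cap=10.0, rng=random):
    """Exponential backoff with full jitter: a random delay up to min(cap, base * 2**attempt)."""
    return rng.uniform(0, min(cap, base * 2 ** attempt))
```

A cache writer would call `jittered_ttl(300)` instead of using a fixed 300 second TTL, and a client would sleep for `backoff_delay(attempt)` seconds before retry number `attempt`, so that retries from many clients do not arrive in lockstep.
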

Gossip protocol: a protocol to detect node failures in a distributed, decentralized cluster.
Each node periodically passes information (heartbeat, timestamp, etc.) to a random set of other nodes to let them know it is still alive.
Each receiving node adds information about itself and passes the combined information on to another random set of nodes.
Eventually all nodes are reached and hold the updated information.
If, after a while, none of the nodes has heard from one particular node, then that node is considered down.
Cassandra uses a gossip protocol to detect node failures.
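
The rounds described above can be simulated in a few lines. This is a toy, in-process model under simplifying assumptions (synchronous rounds, push-only gossip, no network loss). The `Node` class, `run_round`, `simulate`, and the `fanout` and `timeout` parameters are all hypothetical names, and it is not how Cassandra implements failure detection.

```python
import random


class Node:
    def __init__(self, node_id, all_ids):
        self.node_id = node_id
        self.alive = True
        # node_id -> [highest heartbeat seen, local round when it last increased]
        self.table = {nid: [0, 0] for nid in all_ids}

    def tick(self, now):
        """Bump this node's own heartbeat counter."""
        self.table[self.node_id] = [self.table[self.node_id][0] + 1, now]

    def merge(self, remote_table, now):
        """Keep the freshest heartbeat known for every node."""
        for nid, (heartbeat, _) in remote_table.items():
            if heartbeat > self.table[nid][0]:
                self.table[nid] = [heartbeat, now]

    def suspected_down(self, now, timeout):
        """Nodes whose heartbeat has not advanced for more than `timeout` rounds."""
        return {nid for nid, (_, seen) in self.table.items() if now - seen > timeout}


def run_round(nodes, now, fanout, rng):
    alive = [n for n in nodes if n.alive]
    for node in alive:
        node.tick(now)
    for node in alive:
        # Push the local table to a random set of peers; dead peers ignore it.
        for peer in rng.sample([n for n in nodes if n is not node], fanout):
            if peer.alive:
                peer.merge(node.table, now)


def simulate(num_nodes=10, fanout=3, timeout=15, seed=42):
    """Run 10 healthy rounds, stop node 7, then gossip for 40 more rounds."""
    rng = random.Random(seed)
    nodes = [Node(i, range(num_nodes)) for i in range(num_nodes)]
    for now in range(1, 11):
        run_round(nodes, now, fanout, rng)
    nodes[7].alive = False
    for now in range(11, 51):
        run_round(nodes, now, fanout, rng)
    return {n.node_id: n.suspected_down(50, timeout) for n in nodes if n.alive}
```

`simulate()` returns, for each surviving node, the set of nodes it currently suspects. The `timeout` trades detection speed against the chance of falsely suspecting a healthy node whose heartbeat simply has not reached you yet.
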
