CodeHariK/Database

Secretary Bird

Strategy

1. Persistence. How not to lose or corrupt your data. Recovering from a crash.
2. Indexing. Efficiently querying and manipulating your data. (B-tree).
3. Concurrency. How to handle a large number of clients at once, and transactions.

Vector Db

Search

Data Structure

Probabilistic Data Structures

Bloom filter

A Bloom filter is a space-efficient probabilistic data structure used to test whether an element is a member of a set. False positives are possible, but false negatives are not: a query returns either "possibly in set" or "definitely not in set". In the basic form, elements can be added to the set but not removed.

A Bloom filter represents a set of n items whose main operation is the membership query, i.e., whether a given item is in the set.
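The behaviour described above can be sketched in a few lines of Python. This is a minimal illustration, not code from this repository: the class name `BloomFilter`, the method `might_contain`, the default sizes, and the choice of deriving positions from a SHA-256 digest by double hashing are all assumptions made for the example.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter (hypothetical sketch, sizes chosen arbitrarily)."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)  # all bits start at 0

    def _positions(self, item: str):
        # Derive num_hashes bit positions from one digest (double hashing).
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force odd so it is never 0
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, item: str) -> bool:
        # False means "definitely not in set"; True means "possibly in set".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(item)
        )


bf = BloomFilter()
bf.add("apple")
bf.add("banana")
print(bf.might_contain("apple"))   # True: added items are never reported missing
print(bf.might_contain("cherry"))  # almost certainly False; a false positive is possible
```

There is no `remove` method: clearing a bit could also erase evidence of other items that hash to the same position, which is why the basic structure only supports insertion.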

Uses

Cache filtering

Content delivery networks deploy web caches around the world to cache and serve web content to users with greater performance and reliability. A key application of Bloom filters is deciding efficiently which web objects to store in these caches. To avoid caching one-hit-wonders (objects requested only once), a Bloom filter keeps track of every URL that users access. An object is cached only when its URL is already in the filter, i.e. on its second request, so objects requested just once never take up cache space.
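A rough sketch of this "cache on second hit" policy is shown below. It is an assumption-laden illustration rather than how any particular CDN implements it: `SecondHitCache`, its `get` method, the `fetch` callback, and the filter sizes are hypothetical names and values, and the Bloom filter is inlined so the example is self-contained.

```python
import hashlib


class SecondHitCache:
    """Caches an object only once its URL has been requested before."""

    def __init__(self, num_bits: int = 4096, num_hashes: int = 3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.seen = bytearray((num_bits + 7) // 8)  # Bloom filter of requested URLs
        self.cache = {}                             # url -> content

    def _positions(self, url: str):
        digest = hashlib.sha256(url.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def _seen_before(self, url: str) -> bool:
        return all(self.seen[p // 8] & (1 << (p % 8)) for p in self._positions(url))

    def _mark_seen(self, url: str) -> None:
        for p in self._positions(url):
            self.seen[p // 8] |= 1 << (p % 8)

    def get(self, url: str, fetch):
        if url in self.cache:
            return self.cache[url]       # cache hit
        content = fetch(url)             # go to the origin
        if self._seen_before(url):
            self.cache[url] = content    # second (or later) request: worth caching
        else:
            self._mark_seen(url)         # first request: only remember the URL
        return content


origin_calls = []

def fetch_from_origin(url):
    origin_calls.append(url)
    return f"<body of {url}>"

cdn = SecondHitCache()
for _ in range(3):
    cdn.get("/index.html", fetch_from_origin)
print(len(origin_calls))  # 2: the third request is served from the cache
```

A false positive in the filter only means that an object is occasionally cached on its first request, which costs a little space but never returns wrong content.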

Web Crawler

HyperLogLog

HyperLogLog is a probabilistic cardinality estimator: an algorithm for the count-distinct problem, which approximates the number of distinct elements in a multiset.
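To make the idea concrete, here is a simplified HyperLogLog sketch in Python. It is an illustration under stated assumptions, not a production implementation: the class and method names are hypothetical, the 64-bit hash is taken from a SHA-256 digest, and only the small-range (linear counting) correction is applied.

```python
import hashlib
import math


class HyperLogLog:
    """Simplified HyperLogLog with m = 2**p registers (use p >= 7)."""

    def __init__(self, p: int = 10):
        self.p = p
        self.m = 1 << p
        self.registers = [0] * self.m

    def add(self, item: str) -> None:
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        x = int.from_bytes(digest[:8], "big")      # 64-bit hash value
        idx = x >> (64 - self.p)                   # first p bits pick a register
        rest = x & ((1 << (64 - self.p)) - 1)      # remaining 64 - p bits
        # Rank = 1-based position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        self.registers[idx] = max(self.registers[idx], rank)

    def count(self) -> float:
        m = self.m
        alpha = 0.7213 / (1 + 1.079 / m)           # bias constant for m >= 128
        raw = alpha * m * m / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * m and zeros:
            return m * math.log(m / zeros)         # linear counting for small sets
        return raw


hll = HyperLogLog()
for i in range(10_000):
    hll.add(f"user-{i}")
    hll.add(f"user-{i}")                           # duplicates do not change the estimate
print(round(hll.count()))                          # an estimate near 10000
```

With 1024 registers the expected relative error is roughly 1.04 / sqrt(1024), about 3%, while the state stays at 1024 small integers regardless of how many items are added.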

Count–min sketch

The goal of the basic version of the count–min sketch is to consume a stream of events, one at a time, and count the frequency of the different types of events in the stream.
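The stream-counting goal above can be sketched as follows. This is a minimal example with assumed names and sizes (`CountMinSketch`, `width`, `depth`), and it uses salted SHA-256 digests as a stand-in for the pairwise-independent hash functions normally assumed in the analysis.

```python
import hashlib


class CountMinSketch:
    """depth rows of width counters; each row uses a differently salted hash."""

    def __init__(self, width: int = 256, depth: int = 4):
        self.width = width
        self.depth = depth
        self.table = [[0] * width for _ in range(depth)]

    def _columns(self, event: str):
        for row in range(self.depth):
            digest = hashlib.sha256(f"{row}:{event}".encode("utf-8")).digest()
            yield row, int.from_bytes(digest[:8], "big") % self.width

    def add(self, event: str, count: int = 1) -> None:
        # Increment one counter per row.
        for row, col in self._columns(event):
            self.table[row][col] += count

    def estimate(self, event: str) -> int:
        # Collisions can only inflate counters, so the minimum is the best guess.
        return min(self.table[row][col] for row, col in self._columns(event))


stream = ["click", "view", "click", "purchase", "click", "view"]
cms = CountMinSketch()
for event in stream:                 # consume the stream one event at a time
    cms.add(event)
print(cms.estimate("click"))         # at least 3; never an undercount
```

The estimate can exceed the true frequency when different events collide in every row, but it can never fall below it, and memory use is fixed at `width * depth` counters no matter how long the stream is.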

HeavyKeeper TopK

TDigest

Cache

LSM-Tree

#04 - Database Storage: Log-Structured Merge Trees & Tuples (CMU Intro to Database Systems)

https://github.com/facebook/rocksdb/wiki

https://github.com/krasun/lsmtree

https://github.com/skyzh/mini-lsm

OLAP

Concurrency

Compress

About

B+ tree database and data structures
