SAL - simple algorithms and datastructures library

A c++ header only library containing efficient algorithms and data structures implemented simply.

Simplicity here refers to how close the implementation matches the core concepts of each algorithm,
rather than the triviality of each algorithm.

For detailed documentation and some analysis, see my portfolio

Features

decoupled algorithms so most files are standalone

contest friendly
paste specific functions in without the rest of the library

simple implementation of efficient algorithms

learning friendly
engineered for readability

Getting started

open cmd/terminal and change directory to somewhere in your include path (ex. /usr/local/include)
type git clone --recursive git@github.com:LemonPi/sal.git
if you missed #2 and cloned it without --recursive, get the submodules with git submodule update --init

Motivation

I enjoyed some features of the boost library (namely variant), but found the library too cumbersome as a whole.
Using it in a set up environment was fine, but it was impossible to understand conceptual slices of it.
It wasn’t the best resource to learn the concepts from, and you couldn’t (in most cases) use functions from it for contests.

The focus for this library is on representing the concepts

For examples, look at algotest.cpp and datatest.cpp

Algorithms

sal/algo/numerics.h —- numeric

Θ(lg(exponent)) modular exponentiation
Θ(lg(exponent)) integer exponentiation
Θ(lg(n)) fibonacci generation
Θ(prime) cyclic number generation and detection (1/prime in a given base)
Θ(M(n)) checking for perfect square (M(n) is the complexity of the multiplication algorithm used)
Θ(lg(ab)) greatest common denominator of integers a and b
totient (number coprimes below)
Θ(n^3) matrix chain multiplication ordering
prime factorize
number of total factors
sum of total factors

sal/algo/perm.h —- permutation and combination

kth permutation of an indexable sequence
all permutations (non-distinct)
all permutations (distinct)
generate all possible combinations using each element in sequence
count ways to generate a sum using a set number of elements of given value

sal/algo/prime.h —- prime generation and manipulation

sequence of primes
nth prime
prime closest to a number
prime after a number
count primes below a number
check if prime

sal/algo/search.h —- basic searching, substring matching, and finding longest common features

binary search on sorted sequence
intersection of a set of sets
order statistic select
matching word inside sentence
longest common substring
longest common subsequence
longest common subsequence length
suffix array (data structure for substring searches)

sal/algo/sort.h —- comparison, distributive, and hybrid sorts

partition
bubble sort
linear insertion sort
binary insertion sort
merge sort
quick sort
heap sort
counting sort
radix sort
tim sort
patience sort

sal/algo/string.h —- edit distances

levenshtein distance

sal/algo/utility.h —- utility and testing functions

random integer generation
timer
perturb a sequence (soft random shuffle)
fast pointer hashing

Data structures

sal/data/list.h —- basic linked list

sal/data/heap.h —- binary heap

maxheap by default
Ө(1) time search
Ө(n) time construction and batch insert
Ө(lgn) time insert
Ө(lgn) time extract top
Ө(lgn) time change key (check key if value was indirectly changed)

sal/data/matrix.h —- 2D matrix

square identities
random matrices
efficient multiplication of a sequence of matrices
resizable

sal/data/infint.h —- infinity precision integers (Sercan Tutar’s)

sal/data/tree.h —- red black tree and augmentations of it

easily extensible base RB tree
node iterator
order statistic tree augmentation
treap
order statistic tree

sal/data/interval.h —- interval tree

overlapping regions or time intervals
collision resolution
scheduling

sal/data/graph.h —- directed and undirected graphs

adjacency matrix (not really supported)
adjacency list
vertex iterator
adjacency iterator

sal/data/graph/search.h —- graph searches

Ө(V + E) breadth first search
Ө(V + E) depth first search
Ө(V + E) recursive depth first search
visitors for custom actions at certain points of DFS

sal/data/graph/utility.h —- important graph algorithms

Ө(V + E) topological sort
Ө(V + E) check for cycle
Ө(V + E) transpose of graph
Ө(V + E) strongly connected components
O((V + E)lgV) minimum spanning tree

sal/data/graph/shortest.h —- shortest paths for different graphs or trees

single source algorithms:
O(VE) Bellman-Ford (allows negative edges)
O(V + E) shortest dag (directed acyclic graph)
O((V + E)lgV) Dijkstra (non-negative edges)

sal/data/graph/linear.h —- linear programming

O(mn) Difference constraint feasibility (m <- # constraints, n <- # variables)

Example usage

namespace sal will be used implicitely here
functions that take iterator pairs are overloaded to take containers as well

numeric" class="reference-link">sal/algo/numeric.h —- numeric

// 7^91 % 10
modular_pow(7, 91, 10); 
// int 3
// 13789^722341 % 2345
modular_pow(13789, 722341, 2345); 
// int 2029
int_pow(5, 3);
// int 125
fibonacci<Infint>(1000);
// Infint 43466557686937456435688527675040625802564660517371780402481729089536555417949051890403879840079255169295922593080322634775209689623239873322471161642996440906533187938298969649928516003704476137795166849228875
// repeating part of 1/7 in base 10
make_cyclic(10, 7);
// size_t 142857
// length of repeating part of 1/7 in base 10
cycle_length(10, 7);
// size_t 6
// check if number is an integer power of a base
is_pow(4194304,4);
// bool true
// check if number is a perfect square
is_square(21489798124);
// bool false
// greatest common denominator
gcd(56, 91);
// unsigned int 7
// chain matrix multiplication ordering to minimize work
std::vector<Matrix<int>> mats;
size_t row {30}, col {35};
for (int i = 0; i != 100; ++i) {
    // generate sequence of random matrices
    mats.push_back(random_matrix<int>(row, col, 0, 50));
    // next matrix's row must be prev matrix's col
    row = col;
    col = rand() % 100 + 5;    // between 5 - 105
}
mul(mats);
// Matrix<int> 
totient(500);
phi(500);
// big_int 200 (200 numbers below 500 that are coprime with it)
size_t num = 421412;
// prime factorize regular numbers (usually smooth) ----------
factorize(421412);
// vector<size_t> 2 2 137 769 (in order)
num_factors(num);
// size_t 12 (1 2 4 137 274 548 769 1538 3076 105353 210706 421412)
sum_factors(num);
// size_t 743820 (1 + 2 + 4 + 137 + ... + 421412)
// factorize primes or semiprimes ----
big_int semiprime = 32452843 * 32452867;    // 1053187797650881
// would still not be too slow
factorize(semiprime);
// would be faster
factorize_rough(semiprime);
// would be much faster for repeated uses (primes are kept static)

permutation and combination" class="reference-link">sal/algo/perm.h —- permutation and combination

std::string words {"Hello"};
// modifies the sequence without return
// algorithms work on any indexable sequence, string is the example's sequence type
// k is from 0 to s.size()! - 1
perm(words, 1);
// sequence "oHell"
llperms(words);
// vector<sequence> (120 permutations of "Hello")
allperms_distinct(words);
// set<sequence> (60 distinct permutations of "Hello")
std::vector<int> ints {1,2,3,4,5,6};
std::set<int> int_add {combine(ints, 
    [](int a, int b){return a + b;})};
// 2 3 4 5 6 7 8 9 10 11 12 handshake by adding
std::set<int> int_add_odd {combine(ints,
    [](int a, int b){return a + b;},
    [](int a, int b){return (a & 1) && (b & 1);})};
// 2 4 6 8 10 handshake by adding odd elements
std::vector<int> coins {1, 2, 5, 10, 20, 50, 100, 200};
int combos = count_combos(coins, 200);
// size_t 73682 (ways to sum up to 200 using coins)

prime generation and manipulation" class="reference-link">sal/algo/prime.h —- prime generation and manipulation

// keep a sieve object if continuous work with primes is needed
using big_int = size_t;
Sieve<big_int> sieve;
while (true) sieve.next_prime();
// generate infinite stream of primes (each of big_int)
sieve.nth_prime(1000);
// big_int 7919 (1000th prime)
sieve.is_prime(sieve.nth_prime(420000));
// bool true (the 4200000th prime is a prime!)
sieve.count(1000000);
// size_t 78498 primes below a million 
sieve.closest_prime(50000);
// big_int 49999
sieve.next_prime(50000);
// big_int 50021,
sieve.primes_upto(1000000);
// vector<big_int> (78498 primes that are under one million)

basic searching, substring matching, and finding longest common features" class="reference-link">sal/algo/search.h —- basic searching, substring matching, and finding longest common features

std::vector<int> seq {1,3,12,14,15,18,20};
// bin_search assumes sequence is in order and elements are comparable
bin_search(seq.begin(), seq.end(), 12);
// iterator to element 12
bin_search(seq.begin(), seq.end(), 17);
// iterator seq.end()
std::vector<int> seq2 {1,3,5,6,7,8,20,32};
std::vector<int> seq3 {2,3,6,9,20,32,45,55};
intersection(std::set<vector<int>>{seq, seq2, seq3});
// unordered_set {3, 20} (elements shared by all 3 sequences)
std::string a {"It was the best of times..."};
std::string b {"That's the best orange juice!"};
lc_substr(a, b);
// string "s the best o"
lc_subseq(a, b);
// string "as the best o ie"
lc_subseq_len(a, b);
// size_t 16
sub_match(a, std::string{"the best"});
// const_iterator to 't' in a
// find ith smallest element (1 is smallest)
std::vector<int> v {632, 32, 31, 50, 88, 77, 942, 5, 23};
select(v.begin(), v.end(), 4);
// iterator to 4th element (50)

comparison, distributive, and hybrid sorts" class="reference-link">sal/algo/sort.h —- comparison, distributive, and hybrid sorts

std::vector<int> u {632, 32, 31, 88, 77, 942, 5, 23};
partition(u.begin(), u.end());
// iterator to pivot 77
// 23 32 31 5 77 942 88 632 (all elements < 77 on left and > 77 on right)
std::vector<int> v {randgen(1048576, 100000)};    // 2^20
bub_sort(v.begin(), v.end());
// need to know maximum for counting sort, else wastes one pass finding maximum
cnt_sort(v.begin(), v.end(), 1048577);
// can also not specify anything, additional call to min_max
cnt_sort(v.begin(), v.end());
// sort only on only bits 20-12 which only requires 2^8 = 256 bits of storage
cnt_sort(v.begin(), v.end(), 256, [](int n){return (n & 0xFF000) >> 12;});
// binary insertion sort
ins_sort(v.begin(), v.end());
heap_sort(v.begin(), v.end());
// linear insertion sort
lin_sort(v.begin(), v.end());
mer_sort(v.begin(), v.end());
// patience sort, really slow in practice
pat_sort(v.begin(), v.end());
qck_sort(v.begin(), v.end());
// need to know maximum for counting sort, else uses the maximum bit of size
rdx_sort(v.begin(), v.end(), 20);    // 20 bits needed for 2^20 max
tim_sort(v.begin(), v.end());

edit distances" class="reference-link">sal/algo/string.h —- edit distances

// # edit actions including insertion, deletion, or substitution to turn source into new
levenshtein("Saturday", "Sunday");
// size_t 3

utility and testing functions" class="reference-link">sal/algo/utility.h —- utility and testing functions

// precision timing, starts counting when constructed
Timer timing;
// generate random vector of integers for testing
int max_value = 50000;
int element_number = 4000;
randgen(max_value, element_number);
// vector<int> (4000 elements with values between 0 and 50000)
// restart timer
timing.restart();
std::vector<int> v;
v.reserve(element_number);
for (int i = 0; i < element_number; ++i) v.push_back(i);
// vector 1 2 3 4 5 ... 50000
// randomly displace elements of v probabilistically 5 elements away from sorted position
perturb(v.begin(), v.end(), 5);
// print a sequence so elements are separated by space
print(v);
timing.tonow();
// double in microseconds

basic linked list" class="reference-link">sal/data/list.h —- basic linked list

// Construction ----------------------
Basic_list<int> l {5, 7, 11, 13, 17, 19};
std::cout << l;
// 5 7 11 13 17 19
// Queries ---------------------------
l.kth_last(2);
// pointer to node 19 
std::cout << l.kth_last(2)->data << std::endl;
// 19
// Modification ----------------------
// insert at head
l.insert(3);
l.print();
// 3 5 7 11 13 17 19
// append to tail
l.append(23);
// 3 5 7 11 13 17 19 23
// insert after the 2nd last 
l.insert_after(19, l.kth_last(2));
// 3 5 7 11 13 17 19 19 23
l.erase(13);
// 3 5 7 11 17 19 19 23
// remove duplicates
l.remove_dup();
// 3 5 7 11 17 19 23

Binary heap" class="reference-link">sal/data/heap.h —- Binary heap

// great for use as a priority queue
// advantage over std::priority_queue or using make_heap and friends
// can find key in Ө(1) time and change its value in Ө(lgn) time
// can also check key for indirect value update (from comparator)
// Construction ----------------------
// by default maxheap using std::greater<T> as Cmp
Heap<int> maxheap {3, 4, 6, 5, 1, 8, 11, 12};
// can also construct via iterator range
std::vector<int> temp {3, 4, 6, 5, 1, 8, 11, 12};
Heap<int> maxheap2 {temp.begin(), temp.end()}
// give a different comparator
Heap<int, std::less<int>> minheap {3, 4, 6, 5, 1, 8, 11, 12};
// a stateful comparator, for example the one used in minimum spanning tree
// looks up vertices' minimum edge in each step of Prim's algorithm
template <typename Property_map>
struct Prim_cmp {
    using V = typename Property_map::key_type;
    // keep reference to a mapping that might change
    const Property_map& property;
    Prim_cmp(const Property_map& p) : property(p) {}
    // u and v will always be valid vertices, so no need to check for not found
    bool operator()(const V& u, const V& v) const {
        return property.find(u)->second.min_edge < property.find(v)->second.min_edge; 
    }
};
// minimum spanning tree's property map
using Cmp = Prim_cmp<MPM<Graph>>;
using V = Graph::vertex_type;
MPM<Graph> property;
Cmp cmp {property};
// ... load necessary info into property
// constructor for stateful comparator
Heap<V, Cmp> queue {cmp};
// Ө(n) batch insert
queue.batch_insert(g.begin(), g.end());
// Queries ---------------------------
// looking at top (min in a minheap, max in a maxheap)
maxheap.top();
// value_type 12
// index used to change and check elements held at key Ө(1) lookup
maxheap.key(12);
// size_t 1
// Iterations ------------------------
// begin and end as per standard
for (auto v : maxheap) std::cout << v << ' ';
// Modification ----------------------
// pop top and return it
maxheap.extract_top();
// value_type 12 (top() did not modify top)
// change key to something that has "greater" value
// for maxheap: increase key, for minheap: decrease key
// does nothing if you attempt to decrease value
maxheap.change_key(maxheap.key(12), 13);
// directly change using value (does a key lookup in the background)
maxheap.change_val(13, 15);
// indirectly change key by affecting the comparator
// for example in the relaxation step of minimum spanning tree
property[edge.dest()].min_edge = edge.weight();
// to affect change in the queue, check the key (will update it if heap violations exist)
queue.check_key(queue.key(edge.dest()));
// inserts into proper position
maxheap.insert(2);
// batch insertion, as shown above
maxheap.batch_insert(temp.begin(), temp.end());

2D matrix" class="reference-link">sal/data/matrix.h —- 2D matrix

// Construction ----------------------
Matrix<int> A {{2, 5, 6},
           {3, 4, -3},
           {7, 8, 0}};
Matrix<int> B {{-1, 1},
           {5, -2},
           {4, 2}};
// Queries ---------------------------
// size query
A.row();
// size_t 3
B.col();
// size_t 2
// get element (indices from 0)
A.get(1, 2);
// int -3 (2nd row, 3rd col)
// Arithmetic Operations -------------
Matrix<int> F {{1, 1},
               {1, 0}};
// matrix exponentiation
F.pow(5);
// Matrix<int>
// 8  5
// 5  3
// matrix addition
F += Matrix<int> {{2, 5}, {5, 7}};
// 10 10
// 10 10
F + Matrix<int>{{1,1},{1,1},{1,1}};
// runtime error exception from adding different dimensions
// for least work in multiplying matrices, use mul(sequence) from numeric.h
Matrix<int> C = A * B;
// Matrix<int>
// 2 5  6   -1  1       47  4
// 3 4 -3    5 -2  -->  5  -11 
// 7 8  0    4  2       33 -9
// Modification ----------------------
// individual elements can be changed by get()
// size modification
Matrix<int> id3 {identity<int>(3)};
// 1 0 0
// 0 1 0
// 0 0 1
// default for new elements is 0
id3.resize(4,5,2);
// 1 0 0 2 2
// 0 1 0 2 2
// 0 0 1 2 2
// 2 2 2 2 2
id3.resize(2,2);
// 1 0
// 0 1
// clockwise rotation
C.rotate();
// 33   5   47
// -9  -11  4

red black tree and augmentations of it" class="reference-link">sal/data/tree.h —- red black tree and augmentations of it

// Basic tree usage ------------------
Basic_tree<int> t {5, 3, 7, 1, 9, 4, 2, 0, 10, 8, 6};
// insert element
t.insert(5);
// erase element
t.erase(1);
// find element
auto node = t.find(5);
// iterator to node holding 5
t.find(11);
// iterator to end == t.end()
// Iterations ------------------------
// iteration over adjacent nodes (left to right)
for (auto adjacent = node.begin(); adjacent != node.end(); ++adjacent)
    std::cout << *adjacent << ' ';
// * operator returns key
// iteration over all nodes (in order)
using Node = Basic_node<int>;
std::for_each(t.begin(), t.end(), 
    [](const Node& node){std::cout << node.key << ' ';});
// Extending Tree for an augmented data structure ---------
// first step is creating a node
/* requires
    1. key element that can be compared using <
    2. static node pointer nil, sentinel
    3. parent, left, and right pointers initialized to nil
    4. Color attribute, BLACK for nil and RED for non-nil
*/
template <typename T>
struct Order_node {
    static Order_node* nil;
    using key_type = T;
    T key;
    Order_node *parent, *left, *right;
    // # of descendent nodes including itself = left->size + right->size + 1
    size_t size;    
    Color color;
    Order_node() : size{0}, color{Color::BLACK} {}    // sentinel construction
    Order_node(T val) : key{val}, parent{nil}, left{nil}, right{nil}, size{1}, color{Color::RED} {}
};
template <typename T>
Order_node<T>* Order_node<T>::nil {new Order_node{}};
// Augmented tree need to publically inherit from Tree<Node>
// it should use the following protected members from Tree<node>
template <typename Node>
class Order_augment : public Tree<Node> {
    using NP = Node*;    // optional
    using T = typename Node::key_type;    // optional
    using Tree<Node>::root;
    using Tree<Node>::rb_insert;
    using Tree<Node>::transplant;
    using Tree<Node>::rb_insert_fixup;
    using Tree<Node>::rb_delete_fixup;
// iterators can also be inherited from Tree
    using iterator  = typename Tree<Node>::iterator;
    using const_iterator = typename Tree<Node>::const_iterator;    
// override specific virtual methods to maintain augmented property
// tree_insert, rb_delete, rotate_left, and rotate_right
    virtual void tree_insert(NP start, NP node) override {
        NP parent {Node::nil};
        while (start != Node::nil) {
            // modification: simply increment size of each ancestor going down
            ++start->size;    
            parent = start;
            if (node->key < start->key) start = start->left;
            else start = start->right;
        }
        node->parent = parent;
        if (parent == Node::nil) root = node;
        else if (node->key < parent->key) parent->left = node;
        else parent->right = node;
    }
};
// provide convenient user access
template <typename T>
using Order_tree = Order_augment<Order_node<T>>;
// Utilities -------------------------
// non-member functions that work on nodes only
Order_tree<int> t {5, 3, 7, 1, 9, 4, 2, 0, 10, 8, 6};
using NP = Order_node<int>*;
auto node = t.find(5);
// const iterator
// check if t is a valid RB tree, 0 is invalid, else returns blackheight
// PRINTLINE macro is from utility.h
PRINTLINE(t.valid());
// get() on iterators return the raw node pointer
tree_min(node.get());
// node pointer to smallest node rooted at given node pointer
// goes left as far as possible
tree_max(node.get());
// node pointer to largest node, symmetrical to tree_min
tree_predecessor(node.get());
// node pointer to next smallest node
tree_successor(node.get());
// node pointer to next largest node
// traversals take NP root and an operation that acts on NP
inorder_walk(node.get(), [](const NP node){std::cout << node->key << ' ';});
preorder_walk(node.get(), [](const NP node){std::cout << node->key << ' ';});
postorder_walk(node.get(), [](const NP node){std::cout << node->key << ' ';});
// Order statistics tree usage (with t above)
// Queries ---------------------------
// selecting ith smallest
t.select(4);
t[4];
// iterator to node holding 4th smallest (3)
// select is overloaded on constness, operator[] is always non-const
// check the rank of a node pointer
auto node = t.find(5);
t.rank(node.get());
// size_t 6 (node holding 5 is 6th smallest node)

interval tree" class="reference-link">sal/data/interval.h —- interval tree

// for finding overlapping intervals, useful for scheduling and collision detection
// Construction ----------------------
Interval_set<int> t {{16,21},{8,9},{5,8},{15,23},{25,30},{0, 3},{6, 10},{17,19}, 
                    {26,26}, {19,20}};
// Queries ---------------------------
// find any interval overlapping [22, 25]
// overlap by default includes edge, so [25, 26] overlaps [22, 25]
// easily adjustable by changing the strictness of ordering in the no_overlap method
t.find({22, 25});
t.find(22, 25);        // overloaded to accept Intervals as well as low, high pairs
// iterator to some overlapping interval, else end()
// find first (earliest) overlapping interval
t.find_first(20, 22);
// iterator to [15, 23] (note that [16, 21] is also overlapping, but starts later)
// find all intervals that match
for (auto interval : t.find_all(20,22)) PRINTLINE(*interval); 
// vector<node pointers> [15, 23], [16, 21]
// find exact interval match
t.find_exact(16, 21);
// iterator to [16, 21]

directed and undirected graphs" class="reference-link">sal/data/graph.h —- directed and undirected graphs

// Construction ----------------------
// unweighted is equivalent to having uniform weight 1
// undirected and unweighted graph by default
graph<int> g {{5,1},{5,4},{5,10},{1,4},{4,10}};
// directed and and unweighted graph
digraph<int> g {{5,1},{5,4},{5,10},{1,4},{4,10}};
// undirected and weighted graph
graph<int> g {{5,1,10},{5,4,13},{5,10,17},{1,4,19},{4,10,23}};
// directed and weighted graph
digraph<int> g {{5,1,10},{5,4,13},{5,10,17},{1,4,19},{4,10,23}};
// use a different type for edge weight
digraph<int,double> g {{5,1,0.5},{5,4,0.2},{5,10,0.3},{1,4,10.0},{4,10,0.1}};
// Queries ---------------------------
// number of vertex
g.num_vertex();
// size_t 4
// number of edges
g.num_edge();
// size_t 5
// check if vertex exists
g.is_vertex(5);
// bool true
// check if edge exists
g.is_edge(5, 1);
// bool true
// weight of edge (1 for unweighted, 0 if not found)
g.weight(5, 1);
// int 10
// out degree of a vertex
g.degree(5);
// size_t 3
// Modification ----------------------
// adding vertex (names have to be unique)
g.add_vertex(7);
// adding edge (have to be unique)
// adding edge to non-existent vertices will create them
// weight is 1 by default
g.add_edge(6, 4);
// Iterations ------------------------
// iterate over all vertices ordered by <
for (auto v = g.begin(); v != g.end(); ++v)
    std::cout << v;
// 1 4 5 10
// << overloaded for iterators
// adjacency iteration
auto edges = g.adjacent(5);
// alter the weight
for (auto v = edges.first; v != edges.second; ++v)
    v.weight() = 1;
// can also be done given a vertex iterator
// get vertex iterator
auto v = g.vertex(5);
// iterator to vertex named 5
// iterate over adjacent vertices u
for (auto u = v.begin(); u != v.end(); ++u)
    std::cout << u;
// 1 4 10 (5 has out edges to all of them)

graph searches" class="reference-link">sal/data/graph/search.h —- graph searches

/* algorithms expects Graph to have: 
    vertex iterators in begin() and end() (and reverse iterators in rbegin(), rend())
          * gives vertex name of type V
          begin() and end() to give adjacent iterators
      adjacent iterator pair in adjacent(V)
          * gives vertex name of type V, the destination vertex
          dest() gives name of destination vertex (same as * operator)
          weight() gives weight of edge to destination vertex
*/
// breadth first search --------------
// usually used to find distances to a source node
// works with directed and undirected but have to be unweighted
graph<char> g {{'v','r'},{'r','s'},{'s','w'},{'w','t'},{'t','x'},{'w','x'},{'t','u'},
               {'x','u'},{'x','y'},{'u','y'}};
// breadth first search starting from a vertex
auto property = bfs(g, 's');
// property map with distance and parent information
// tracing back parents of each vertex creates a breadth-first tree
for (char vertex = 'u'; ;vertex = property[vertex].parent) {
    std::cout << vertex;
    if (vertex == 's') break;
    std::cout << " <- ";
}
// u <- x <- w <- s
property['u'].distance;
// size_t 3 (number of edges to get to u from s)
// depth first search ----------------
// usually used as part of other algorithms
// can also function to find shortest number of edges from a source node
auto property = dfs(g, 's', 0);    // need dummy int variable for overload resolution
// property map with parent and time stamp information
for (char vertex = 'u'; ;vertex = property[vertex].parent) {
    std::cout << vertex;
    if (vertex == 's') break;
    std::cout << " <- ";
}
// u <- t <- w <- s different from bfs, but same length
digraph<char> e {{'u','v'},{'u','x'},{'x','v'},{'v','y'},{'y','x'},
                {'w','y'},{'w','z'},{'z','z'}};
// create depth-first forest
auto dfs_property = dfs(e);
// not particularly useful by itself, but very useful for other algorithms
// DFS visitor -----------------------
// actions taken at several key points of searching can be customized
struct Graph_visitor {
    // generate vector of vertices to visit
    // acts as a stack, visitation happens from back to front
    // by default visits every vertex
    template <typename Property_map, typename Graph>
    std::vector<typename Graph::vertex_type> initialize_vertex(Property_map& property, const Graph& g) {
        std::vector<typename Graph::vertex_type> exploring;
        for (auto v = g.rbegin(); v != g.rend(); ++v) {
            // mark unexplored
            property[*v] = {unsigned_infinity, 0, *v};
            // expore every vertex
            exploring.push_back(*v);
        }
        return std::move(exploring);
    }
    // root vertex of a new depth-first tree, when a new unconnected component is discovered
    template <typename Graph>
    void start_vertex(typename Graph::vertex_type, const Graph&) {}
    // when each vertex is first discovered
    template <typename Graph>
    void discover_vertex(typename Graph::vertex_type, const Graph&) {}
    // when a vertex's neighbours are all being explored
    template <typename Graph>
    void finish_vertex(typename Graph::vertex_type, const Graph&) {}
    // when an edge leads to an ancestor vertex of the current exploration path
    template <typename Graph>
    void back_edge(typename Graph::vertex_type, const Graph&) {}
    // when an edge leads to a vertex without a shared ancestor 
    template <typename Graph>
    void forward_or_cross_edge(typename Graph::vertex_type, const Graph&) {}
};

important graph algorithms" class="reference-link">sal/data/graph/utility.h —- important graph algorithms

// topological sort ------------------
// for directed acyclic graph (dag)
// orders all vertices so that parent vertices are always before children
// if vertices are events, then sorting gives one possible sequence of events
// directed edge (u,v) means u has to happen before v
digraph<std::string> dress {{"undershorts", "pants"}, {"undershorts", "shoes"}, {"pants", "belt"},
            {"pants", "shoes"}, {"socks", "shoes"}, {"shirt", "belt"}, {"shirt", "tie"}, {"tie", "jacket"},
            {"belt", "jacket"}};
dress.add_vertex("watch");
std::list<string> dress_order;
// give a possible ordering of events
topological_sort(dress, std::front_inserter(dress_order));
for (const std::string& item : dress_order) std::cout << item << ' ';
// watch undershorts socks shirt tie pants shoes belt jacket
// perfectly logical sequence of dressing
// cycle testing ---------------------
// some algorithms need directed acyclic graphs (dag)
has_cycle(dress);
// bool true (if false, then the ordering given would be false)
// transpose -------------------------
// create graph with all the edges reversed
transpose(dress);
// graph (of taking off clothes)
// topological sort gives
// watch shoes socks jacket tie belt shirt pants undershorts
// strongly connected components -----
// a SCC is a part of a graph where any vertex can get to any other vertex inside it
// could model trading networks and the like
digraph<std::string> trade {{"China","USA"},{"USA","EU"},{"USA","Canada"},{"USA","Russia"},{"EU","Brazil"},{"EU","France"},
                {"Brazil","EU"},{"Brazil","Australia"},{"Canada","China"},{"Canada","Russia"},{"Russia","France"},
                {"France","Russia"},{"France","Australia"},{"Australia","Australia"}};
auto mutual_partners = strongly_connected(trade);
// vector of sets of vertex names (strings in this case)
for (const auto& bloc: mutual_partners) {
    for (const std::string& country : bloc) std::cout << country << ' ';
    cout << '\t';
}
// USA China Canada      EU Brazil      Russia France       Australia
// minimum spanning tree -------------
// for a connected, undirected graph with positive edge weights
// find the cheapest way to connect all the vertices
// ex. connecting nodes on a circuit board
graph<char> circuit {{'a','b',4},{'a','h',8},{'b','h',11},{'b','c',8},{'c','i',2},
                {'c','f',4},{'c','d',7},{'d','f',14},{'d','e',9},{'e','f',10},{'f','g',2},
                {'i','h',7},{'i','g',6},{'h','g',1}};
// cheapest way to connect all the circuit nodes (suppose edges were wire lengths)
auto mst = min_span_tree(circuit);
// each vertex-property pair from the MST property map
// hold parent and the min_edge cost to parent
// to actually create the tree, simply iterate over vertex and add edge between parent and child
// pm_to_tree works on any property map and undirected graph
graph<char> min_circuit {pm_to_tree(mst)};

shortest path for different graphs and models" class="reference-link">sal/data/graph/shortest.h —- shortest path for different graphs and models

// Single source shortest paths ------
// Bellman-Ford ----------------------
// most general and slowest of path finding algorithms
// O(VE) time since it might have to backtrack
// can find negative weight cycles, in which case it returns an empty property map
digraph<char> g {{'s','t',6},{'s','y',7},{'t','y',8},{'t','x',5},{'t','z',-4},
                {'x','t',-2},{'y','x',-3},{'y','z',9},{'z','s',2},{'z','x',7}};
bellman_ford(g, 's');
// property map with parent and distance attributes
// Shortest dag ----------------------
// special case where graph is directed and acyclical
// O(V + E) time by topologically sorting the vertices and relaxing in that order
// can have negative weight since acyclical, so never negative cycles
digraph<char> g {{'r','s',5},{'r','t',3},{'s','t',2},{'s','x',6},{'t','x',7},
                {'t','y',4},{'t','z',2},{'x','y',-1},{'x','z',1},{'y','z',-2}};
shortest_dag(g, 's');
// property map with parent and distance attributes
// Dijkstra --------------------------
// directed and non-negative weight
// O((V + E)lgV) time implemented with binary heap priority queue
digraph<char> g {{'s','t',10},{'s','y',5},{'t','y',2},{'t','x',1},{'x','z',4},{'y','t',3},
                {'y','x',9},{'y','z',2},{'z','s',7},{'z','x',6}};
dijkstra(g, 's');
// property map with parent and distance attributes
// remember to actually create tree, filter with pm_to_tree
graph<char> shortest_g {pm_to_tree(mst)};

linear programming" class="reference-link">sal/data/graph/linear.h —- linear programming

// optimizing solutions to a system under certain constraints
// mathematically: Ax <= b
// A <- m x n system
// x <- n x 1 solution
// b <- m x 1 constraints
// difference constraints is a specific case of linear programming
// each limit is a difference of 2 variables
// aka each row of A has one 1 and -1, the rest being 0
// resolves into xj - xi <= bk
// create system ---------------
Constraint_sys<int> system {{1,2,0},{1,5,-1},{2,5,1},{3,1,5},{4,1,4},{4,3,-1},
                                    {5,3,-3},{5,4,-3}};
// set of constraints, ex. x1 - x2 <= 0 for the first constraint
// feasibility problem: does any x exist?
// feasible(system, n <- # of variables)
Constraint_sol<int> solution = feasible(system, 5);
// vector<limit type> -5 -3 0 -1 -4
// so x1 is -5, x2 is -3, ..
solution.empty();
// false (true if system of constraints is unfeasible)
// sum of solution and their spread is minimized