Breadth First Search (BFS) : Input: Graph G (V, E), Either Directed or Undirected, Output

Download as pdf or txt
Download as pdf or txt
You are on page 1of 53

Breadth First Search (BFS)

Input: Graph G = (V, E), either directed or undirected,


and source vertex s V.
Output:
d[v] = distance (smallest # of edges, or shortest path) from s to v,
for all v V. d[v] = if v is not reachable from s.
[v] = u such that (u, v) is last edge on shortest path s
v.
u is vs predecessor.

Builds breadth-first tree with root s that contains all reachable


vertices.

BFS: some points


A vertex is discovered the first time it is encountered during the
search.
A vertex is finished if all vertices adjacent to it have been
discovered.

Color the vertices to keep track of progress.


White Undiscovered.
Gray Discovered but not finished.
Black Finished.
Colors are required only to reason about the algorithm. Can be implemented
without colors.

BFS: Algorithm
BFS(G,s)
white: undiscovered
1. for each vertex u in V[G] {s}
gray: discovered
2
do color[u] white
black: finished
3
d[u]
4
[u] nil
Q: a queue of discovered
5 color[s] gray
vertices
6 d[s] 0
color[v]: color of v
d[v]: distance from s to v
7 [s] nil
[u]: predecessor of v
8 Q
9 enqueue(Q,s)
10 while Q
11
do u dequeue(Q)
12
for each v in Adj[u]
13
do if color[v] = white
14
then color[v] gray
15
d[v] d[u] + 1
16
[v] u
17
enqueue(Q,v)
18
color[u] black

BFS: Analysis
Initialization takes O(V).
Traversal Loop
After initialization, each vertex is enqueued and dequeued at most
once, and each operation takes O(1). So, total time for queuing is
O(V).
The adjacency list of each vertex is scanned at most once. The
sum of lengths of all adjacency lists is (E).

Summing up over all vertices => total running time of


BFS is O(V+E), linear in the size of the adjacency list
representation of graph.

Depth First Search traversal


Input: G = (V, E), directed or undirected. No source
vertex given!
Output:
2 timestamps on each vertex. Integers between 1 and 2|V|.
d[v] = discovery time (v turns from white to gray)
f [v] = finishing time (v turns from gray to black)

[v] : predecessor of v = u, such that v was discovered during the


scan of us adjacency list.

Uses the same coloring scheme for vertices as BFS.

DFS: Algorithm
DFS(G)
1. for each vertex u V[G]
2.
do color[u] WHITE
3.
[u] NIL
4. time 0
5. for each vertex u V[G]
6.
do if color[u] = WHITE
7.
then DFS-Visit(u)

Uses a global timestamp time.

DFS-Visit(u)
1.
color[u] GRAY // White vertex
u has been discovered
2.
time time + 1
3.
d[u] time
4.
for each v Adj[u]
5.
do if color[v] = WHITE
6.
then [v] u
7.
DFS-Visit(v)
8.
color[u] BLACK // Blacken u;
it is finished.
9.
f[u] time time + 1

Properties of DFS
Property 1
DFS-VISIT(G, u) visits all the vertices and edges in the connected
component of v.
Property 2
The discovery edges labeled by DFS-VISIT(G, v) form a spanning tree of
the connected component of v.
Property 3
The DFS(G) form a forest of spanning trees of the connected components
of G.
A

Analysis of DFS
Loops on lines 1-2 & 5-7 take (V) time, excluding
time to execute DFS-Visit.
DFS-Visit is called once for each white vertex vV
when its painted gray the first time.
Lines 4-7 of DFS-Visit is executed |Adj[v]| times. The
total cost of executing DFS-Visit is vV|Adj[v]| = (E)
Total running time of DFS is (V+E).

DFS on directed Graph


Four type of edges are produces
1. Tree edges: are edges (u,v) if v was first discovered
by exploring edge (u,v).
2. Back edges: are edges (u,v) connecting a vertex u
to an ancestor v in DFS tree. Self loops are also
called back edges.
3. Forward edges: are non-tree edges (u,v)
connecting a vertex u to a descendent v in DFS tree.
4. Cross edges: are all other edges. Can go between
vertices in the same DFS tree or they can go
between vertices in different DFS trees.

Classification of edges in DFS tree


Each edge (u,v) can be classified by the color of the
vertex v that is reached when edge is first explored.
1. WHITE indicates a tree edge.
2. GRAY indicates a back edge.
3. BLACK indicates a forward edge or cross edge.
In case 3, if d[u] < d[v] : it is a forward edge.
In case 3, if d[u] > d[v]: it is a cross edge

DFS :Applications
Path Finding:
We can specialize the DFS algorithm to find a path
between two given vertices u and z using the
template method pattern
We call DFS(G, u) with u as the start vertex
We use a stack S to keep track of the path between
the start vertex and the current vertex
As soon as destination vertex z is encountered, we
return the path as the contents of the stack

Path Finding:
Algorithm pathDFS(G, v, z)
1. for each vertex u V[G]
2. do color[u] WHITE
3. Done=FALSE
4. pathDFS-VISIT(G,v,z)

Algorithm pathDFS-VISIT(G, v, z)
1. Color[v] GRAY
2. S.push(v)
3. If (v = z)
4. THEN Done=TRUE
5.
return S.elements
6. for each u Adj[v]
7. do if (color[u] = WHITE)
8.
THEN pathDFS(G,u,z)
9.
if (Done) THEN return;
10. S.pop()
11. Color[v] BLACK

Directed Acyclic Graph (DAG)


A Directed Graph without a cycle.
Undershorts

Socks
Watch
Shoes

Pants

Shirt
Belt

Tie
Jacket
13

a DAG implies an
ordering on events
In a complex DAG, it
can be hard to find a
schedule that obeys
all the constraints.

Topological Sort
For a directed acyclic graph G = (V,E), a topological
sort is a linear ordering of all vertices of G such that
if G contains an edge (u,v), then u appears before v in
the ordering.
A topological sort of a graph can be viewed as an
ordering of its vertices along a horizontal line so that
all directed edges go from left to right.

Topological Sort:Example
Undershorts

Socks
Shoes

Pants

Watch

Shirt
Belt

Tie
Jacket

Socks

Undershorts

Pants

Shoes

Watch

Shirt

Belt

Tie

Jacket

Topological sort
There are often many possible topological sorts of a
given DAG (Directed Acyclic Graph)
Topological orders for this DAG :

1,2,5,4,3,6,7
2,1,5,4,7,3,6
2,5,1,4,7,3,6
Etc.

1
3

2
4

5
7

Each topological order is a feasible schedule.

Topological Sorts for Cyclic Graphs?


1

Impossible!
3

If v and w are two vertices on a cycle, there


exist paths from v to w and from w to v.
Any ordering will contradict one of these
paths

Topological Sort: Algorithm


TOPOLOGICAL-SORT(G)
1. Call DFS(G) to compute finishing time f[v] for each vertex v.
2. As each vertex is finished, insert it onto the front of a linked list.
3. Return the linked list of vertices.

Topological Sort
11/16

17/18

Undershorts

12/15

Socks
Shoes

Pants
Shirt

6/7

Belt

Tie
Jacket

Socks

17/18

1/8

Watch 9/10

13/14

2/5
3/4

Undershorts

Pants

Shoes

Watch

Shirt

Belt

Tie

Jacket

11/16

12/15

13/14

9/10

1/8

6/7

2/5

3/4

All edges of G are going from left to right only

Strongly Connected Components


Strongly connected component of a directed graph
G=(V,E) is a maximal set of vertices U V such that
for every pair of vertices u and v in U, we have both
u v and v u, that is u and v are reachable from
each other.

Strongly Connected Components: Algorithm


STRONGLY-CONNECTED-COMPONENTS(G)
1. Call DFS(G) to compute finishing time f[u] for
each vertex u.
2. Compute GT, transpose of G by reversing all edges.
3. Call DFS(GT), but in the main loop of DFS,
consider the vertices in order of decreasing f[u] as
computed in line 1.
4. Output the vertices of each tree in the depth-first
forest of step 3 as a separate strongly connected
component.

Strongly Connected Components: Example


a

13/14

11/16

1/10

8/9

12/15

3/4

2/7

5/6

g
f
DFS on graph G with 4 SCCs, tree edges are in RED
b

g
f
DFS on Graph GT (transpose of G)

SCC tree
abe

cd

fg

Single Source Shortest Paths

Single Source Shortest Path


Given a weighted directed graph G=(V,E), with
weight function w:ER mapping edges to real
valued weights.
Weight of path p=<v0,v1,v2,vk> is the sum of the
weights of its constituent edges.
k
w(p) = i =1 w(v i 1,vi)
We define the shortest path weight from u to v by
min{w(p): u

(u,v) =

v)} if there is a path from


u to v.
otherwise.

Representing shortest path


Shortest path tree
Rooted at source vertex s,
A directed graph G =(V,E), where V V and
E E, such that
1. V is the set of vertices reachable from s in G.
2. G forms a rooted tree with root s, and
3. For all v V, the unique simple path from s to v in G
is a shortest path from s to v in G.

For each vertex v, we store [v] pointing to its


predecessor vertex. For source vertex s, [s] =NIL

Representing shortest path


u

s 0
5

9
2

3
5
x

11
y

[s] = NIL
[u] = s
[v] = u
[x] = s
[y] = x
(s,s) = 0
(s,u) = 3
(s,v) = 9
(s,x) = 5
(s,y) = 11

Relaxation
Algorithms keep track of d[v], [v]. Initialized as
follows:
INITIALIZE-SINGLE-SOURCE(G, s)
1. for each v V[G] do
2.
d[v] ;
//distance from source
3.
[v] NIL
//parent node
4. d[s] 0
// source to source distance =0

These values are changed when an edge (u, v) is


relaxed:
RELAX(u, v, w)
1. if d[v] > d[u] + w(u, v) then
2.
3.

d[v] d[u] + w(u, v);


[v] u

//new path from s to v through u


is smaller
//set new path

Relaxation
u
5

RELAX(u,v)
u
5

RELAX(u,v)

d[v] > d[u]+w(u,v)

v
6

d[v] <= d[u]+w(u,v)

Dijkstras Algorithm
DIJKSTRA(G, w, s)
1.
INITIALIZE-SINGLE-SOURCE(G,s)
2.
S
3.
QV
4.
while Q
5.
do u EXTRACT-MIN(Q)
6.
SS {u}
7.
for each vertex v Adj[u]
8.
do RELAX(u, v, w)

INITIALIZE-SINGLE-SOURCE(G, s)
1.
for each v V[G] do
2.
d[v] ;
3.
[v] NIL
4. d[s] 0

RELAX(u, v, w)
1.
if d[v] > d[u] + w(u, v) then
2.
d[v] d[u] + w(u, v);
3.
[v] u

Dijkstras Algorithm
DIJKSTRA(G, w, s)
1. for each v V[G] do
2.
d[v] ;
INITIALIZE3.
[v] NIL
SINGLE-SOURCE
4. d[s] 0
5. S
6. QV
7. while Q
// Q is priority queue using minHeap
8. do u EXTRACT-MIN(Q)
9.
SS {u}
10.
for each vertex v Adj[u]
11.
do if d[v] > d[u] + w(u, v)
12.
then d[v] d[u] + w(u, v);
RELAX
13.
[v] u

All Pairs Shortest Path

The All-Pairs Shortest Paths Problem

The Floyd-Warshall Algorithm

Step2: Structure of shortest path

Step2: Structure of shortest path

Step2: Structure of shortest path

Step 3: the Bottom-up Computation

The Floyd-Warshall Algorithm: Version 1

Comments on the Floyd-Warshall Algorithm

The Floyd-Warshall Algorithm: Version 2

Extracting the Shortest Paths

Extracting the Shortest Paths

The Algorithm for Extracting the Shortest Paths

Minimum Spanning Trees

MST: Problem
Undirected, connected graph G = (V,E)
Weight function W: E R (assigning cost or length
or other values to edges)
Cost/weigth of MST: sum of weights of all edges in
MST.
Problem is to find a Minimum spanning tree: tree that
connects all the vertices and having minimum weight.
w(T ) =

( u ,v )T

w(u, v)

Generic MST Algorithm


Generic-MST(G, w)
1 A // Contains edges that belong to a MST
2 while A does not form a spanning tree do
3
Find an edge (u,v) that is safe for A
4
AA{(u,v)}
5 return A
Safe edge edge that does not destroy As property
The algorithm manages a set of edges A maintaining the following loop
invariant

Prior to each iteration, A is a subset of some minimum spanning tree.

At each step, an edge is determined that can be added to A without


violating this invariant. Such an edge is called a Safe Edge.

Kruskals Algorithm
Create a forest of trees from the vertices
Repeatedly merge trees by adding safe edges
until only one tree remains
A safe edge is an edge of minimum weight
which does not create a cycle

Kruskal's Algorithm
Edge based algorithm
Add the edges one at a time, in increasing weight
order
The algorithm maintains A a forest of trees. An
edge is accepted it if connects vertices of distinct
trees
We need a data structure that maintains a partition,
i.e.,a collection of disjoint sets
Make-Set(v): S {v}
Union(Si,Sj): S S {Si,Sj} {Si Sj}
FindSet(S, x): returns unique Si S, where x Si

Kruskal's Algorithm
The algorithm adds the cheapest edge that connects
two trees of the forest
MST-Kruskal(G,w)
1 A // set of edges forming MST
2 for each vertex v V[G] do
3
Make-Set(v)
4 sort the edges of E by non-decreasing weight w
5 for each edge (u,v) E, in order by nondecreasing weight do
6
if Find-Set(u) Find-Set(v) then
7
A A {(u,v)}
8
Union(u,v) // Union of sets containing u and v
9 return A

Prims Algorithm
Vertex based algorithm
It is a greedy algorithm.
Start by selecting an arbitrary vertex, include it
into the current MST.
Grow the current MST by inserting into it the
vertex closest to one of the vertices already in
current MST.
Grows one tree T, one vertex at a time
A cloud covering the portion of T already computed
Label the vertices v outside the cloud with key[v]
the minimum weigth of an edge connecting v to a
vertex in the cloud, key[v] = , if no such edge exists

Prims Algorithm
MST-Prim(G,w,r)
01
02
03
04
05
06
07
08
09
10
11

Q V[G] // Q a priority queue vertices out of T


for each u Q
key[u]
key[r] 0
[r] NIL
while Q do
u ExtractMin(Q) // making u part of T
for each v Adj[u] do
if v Q and w(u,v) < key[v] then
[v] u
key[v] w(u,v)

You might also like