# Conditional random fields
- Energy minimization in image segmentation

# Beginning of CRFs

- Introduced in 2001 by John Lafferty, Andrew McCallum, and Fernando Pereira [1].
- A framework for building probabilistic models to segment and label sequence data.
![Forbackground.PNG](attachment:37fe8567-91df-441a-b729-3140b4c23c38.PNG)

# Hidden Markov Models

- Sequence models: a sequence of observed outputs depends on a sequence of hidden states.

- Good for pattern recognition tasks: speech, handwriting, and gesture recognition.

- Generative model: it describes how the data was generated in the first place, by modelling the joint distribution of hidden states and observations.


![HMMimg.PNG](attachment:dc791c37-49a5-4c02-b9a4-167d17808444.PNG)
- Weather (X) : Hidden State

- Mood (Y) : Observed State

- Markov Property: The probability of transitioning to a new state depends only on the current state and not on the sequence of previous states.

- Most probable weather given: Happy-Happy-Sad?

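Given the joint distribution $p(x,y) = \prod_i P(y_i|x_i)P(x_i|x_{i-1})$, the most probable hidden sequence is
$x^* = \arg\max_{x_1,\dots,x_n} P(X|Y) = \arg\max_{x_1,\dots,x_n} \frac{P(Y|X)P(X)}{P(Y)} = \arg\max_{x_1,\dots,x_n} \prod_i P(y_i|x_i)P(x_i|x_{i-1})$, since $P(Y)$ does not depend on $x$.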

- Limitations: in image processing, a pixel's label depends not only on the pixel itself but also on the pixels around it.
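
The Happy-Happy-Sad question above can be sketched with Viterbi decoding, which finds the hidden sequence that maximizes the product of emission and transition probabilities. The probabilities below are hypothetical values chosen for illustration; they are not taken from the figure.

```python
# Hypothetical HMM parameters (assumed for illustration only).
states = ["Sunny", "Rainy"]
start = {"Sunny": 0.6, "Rainy": 0.4}                 # P(X_1)
trans = {"Sunny": {"Sunny": 0.8, "Rainy": 0.2},      # P(X_i | X_{i-1})
         "Rainy": {"Sunny": 0.4, "Rainy": 0.6}}
emit = {"Sunny": {"Happy": 0.8, "Sad": 0.2},         # P(Y_i | X_i)
        "Rainy": {"Happy": 0.4, "Sad": 0.6}}


def viterbi(observations):
    """Return the most probable hidden sequence and its joint probability."""
    # best[s] = (probability of the best path ending in state s, that path)
    best = {s: (start[s] * emit[s][observations[0]], [s]) for s in states}
    for obs in observations[1:]:
        new_best = {}
        for s in states:
            # Pick the best predecessor state for s.
            prob, path = max(
                ((best[p][0] * trans[p][s], best[p][1]) for p in states),
                key=lambda t: t[0],
            )
            new_best[s] = (prob * emit[s][obs], path + [s])
        best = new_best
    prob, path = max(best.values(), key=lambda t: t[0])
    return path, prob


path, prob = viterbi(["Happy", "Happy", "Sad"])
print(path, prob)
```

With these assumed numbers the strong Sunny-to-Sunny transition outweighs the final Sad observation, so the decoded weather depends on the whole sequence rather than on each mood in isolation.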

# Markov Random Fields

- Model the joint probability distribution $p(x,y)$ of the variables in the system.


# CRFs

- Can be seen as a more general version of the hidden Markov model.
- Discriminative model: it only learns to tell hidden states apart given an input sequence. It is based on a conditional distribution with an associated graph.
- They model the conditional probability distribution p(y|x) of output variables given input variables.
<!--framework for building a probabilistic model to segment and label sequential data.  -->
![CRFImagesegmentation.PNG](attachment:7d157d18-cc52-42d1-9bc7-fcd7ea4027c2.PNG)


# Graphs and nodes
- In a CRF, data points are represented as a graph $G(V,E)$ consisting of a set of nodes $V$ and a set of edges $E$ between nodes.
- An edge between node $i$ and node $j$ denotes that the labels of data points $i$ and $j$ are dependent.
- The edges are plain connections rather than arrows: the graph is undirected.

- $(X,Y)$ is a CRF when, conditioned on $X$, the random variables $Y_v$ obey the Markov property with respect to the graph:

  $p(Y_v|X,Y_w, w \neq v) = p(Y_v|X,Y_w, w \sim v)$, where $w \sim v$ means that $w$ and $v$ are neighbors in $G$. [1]
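
For an image, the graph $G(V,E)$ and the neighbor relation $w \sim v$ can be sketched as follows. The 4-connected pixel grid is an assumption made for illustration; the definition above holds for any undirected graph, and the helper names are hypothetical.

```python
def grid_edges(height, width):
    """Undirected 4-connected edges between pixel indices of a height x width image."""
    edges = []
    for r in range(height):
        for c in range(width):
            i = r * width + c                    # node index of pixel (r, c)
            if c + 1 < width:
                edges.append((i, i + 1))         # edge to the right neighbor
            if r + 1 < height:
                edges.append((i, i + width))     # edge to the neighbor below
    return edges


def neighbors(node, edges):
    """All nodes w with w ~ node, i.e. nodes that share an edge with it."""
    return sorted({b if a == node else a for a, b in edges if node in (a, b)})


edges = grid_edges(3, 3)
print(len(edges))            # number of edges in a 3x3 grid
print(neighbors(4, edges))   # neighbors of the centre pixel
```

Under the Markov property, the label of the centre pixel depends on the rest of the labels only through the four neighbors returned here.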

![HMMmodel.PNG](attachment:8301e247-2647-44e6-826e-513d4d43e79c.PNG)
Note: here X and Y have been switched from the previous examples

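The conditional probability is defined as $p(x|y) = \frac{1}{Z(y)}\prod_{i} \phi(x_i,y_i) \prod_{i,j} \psi(x_i,x_j)$, where $x$ are the labels being predicted and $y$ is the observed data.

Here $Z(y)$ is an instance-specific normalization function that sums the same product over every possible labeling $x'$: $Z(y) = \sum_{x'} \prod_{i} \phi(x'_i,y_i) \prod_{i,j} \psi(x'_i,x'_j)$
<!-- basically a sum of  \sum p(y',x)-->

Taking the negative log of both sides and dropping the constant $\log Z(y)$, we define an energy function whose minimum is the most probable labeling, writing $\phi_u = -\log\phi$ and $\psi_{u,v} = -\log\psi$:

- $E(x|y) = \sum_{i} \phi_u(x_i,y_i)  + \sum_{i,j} \psi_{u,v}(x_i,x_j)$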

The first sum holds $\phi$, the unary term, which tells us how likely it is that pixel $i$, with observation $y_i$, belongs to class $x_i$.

- Measures the compatibility of each pixel's label with its local appearance features.

The second sum holds $\psi$, the pairwise term, which carries neighborhood information and enforces consistency, e.g. by penalizing a pixel whose label differs from its neighbor's.

- Captures the smoothness or coherence of neighboring pixel labels.
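
The link between the energy and the conditional probability can be checked by brute force on a tiny example. The sketch below assumes a three-node chain, binary labels, a squared-difference unary energy, and a Potts pairwise energy; all of these choices and the numbers are hypothetical and only serve to show that $p(x|y) = e^{-E(x|y)}/Z$ sums to one and that the lowest-energy labeling is the most probable one.

```python
import itertools
import math

labels = [0, 1]
y = [0.1, 0.2, 0.9]          # hypothetical observations, one per node


def unary(x_i, y_i):
    # Low energy when the label agrees with the observation.
    return (x_i - y_i) ** 2


def pairwise(x_i, x_j):
    # Potts term: constant penalty when neighboring labels differ.
    return 0.0 if x_i == x_j else 0.5


def energy(x, y):
    unary_sum = sum(unary(a, b) for a, b in zip(x, y))
    pairwise_sum = sum(pairwise(x[i], x[i + 1]) for i in range(len(x) - 1))
    return unary_sum + pairwise_sum


# Enumerate every labeling x' to compute the normalization Z.
configs = list(itertools.product(labels, repeat=len(y)))
Z = sum(math.exp(-energy(x, y)) for x in configs)
probs = {x: math.exp(-energy(x, y)) / Z for x in configs}

best = min(configs, key=lambda x: energy(x, y))
print(best, probs[best])
```

Enumerating all labelings is only feasible for toy problems, because the number of labelings grows exponentially with the number of pixels. This is why $Z$ is dropped and the energy is minimized directly.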

## DNNs, CNNs and CRFs 
![CRFDNN.PNG](attachment:8d63bfe4-c1a3-4d20-98fb-e231ae75dfa4.PNG) [2]

Deep neural networks (DNNs) have been shown to achieve outstanding results when combined with CRFs.

One way to combine them is to use the outputs of a DNN as the unary potentials of a well-studied CRF model such as DenseCRF, applied as a separate post-processing step.
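
The post-processing idea can be sketched in a few lines. This is not DenseCRF: instead of a fully connected model with mean-field inference, the sketch assumes a 4-connected grid, a Potts pairwise term, and iterated conditional modes (ICM) as a simple greedy minimizer. The per-pixel class probabilities stand in for a network's softmax output and are hypothetical.

```python
import math


def unary_from_probs(probs):
    """Unary energies -log p; probs[r][c] is a list of class probabilities."""
    return [[[-math.log(max(p, 1e-12)) for p in pixel] for pixel in row]
            for row in probs]


def icm(unary, weight=1.0, sweeps=5):
    """Greedy energy minimization with a Potts term on a 4-connected grid."""
    h, w, k = len(unary), len(unary[0]), len(unary[0][0])
    # Start from the per-pixel minimum of the unary energy (the network's prediction).
    labels = [[min(range(k), key=lambda l: unary[r][c][l]) for c in range(w)]
              for r in range(h)]
    for _ in range(sweeps):
        changed = False
        for r in range(h):
            for c in range(w):
                nbrs = [labels[rr][cc]
                        for rr, cc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1))
                        if 0 <= rr < h and 0 <= cc < w]

                def local_energy(l):
                    # Unary cost plus one penalty per disagreeing neighbor.
                    return unary[r][c][l] + weight * sum(l != n for n in nbrs)

                best = min(range(k), key=local_energy)
                if best != labels[r][c]:
                    labels[r][c] = best
                    changed = True
        if not changed:
            break
    return labels


# Hypothetical softmax output: confident background, one weakly "foreground" pixel.
probs = [[[0.9, 0.1] for _ in range(3)] for _ in range(3)]
probs[1][1] = [0.4, 0.6]

unary = unary_from_probs(probs)
before = [[min(range(2), key=lambda l: unary[r][c][l]) for c in range(3)]
          for r in range(3)]
after = icm(unary, weight=1.0)
print(before)
print(after)
```

In this toy case the isolated centre pixel is relabeled to match its neighbors, because the pairwise penalty for disagreeing with four neighbors exceeds the small unary preference for the other class.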

![CRFCNN.jpg](attachment:aef2126f-016e-43db-b7bd-4124df249450.jpg)

Different CRF-based approaches, one for each graph:
- (a) Post-processing CRF
- (b) End-to-end training CRF with predefined features
- (c) Proposed Posterior-CRF, which uses CNN feature maps as CRF reference maps.

(a) Segmentation of arteries in CT: first row shows two axial slices of the CT scan with red arrows indicating indistinguishable boundaries; second row shows the corresponding ground truth of the aorta (yellow) and pulmonary artery (green); 

(b) White matter hyperintensities segmentation in MRI: four examples are shown with the ground truth of the lesions (green), red arrows indicate small isolated lesions that can be easily removed by CRF; 

(c) Ischemic stroke lesions segmentation in MRI: first row shows the ground truth of the lesions (green) where large appearance difference between lesions can be observed (red arrows); second row shows a close-up view of the lesions.[5]

# References
1. John Lafferty, Andrew McCallum, Fernando Pereira (2001). "Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data".
2. Anurag Arnab, Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Måns Larsson, Alexander Kirillov, Bogdan Savchynskyy, Carsten Rother, Fredrik Kahl, Philip H. S. Torr (2018). "Conditional Random Fields Meet Deep Neural Networks for Semantic Segmentation".
3. Charles Sutton, Andrew McCallum (2010). "An Introduction to Conditional Random Fields for Relational Learning".
4. Richard Szeliski (2021). "Computer Vision: Algorithms and Applications, 2nd Edition".
5. Shuai Chen, Zahra Sedghi Gamechi, Florian Dubost, Gijs van Tulder, Marleen de Bruijne (2021). "An End-to-End Approach to Segmentation in Medical Images with CNN and Posterior-CRF".