Skip to content

Commit

Permalink
fixes
Browse files Browse the repository at this point in the history
  • Loading branch information
rabbou committed Mar 14, 2021
1 parent 4a77e0b commit c4a3223
Show file tree
Hide file tree
Showing 3 changed files with 22 additions and 0 deletions.
1 change: 1 addition & 0 deletions README.Rmd
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
# Chinese Restaurant Process Project
Binary file added doc/CRP_figure.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
21 changes: 21 additions & 0 deletions doc/Vignette.Rmd
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
# Chinese Restaurant Process Simulation
###### Jack Potrykus & Ruben Abbou
###### STAT 35450 - Computational Biology
###### Winter 2021

## Motivation

The Chinese Restaurant Process (CRP) is a Bayesian nonparametric method which can be used to cluster microarray gene expression data. Many of the successful techniques which have proven to work for this type of data (such as EXAMPLES) rely on previous knowledge about the data such as the number of clusters. This can be problematic as microarray data may contain important fluctuation and we may not always know the number of clusters we aim to separate. The CRP provides an alternative method which does not require prior knowledge on the data, and assumes an infinite number of latent clusters.

## Method

#### Base Case
Imagine a restaurant with an infinite number of tables, and imagine a sequence of customers entering the restaurant and sitting down. The first customer enters and sits at the first table. The second customer enters and sits at the first table with probability $\frac{1}{1+\alpha}$, and the second table with probability $\frac{\alpha}{1+\alpha}$, where $\alpha$ is a positive real. When the $n$th customer enters the restaurant, she sits at each of the occupied tables with probability proportional to the number of previous customers sitting there, and at the next unoccupied table with probability proportional to $\alpha$. At any point in this process, the assignment of customers to tables defines a random partition.

![CRP_figure](CRP_figure.png)

## Examples

## Exercises

#### Sources

0 comments on commit c4a3223

Please sign in to comment.