-
Notifications
You must be signed in to change notification settings - Fork 140
Interpreting communication probability values #636
Comments
I was wondering something similar. Since many interactions seem to have very low probability. I guess this is the reason why they chose to put in the figure's key/legends a range from 0 to max in both the paper and the tutorials. After reading the paper, in the methods section they mention: (https://www.nature.com/articles/s41467-021-21246-9) |
Thank you! Great advice. I had forgotten about that part in the paper. |
Hi,
Thanks for writing and maintaining this useful tool.
I am having some trouble interpreting the communication probability values output by the package.
My collaborator wanted me to examine interactions between T cells and DCs in an experiment. I produced bubble plots using the
![Screenshot 2023-06-13 at 11 45 32 am](https://private-user-images.githubusercontent.com/47837238/245327738-442aabde-1d9c-42b5-9c13-956f398a1ed2.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjA3NzA2MjcsIm5iZiI6MTcyMDc3MDMyNywicGF0aCI6Ii80NzgzNzIzOC8yNDUzMjc3MzgtNDQyYWFiZGUtMWQ5Yy00MmI1LTljMTMtOTU2ZjM5OGExZWQyLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA3MTIlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNzEyVDA3NDUyN1omWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTFlZWM1YjlhOWYzM2ZiNzJlMzA5NzJkMTZhNjUxOTEzYTZhMGJmZTczNzg4NDQzMzMxMDFkZmM3MDZjY2Q3MDEmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.ItaE_V67xwpZisRE6g_sS4NyClvLLTeuzhuZyfhfQe0)
netVisual_bubble()
function and got the below plot:The collaborator then wanted to focus on the CD28 pathways, so I re-plotted without those paths and got the below:
![image](https://private-user-images.githubusercontent.com/47837238/245327059-86fdceae-65b3-4dd8-818d-74e76c48b5a4.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjA3NzA2MjcsIm5iZiI6MTcyMDc3MDMyNywicGF0aCI6Ii80NzgzNzIzOC8yNDUzMjcwNTktODZmZGNlYWUtNjViMy00ZGQ4LTgxOGQtNzRlNzZjNDhiNWE0LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA3MTIlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNzEyVDA3NDUyN1omWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWFiZmQ1NDA2NjcyYWQyMmE0NzNhNThjZmE0NTdjNjUwNzMxMjZlNDNiNmIzNWJhNzNhMjc0Zjc1ZWQxNTcxN2ImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.x0LI517iYJJquLJlP95ETXcT1TaFrb7wDJVHQQ8kKL8)
The key thing to point out is that for CD86-CD28, the colour label changes from blue/green (min probability) to red (high probability) across the plots. That's happening because the high probability MHC interactions are being removed, so the dynamic range of probability values is squished down.
But when I pull out the actual probability of CD86-CD28 in this dataset I get these values:
which are quite low! So having a high communication probability as shown on the plot doesn't make much sense.
But when I look at the entire range of communication probability values from the entire analysis I get a histogram like this:
![image](https://private-user-images.githubusercontent.com/47837238/245326495-075a8ec9-45a2-460c-a598-ca834becf673.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjA3NzA2MjcsIm5iZiI6MTcyMDc3MDMyNywicGF0aCI6Ii80NzgzNzIzOC8yNDUzMjY0OTUtMDc1YThlYzktNDVhMi00NjBjLWE1OTgtY2E4MzRiZWNmNjczLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA3MTIlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNzEyVDA3NDUyN1omWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTcwMDQ4YzhkNzA3YzljYjdiNWZiMDA5MjNlNWUyYmNhNzI4ZTVhZDUyNzk3NGI0N2M3ZGIyYjE0Mjc3OWU4ZjUmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.rHTBByoGXyCmCK0rFhjGhkicXGLL7MP4HeV0djDHOg4)
So there are no very high probabilities. How, then, should I interpret the communication probabilities that are output when the probabilities never pass 0.30? What range should I be looking at to infer high-confidence interactions? Can I infer that there is communication happening at a probability of 0.04 or is likely to be a false discovery? I note that the p-values are significant for the plots above so I'm also wondering how I use commun.prob and p-value to interpret the plots in conjunction.
I'll be grateful for you advice.
The text was updated successfully, but these errors were encountered: