Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Values of S.chao1 & ACE bigger than total_counts #583

Open
lynneitelson opened this issue Jul 3, 2023 · 1 comment
Open

Values of S.chao1 & ACE bigger than total_counts #583

lynneitelson opened this issue Jul 3, 2023 · 1 comment

Comments

@lynneitelson
Copy link

Hi,

I have a table of OTU's and how many times they were seen on each media:

image

I used the code written below and got strange results. The statistical values of ACE and Chao1 are larger than the total counts (total counts was calculated in the initial excel work sheet by applying =sum to each column).

What might be wrong?

The results:

image

Thank you!

The code:

`library("vegan")

#=============upload and fix the table=================#
media_table <- read.csv("Media.csv")

media_table <- subset(media_table, select = ISP2:MA)
media_table$MA <- media_table$ma

#=================Calculations of chao ace & sobs==================#

community <- as.matrix(media_table)

transposed_matrix <- t(community)
chao <- estimateR(transposed_matrix, "chao")
df_chao <- as.data.frame(chao)

write.csv(df_chao1, file = "X:/MichaI_Shared/users/lynne/Experiment/cultivation/chao+ace/df_chao_ace_sobs.csv")
`

@jarioksa
Copy link
Contributor

jarioksa commented Jul 3, 2023

What looks like wrong? Your results cannot be found from your data table, and I have no idea how you got those results, and what is wrong with them. However, the Chao and ACE values can be larger than the total count – after all, they are estimates of unseen species, and what is unseen is unseen also to total count. I think that for most data sets this happens rarely, but if none of the OTUs has very high counts, and there are many OTUs with count 1 the estimates with unseen species can be much higher than the total count.

The estimated number of unseen species, or the number of species added to the observed number of species is found from the number of species seen only once (singletons) a1 or twice a2 in estimateR as a1 (a1 – 1) / (a2 + 1) / 2. It is easy to see that this can be a large number if there are many OTUs that were seen only once, or a1 is large and a2 = 0 or small.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants