Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data preprocessing #6

Open
2001229323jyf opened this issue Dec 21, 2023 · 1 comment
Open

Data preprocessing #6

2001229323jyf opened this issue Dec 21, 2023 · 1 comment

Comments

@2001229323jyf
Copy link

Hello, it is great work!

However, I encountered difficulties when attempting to convert the dataset I found into the .h5ad format and integrating it into your project. I observed that you provided preprocessing methods for the data in the project as follows:

1.Exclude spots with less than 500 UMIs and genes expressed in less than 3 spots
2.Normalize the expression matrix with the LogNormalize method in Seurat.
3.Annotate the cell types by label transfer (the TransferData function in Seurat) with single-cell breast cancer dataset GSE118390 as reference dataset.
4.Deconvolution results stored at adata.obsm['predicted_cell_type']
Cell-type label (the max value) stored at adata.obs.cell_type

If you could provide some code guidance or suggestions on data preprocessing in these aspects, or if you could offer more well-processed data in the .h5ad format, I would be greatly appreciative.

@lhc17
Copy link
Owner

lhc17 commented Jan 1, 2024

First of all, thank you for using it~ My preprocessing steps are following the standard process of Seurat and Scanpy. Can you tell me exactly which step you encountered an error after converting the dataset to h5ad, and can you perhaps provide me with the structure of your current ‘adata’?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants