Add Seurat/MULTI-seq #6168

mari-ga · 2024-08-13T13:56:21Z

PR checklist

Closes #XXX

This comment contains a description of changes (with reason).
If you've fixed a bug or added code that should be tested, add tests!
If you've added a new tool, have you followed the module conventions in the contribution docs?
If necessary, include test data in your PR.
Remove all TODO statements.
Emit the versions.yml file.
Follow the naming conventions.
Follow the parameters requirements.
Follow the input/output options guidelines.
Add a resource label
Use BioConda and BioContainers if possible to fulfil software requirements.
Ensure that the test works with either Docker / Singularity. Conda CI tests can be quite flaky:
- For modules:
  - nf-core modules test <MODULE> --profile docker
  - nf-core modules test <MODULE> --profile singularity
  - nf-core modules test <MODULE> --profile conda
- For subworkflows:
  - nf-core subworkflows test <SUBWORKFLOW> --profile docker
  - nf-core subworkflows test <SUBWORKFLOW> --profile singularity
  - nf-core subworkflows test <SUBWORKFLOW> --profile conda

mari-ga · 2024-08-13T14:50:06Z

MULTI-seq is a hashing demultiplexing module from Seurat. I created the structure Seurat/MULTI-seq because I intend to add another Seurat hashing demultiplexing module (HTODemux) to the same folder, so that both tools from the same library are located in the same place. Is that right?
These modules need many arguments to run their different functionalities, which I added in the nextflow.config file. Any suggestions about this, especially regarding the tests?
Also, the code used to generate the plots for both tools is very similar. Any suggestions about that?

pinin4fjords

Mostly recommend some tidying up.

I always worry a little about smushing lots of custom stuff (the plots here) into a module that should really be as thin-as-possible a wrapper around some underlying function. But might be a me thing.

Also just wanted to flag the efforts that have gone on to build CLI parts for Seurat, in case you're interested.

pinin4fjords · 2024-08-26T10:27:27Z

modules/nf-core/seurat/multiseq/templates/MULTIseq.R

+
+# All values from ext.args are stored as strings
+# Function to transform strings to the correct class
+convert_element <- function(x) {


Could you move the functions to the top, with the other function? Would help with readability.

Also, add the proper roxygen-style function documentation to all.
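For illustration, a minimal sketch of what roxygen-style documentation on this helper could look like. The body of convert_element is not shown in the diff hunk above, so the conversion rules used here (logical first, then numeric, otherwise keep the string) are an assumption and not necessarily what the PR implements.

```r
#' Convert a string argument to a more specific R type
#'
#' Values passed through `ext.args` arrive as character strings. This helper
#' turns "true"/"false" (any case) into a logical, numeric-looking strings
#' into a numeric, and returns anything else as a trimmed string.
#' NOTE: these rules are assumed for illustration only.
#'
#' @param x A length-one character vector.
#' @return A logical, numeric, or character value of length one. Inputs that
#'   are NULL, NA, or not of length one are returned unchanged.
#' @examples
#' convert_element("0.7")
#' convert_element("false")
#' convert_element("MS-11")
convert_element <- function(x) {
    # Leave NULL, NA, and non-scalar inputs untouched
    if (is.null(x) || length(x) != 1L || is.na(x)) {
        return(x)
    }
    trimmed <- trimws(x)
    lowered <- tolower(trimmed)
    # Logical flags such as autoThresh = "false"
    if (lowered %in% c("true", "false")) {
        return(lowered == "true")
    }
    # Numeric values such as quantile = "0.7"
    number <- suppressWarnings(as.numeric(trimmed))
    if (!is.na(number)) {
        return(number)
    }
    # Everything else (for example "MS-11" or "mean.var.plot") stays a string
    trimmed
}
```

With the documentation block directly above the definition, the helper can sit at the top of the template next to the other functions.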

mari-ga · 2024-09-10

I totally agree with you on keeping the wrapper as thin as possible, and I'll check whether some parts of the CLI can be applied here. I also thought from the beginning that the block of code for the plots could be too large for the module. However, splitting the code would mean adding an "intermediate layer" in which we obtain the demultiplexing results as an RDS object and then produce the plots. The problem is that not all plots included in the tool are available under the CLI.

SPPearce · 2024-09-10T12:25:46Z

modules/nf-core/seurat/multiseq/tests/main.nf.test

+ then {
+ assertAll(
+ { assert process.success },
+ { assert path(process.out.assignment.get(0).get(1)).exists() },


Is there nothing stable in this file to check other than its existence?

mari-ga · 2024-09-13

It is a CSV file that contains different values for each sample. I tried { assert path(process.out.classification.get(0).get(1)).exists() }, but do you have any idea of another test that might be useful here?

SPPearce · 2024-09-10T12:27:54Z

modules/nf-core/seurat/multiseq/tests/nextflow.config

+ assay = "HTO"
+ nfeatures = 2000
+ quantile = 0.7
+ autoThresh = false
+ maxiter = 5
+ qrange_from = 0.1
+ qrange_to = 0.9
+ qrange_by = 0.05
+ verbose = true
+ selection_method = "mean.var.plot"
+ normalization_method = "CLR"
+
+ // Parameters to generate plots
+ group_cells_feature_scatter = "MULTI_ID"
+ feature_scatter_feature_1 = "MS-11"
+ feature_scatter_feature_2 = "MS-12"
+ number_of_features_ridge_plot = 2
+ number_of_cols_ridge_plot = 2
+ group_cells_violin_plot = "MULTI_classification"
+ features_violin_plot = "nCount_RNA"
+ pt_size = 0.1
+ log = true
+ subset_idents = "Negative"
+ subset_invert = true
+ tsne_scale_data_verbose = false
+ run_pca_approx = false
+ run_tsne_dim_max = 2
+ run_tsne_perplexity = 100
+ check_duplicates_tsne = false
+ resolution = 0.6
+ singlet_identities_tsne = "MULTI_classification"


Are these all optional? Does it work without this?

mari-ga · 2024-09-13

For all the parameters before // Parameters to generate plots, the tool already provides default values.
After // Parameters to generate plots, some values, such as feature_scatter_feature_1 = "MS-11", depend on the data and therefore must be given if the option produce_plots is true. Should I include those parameters as inputs in main.nf?
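One way to express the split described above inside the R template is sketched below: defaults are filled in for the demultiplexing arguments, and the data-dependent plot arguments are only required when plotting is requested. The names resolve_args, demux_defaults, and required_plot_args are hypothetical, the default values are copied from the test nextflow.config in this thread, and the list of required plot arguments is an illustrative subset.

```r
# Defaults for the demultiplexing step, copied from the test nextflow.config
# (assumed here to match the tool's own defaults, as stated in the reply).
demux_defaults <- list(
    assay      = "HTO",
    quantile   = 0.7,
    autoThresh = FALSE,
    maxiter    = 5
)

# Data-dependent plot arguments with no sensible default (illustrative subset).
required_plot_args <- c("feature_scatter_feature_1", "feature_scatter_feature_2")

# Hypothetical helper: merge user-supplied arguments over the defaults and
# fail early if plots are requested without the data-dependent names.
resolve_args <- function(user_args, produce_plots = FALSE) {
    args <- utils::modifyList(demux_defaults, user_args)
    if (isTRUE(produce_plots)) {
        missing_args <- setdiff(required_plot_args, names(args))
        if (length(missing_args) > 0) {
            stop(
                "produce_plots is TRUE but these parameters are missing: ",
                paste(missing_args, collapse = ", "),
                call. = FALSE
            )
        }
    }
    args
}

# Demultiplexing only: nothing has to be supplied.
demux_only <- resolve_args(list())

# Plotting requested: the feature names from the data must be given.
with_plots <- resolve_args(
    list(
        quantile = 0.5,
        feature_scatter_feature_1 = "MS-11",
        feature_scatter_feature_2 = "MS-12"
    ),
    produce_plots = TRUE
)
```

Under this arrangement the test config would only need to carry the data-dependent values, which may help answer whether the module works without the rest.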

mari-ga and others added 5 commits August 1, 2024 15:48

multiseq initial script

fd02239

structure for seurat multiseq

03e3b55

multiseq working template

91795bd

prettier

0d6923e

Merge branch 'nf-core:master' into seurat

d6c3c21

mari-ga requested a review from a team as a code owner August 13, 2024 13:56

mari-ga requested review from LeuThrAsp and removed request for a team August 13, 2024 13:56

mari-ga added 7 commits August 13, 2024 16:00

first lint fixes

781d245

Merge remote-tracking branch 'refs/remotes/origin/seurat' into seurat

66acc8f

merge files

fix code, check part of lint

0730684

lint fixes

f38b205

lint fixes

08998da

fix lint

f89a405

fix for error generated by lint

814309f

mari-ga and others added 10 commits August 14, 2024 11:47

singularity lint error

d999a1a

Merge branch 'master' into seurat

a1638d9

new snapshots multiseq

efdb207

Merge remote-tracking branch 'refs/remotes/origin/seurat' into seurat

e7ab7e7

Merge new code

Merge branch 'master' into seurat

36be37a

new snapshots

7fb913c

Merge remote-tracking branch 'refs/remotes/origin/seurat' into seurat

879a0a8

new code merged

Merge branch 'master' into seurat

1e184c0

Merge branch 'master' into seurat

ba7a58a

Merge branch 'master' into seurat

abaf864

SPPearce added the Ready for Review label Aug 23, 2024

pinin4fjords reviewed Aug 26, 2024

SPPearce removed the Ready for Review label Sep 9, 2024

SPPearce reviewed Sep 9, 2024

modules/nf-core/seurat/multiseq/environment.yml (outdated, resolved)

SPPearce and others added 7 commits September 9, 2024 14:24

Update modules/nf-core/seurat/multiseq/environment.yml

d244b93

Merge branch 'master' into seurat

b80af8c

added fixes suggested in comments

0ee0697

Merge remote-tracking branch 'refs/remotes/origin/seurat' into seurat

6f1a254

merged code from master

deleted white trailspace

0fd3fb9

Merge branch 'master' into seurat

3665e2a

Merge branch 'master' into seurat

9391901

SPPearce reviewed Sep 10, 2024

mari-ga and others added 2 commits September 13, 2024 13:09

Merge branch 'master' into seurat

14c7140

fixes

8f767c4

mari-ga requested a review from SPPearce September 13, 2024 12:11

mari-ga and others added 5 commits September 17, 2024 15:09

Merge branch 'master' into seurat

f962cb2

Merge branch 'master' into seurat

da27a20

lint problem fixes

5f8f493

fix linting

6d30fa1

Merge branch 'master' into seurat

01ffeaf

mari-ga requested a review from pinin4fjords September 24, 2024 11:58

Merge branch 'master' into seurat

0de1bd5
