Using SelectIntegrationFeatures() on scATAC data before integrating? #515
danielcgingerich
started this conversation in
Ideas
Replies: 1 comment 1 reply
-
Sure, you could do that. The random sampling approach is not a very good solution and we're working on some better, more scalable solutions for scATAC-seq integration that should be available soon. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
The scATAC integration vignette constructs a common peak set among 2 or more scATAC datasets. Then, peaks are randomly sampled from this set and used for integration.
I was wondering if this is a better alternative:
Run
FindTopFeatures()
and setmin.cutoff = 'q0'
on each dataset. Then runSelectIntegrationFeatures()
and setnfeatures =
however many features you want. As long as this number is less than the number of common peaks between each dataset, the features used will be in common between all datasets. More so, they will be ranked instead of randomly sampled, so you will be using the most informative features to integrate.Beta Was this translation helpful? Give feedback.
All reactions