Efficient Package for Huge datasets in R

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

Efficient Package for Huge datasets in R

Shylashree U.R
Dear Sir/Madam,

I have a large data set of 10,17,289 observations of 10,830 variables. I
need to use PCA to reduce the dimension of dataset. I have already tried
irlba, prcomp and nsprcomp packages in R but couldn't do for huge data
sets.

i.e pc <- prcomp_irlba(sparseYY[1:5000,], n=50, retx = TRUE, center = TRUE,
scale. = FALSE)
able to get only few PCs for 5000 rows only

so can you please help me what package can i use to do PCA in R for large
dataset?


Thanks and Regards

        [[alternative HTML version deleted]]

______________________________________________
[hidden email] mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.