sparklyr/sparklyr

R interface for Apache Spark

Languages: R, Scala, Other
Topics: machine-learning, r, spark, apache-spark, dplyr, ide, distributed, rstats, sparklyr, livy, remote-clusters
These are the star and fork statistics for the sparklyr/sparklyr repository. As of 29 April 2024, the repository has 916 stars and 305 forks.

sparklyr: R interface for Apache Spark

Install and connect to Spark using YARN, Mesos, Livy or Kubernetes.
Use dplyr to filter and aggregate Spark datasets and streams, then bring them into R for analysis and visualization.
Use MLlib, H2O, XGBoost and GraphFrames to train models at scale in Spark.
Create interoperable machine learning pipelines and productionize them with MLeap.
Create extensions that call the full Spark API or run distributed R code to support new functionality.

Table of Contents: Installation, Connecting...
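The connect, filter, aggregate and collect workflow described above can be sketched roughly as follows. This is a minimal illustration, not taken from the repository's README: it assumes a local Spark installation (for example one downloaded by `spark_install()`), and the table name `mtcars_spark` and the `hp > 100` filter are arbitrary choices made for the example.

```r
library(sparklyr)
library(dplyr)

# Assumption: Spark is already installed locally. If it is not,
# spark_install() downloads a local distribution (needed only once).
# spark_install()

# Connect to a local Spark instance. A cluster deployment would use a
# different master (for example YARN), which is not shown here.
sc <- spark_connect(master = "local")

# Copy a small built-in R data frame into Spark.
# "mtcars_spark" is an illustrative table name.
cars_tbl <- copy_to(sc, mtcars, name = "mtcars_spark", overwrite = TRUE)

# dplyr verbs are translated to Spark SQL and run inside Spark;
# collect() brings the aggregated result back into R as a local tibble.
mpg_by_cyl <- cars_tbl %>%
  filter(hp > 100) %>%
  group_by(cyl) %>%
  summarise(mean_mpg = mean(mpg, na.rm = TRUE), n = n()) %>%
  collect()

print(mpg_by_cyl)

spark_disconnect(sc)
```

Only the small aggregated result crosses from Spark into R, which is the usual reason to call `collect()` last.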