nama1arpit/reddit-streaming-pipeline

A real-time reddit data streaming pipeline for sentiment analysis of various subreddits

HCLPythonDockerfileShellkuberneteskafkacassandraterraformgrafanareddit-apiminikubespark-structured-streaming
This is stars and forks stats for /nama1arpit/reddit-streaming-pipeline repository. As of 03 May, 2024 this repository has 55 stars and 2 forks.

Reddit Sentiment Analysis Data Pipeline Project Overview The Reddit Sentiment Analysis Data Pipeline is designed to collect comments from Reddit using the Reddit API, process them using Apache Spark, store the processed data in Cassandra, and visualize sentiment scores of various subreddits in Grafana. The pipeline leverages containerization and utilizes a Kubernetes cluster for deployment, with infrastructure management handled by Terraform. Finally, Kafka is used as a message broker to provide...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
personoids/personoids-liteJavaScriptTypeScriptCSS361+1220
blue-pen5805/sdweb-easy-prompt-selectorJavaScriptPythonCSS407+41450
badlogic/heissepreiseJavaScriptHTMLCSS85601040
afnan47/sem6Jupyter NotebookHTMLPython310290
analogdevicesinc/buildrootMakefilePythonC290530
apache/yunikorn-scheduler-interfaceMakefile200580
animesh/scriptsPerlPythonR3030
lucidrains/soundstorm-pytorchPython1k+7770
Birch-san/mpt-playPython103080
mbj/concordRubyShell112060