drshahizan/Python-big-data

Python and Pandas are known to have issues around scalability and efficiency. You will learn how to use libraries such as Modin, Dask, Ray, Vaex etc to overcome the problems faced by Pandas.

Jupyter Notebookdata-sciencepandasdaskrayvaexmodin
This is stars and forks stats for /drshahizan/Python-big-data repository. As of 02 May, 2024 this repository has 64 stars and 48 forks.

Don't forget to hit the ⭐ if you like this repo. About Us The information on this Github is part of the materials for the subject High Performance Data Processing (SECP3133). This folder contains general big data information as well as big data case studies using Malaysian datasets. This case study was created by a Bachelor of Computer Science (Data Engineering), Universiti Teknologi Malaysia student. 📚 Course: High Performance Data Processing Python for beginners Web scraping and Python web framework Exploratory...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
DataScience-Lab-Yonsei/DSL-23-1-StudyJupyter NotebookOther10120
Hello-SimpleAI/chatgpt-comparison-detectionPythonJupyter Notebook1.1k0970
rhasspy/larynx2C++PythonJupyter Notebook1.6k01160
theerfan/QJupyter NotebookPython1190560
UBC-CS/cpsc330-2022W2Jupyter NotebookOther180590
RonaldSchlenker/VideHTMLJupyter NotebookF#77030
joisino/otbookJupyter Notebook71080
epfl-ada/2021Jupyter Notebook4501140
dfinke/PowerShellAIPowerShellJupyter Notebook6090860
deepmind/tracrPythonJupyter Notebook4040280