ddotta/awesome-polars

A curated list of Polars talks, tools, examples & articles. Contributions welcome !

nodejspythonrustawesomecollectionrstatsawesome-listcurated-listpolars
This is stars and forks stats for /ddotta/awesome-polars repository. As of 09 May, 2024 this repository has 407 stars and 14 forks.

A curated list of Polars docs, talks, tools, examples & articles the internet has to offer. Polars is a lightning-fast DataFrame library for Rust, Python, Node.js and R. Implemented in Rust, Polars uses Apache Arrow Columnar Format as the memory model. Contributions of any kind welcome! Just follow the guidelines by either: Filling a suggestion issue (easier). Opening a pull request. To see the latest entries in the list : click here ! Awesome Polars Awesome Polars Official documentation Libraries/Packages/Scripts Python Rust R Node.js Scala/Java Ruby Resources Cheat Sheets Tutorials & workshops Blog posts Talks and videos Follow : Official Contributing Official documentation Documentation - Official user guide for Python, Rust and R. Documentation for Python API - Official API Reference for Python. Documentation for Rust API - Official API Reference for Rust. Documentation for Node.js API - Official API Reference for Node.js. Documentation for R API - Official API Reference for R (WIP). Github: Polars Github Organization - Official Polars Github repository. Blog posts from Polars - Official blogs posts from Polars. Keynote on Polars at EuroSciPy 2023 ⏳ 57 min - Talk by @ritchie46 that dives into Polars and sees what makes it so efficient. It will touch on technologies like Arrow, Rust, parallelism, data structures, query optimization and more. Talk about Polars at EuroPython Conference 2023 ⏳ 28 min - Talk by @ritchie46 that introduces Polars and some of its design decisions. NEWS (August 2023): Polars announces that it has raised a a $4M seed round! Libraries/Packages/Scripts Python polars for Python - Python polars package to use polars DataFrame from Python. tidypolars tidypolars python library built on top of polars library that gives access to methods and functions familiar to R tidyverse users. Working with Polars and XlsxWriter - Guide to using the Python XlsxWriter library with Polars to create Excel reports. Python package seaborn_polars - Python package to plot Polars DataFrames and LazyFrames with seaborn by @pavelcherepan. Python package functime - Machine learning Python package built on Polars for time-series predictions by @neocortexdb. According to the developpers, it's the world's fastest and most feature-full machine learning forecasting library ! Rust polars for Rust - Rust polars crate to use polars DataFrame with Rust. Polars CLI Polars CLI is a command line interface for running SQL queries with Polars as backend. GeoPolars Geopolars Rust crate that extends the Polars DataFrame library for use with geospatial data. R rpolars for R - R rpolars package to use polars DataFrame from R. tidypolars for R tidypolars package to use polars with tidyverse syntax. Node.js nodejs-polars for Node.js - Node.js nodejs-polars package to use polars DataFrame from Node.js. Scala/Java scala-polars for Scala and Java - Scala - Java scala-polars is a library for using Polars in Scala and Java projects by @chitralverma. Ruby polars for Ruby - Ruby polars-df gems to use Polars with Ruby. Resources Cheat Sheets Polars Cheat Sheet - A Polars Cheat Sheet by @FranzDiebold. Tutorials & workshops Modern Polars - A side by side comparison between Polars and Pandas containing code in both frameworks by @kevinheavey. Polars: um simples mas prático tutorial - Tutorial in the format of an ipynb notebook that illustrates several features of Polars in Portuguese by @barbosarafael. Associated github repository is here. A Practical Comparison of Polars and Pandas - A tutorial that showcases several common operations in Pandas and Polars side by side to demonstrate how much easier Polars is by @FlorianWilhelm. There is also an accompanying Jupyter notebook available. Prise en main de Polars - A notebook tutorial in French that illustrates the main features of Polars by @romaintailhurat and @linogaliana. There is also an accompanying blog post . Running Polars code distributedly - A page that explains how to rung Polars code distributedly with Fugue by @fugue-project. Converting SQL Queries to Polars DataFrames with JupySQL - A tutorial that explains how to convert SQL queries to Polars DataFrames using JupySQL by @ploomber. How to display Polars dataframes with itables - A tutorial that explains how to display Polars dataframes with itables by @mwouts. Rust Polars: Unlocking High-Performance Data Analysis — Part 1 - First part of an article that explores the world of Rust’s Polars and explain some basic concepts of Polars such as Series by @wiseaidev. Code used is available on Github here. Fast String Processing with Polars — Scam Emails Dataset - A tutorial using Polars to implement a text processing pipeline process by @AntonsRuberts. Code used is available on Github here. Cookbook Polars for R - A side-by-side comparison of Polars, R base, dplyr and data.table packages by @ddotta. Polars Workshop on AWS - A comprehensive workshop comparing Polars to Pandas, exploring a wide range of functions and features by @debnsuma. Polars cookbook in Python - This cookbook is a fork of the popular pandas-cookbook and has been modified to use the polars library. By @escobar-west, it uses real-world examples with "all the bugs and weirdness that entails." Data Pipelines with Polars: Step-by-Step Guide - A tutorial that explains how to build data pipelines with Polars by @AntonsRuberts. Code used is available on Github here. Python Polars: A Lightning-Fast DataFrame Library - A tutorial that shows how to use Polars with Python ecosystem by @hfhoffman1144. Code used is available on Github here. Blog posts Using the Polars DataFrame Library - A blog post by Wei-Meng Lee to discover the basics of Polars and how it can be used in place of Pandas. Why Polars uses less memory than Pandas - A blog post by Itamar Turner-Trauring detailing some techniques to opptimize Pandas memory usage and see how Polars can provide an answer in some cases. Plodding with Polars in Python - A blog post by @amitrathore that introduces some basic features of Polars. Polars-lazy - A blog post by @ritchie46 and @jorgecarleitao that introduces Polars' lazy API in Rust. Series of posts on Polars - A series of blogpost on Polars usage with a lot of useful tricks and information by @braaannigan. Moreover, Liam also has a Data Analysis with Polars course on Udemy. Youtube videos about Polars - A series of short youtube videos about Polars by @braaannigan Alternatives to Pandas: Python Polars - An article that explores the Python Polars module as an alternative to Pandas, comparing their similarities and differences and providing some examples by @JohnLockwood Pandas vs Polars - A comparison on File I/O - A blog post that evaluates Polars and Pandas in terms of I/O performance and speed when handling large datasets by Wes Poulsen. Discover the Power of Polars Library for Fast and Efficient Data Manipulation in Python - An article that quickly compares Polars to other data manipulation libraries by Morpheus. Why is Polars All The Rage? - An article that explains why Polars will become very popular by SeattleDataGuy and Daniel Beach. Polars vs Pandas — 4 key concept differences - An article that helps to understand key differences between Pandas and Polars Data Science libraries by @DataScienceDisciple. Replacing Pandas with Polars. A Practical Guide - A blog post comparing some common functions between Pandas and Polars by @danielbeach. Code used is available on Github here. Dataframe Showdown – Polars vs Spark vs Pandas vs DataFusion. Guess who wins? - A short article that presents a performance test between Polars, Pandas, Datafusion and Spark on a csv dataset by @danielbeach. Code used is available on Github here. 7 Easy Steps To Switch From Pandas to Lightning Fast Polars And Never Return - A cheat sheet blog post of the most common Pandas operations translated into Polars by @BexTuychiev. Polars for initial data analysis, Polars for production - A blog post that shows how to use Polars for initial data analysis and then effectively in production by @itamarst. DuckDB vs Polars vs Spark! - An article that performs a benchmark against duckdb/Polars/spark, with varying row count, with swap usage as another metric, in addition to runtime in seconds. Code used is available on Github here. Benchmarking PySpark Pandas, Pandas UDFs, and Fugue Polars - A blog post that compares the execution time of fugue + Polars, Pandas UDFs and PySpark Pandas by @kvnkho. Pandas vs. Polars: The Battle of Performance - An another blog post that compares the performance between Pandas and Polars across a range of common data manipulation tasks by @makeuseofcode. Code used is available on Github. Pandas 2.0 vs Polars: The Ultimate Battle - A blog post that analyzes in terms of Syntax, Speed, and Usability between Pandas 2.0 and Polars 0.17.0 by @priyanshu7401. Polars - modern data frame library - A blog post that describes why Polars could be a better alternative to pandas, dplyr or data.table by @DSkrzypiec. The fastest way to read a CSV file in Python - A blog post that compares different ways (including Polars, pyarrow and C) to read a CSV file with Python by Finn Andersen. Pandas vs Polars vs Pandas 2.0 …. FIGHT - A blog post that does an ETL process for checking big data speed processing between Pandas, Pandas 2.0 and Polars by @guoliveira. Pandas vs Polars vs Pandas 2.0 … ROUND 2) - A blog post that makes a new comparison between Pandas, Pandas 2.0 and Polars by @guoliveira. Polars VS PySpark: Lazy Evaluation and Big Data - A blog post that compares lazy evaluation between Polars and Spark by @guoliveira. Polars in the aRtic! - An another blog post that compares the performance between Pandas and Polars across a range of common data manipulation tasks by @MCodrescu. Code used is available on Github. A Polars exploration into Kedro - A blog post that explains how Polars can be used instead of pandas in Kedro for your data catalog and data manipulation by @astrojuanlu. High Performance Data Manipulation in Python: pandas 2.0 vs. polars - A blog post that compares differences between Python pandas 2.0 and Polars libraries by @jcanalesluna. Lightning-fast queries with Polars - Another blog post that is a good introduction to Polars by @astrojuanlu. Polars – Laziness and SQL Context. - A blog post that presents two good reasons to adopt Polars : Lazy and SQL Context by @danielbeach. Exploring Polars - The Lightning-Fast DataFrame Library in Python - A blog post on the basics of Polars by @mddas. Pandas vs Polars – Speed Comparison - A blog post that compares the performance of Polars, Pandas and Pandas 2.0 by @StuffbyYuki. Code used is available on Github here. LazyFrame vs DataFrame in Polars – Performance Comparison - A blog post that introduces what LazyFrame is in Polars and its performance gain compared to DataFrame by @StuffbyYuki. Code used is available on Github here. Querying Polars DataFrames using SQL - A blog post that shows how to use the SQLContext object in Python to query a Polars DataFrame directly using SQL by @weimenglee. Polars vs Pandas: A Brief Tale of Two DataFrame Libraries - A blog post that compares Polars and Pandas focusing in particular on optional dependencies by @ranggakd. EDA with Polars: Step-by-Step Guide for Pandas Users (Part 1) - A blog post that describes the main data processing operations with Polars in Python by @AntonsRuberts. Code used is available in this notebook. EDA with Polars: Step-by-Step Guide to Aggregate and Analytic Functions (Part 2) - A blog post that shows how to perform with Polars and Python some fairly complex aggregates, rolling statistics and more by @AntonsRuberts. Code used is available in this notebook. Pyspark or Polars — What should you use? - A blog post that explores and breaks down some of the similarities between PySpark and Polars. It provides insights on when to choose one over the other by Vivek Kovvuru. Getting Started with the Polars Data Manipulation Library - A blog post that presents some simple features of Polars using Python by Juveriya Mahreen. 8 ways pandas really losing to Polars for quick market data analysis - A newsletter that compares the performance of Polars to Pandas for many common data manipulation techniques by PyQuant News. All that Polars that Make You Forget Pandas - A blog post that explores some deeper reasons behind the performance gains of Polars over Pandas. Polars vs Pandas. Inside an AWS Lambda - A blog post that covers the topic of using Polars vs Pandas inside an AWS Lambda to do data processing by @danielbeach. Code used is available on Github here. DuckDB vs Polars for Data Engineering - A blog post that compares Polars and DuckDB with pipelines for Data Engineering by @danielbeach. Pandas vs Polars: A database speed test. Who wins? - A blog post that compares the run-time of reading a database into a dataframe using Pandas versus using Polars by Thomas Reid. Polars and Pandas : What's the difference ? - A blog post that explains how Polars works under the hood and th best use cases for Polars and Pandas by @t-redactyl. Understanding the Polars nested column types - A blog post that helps to understand how nested column types works in Polars by @braaannigan. Polars vs DuckDB for Delta Lake ops - A blog post that compares Polars to DuckDB using Delta Lake by @wolliq. Enhancing Data Analytics with Polars and MinIO - A blog post that explains how to use Polars with Minio’s open-source object storage by @IndexSeek. Using Polars with Snowflake - A blog post that shows how to use Polars with Snowflake by @IndexSeek. Partitioning Polars DataFrame on S3 with Apache Arrow - A blog post that explains how to partition large Polars DataFrames in AWS S3 by Matteo Arellano. Goodbye Spark. Hello Polars + Delta Lake - An article that presents how to use Polars in addition to Delta Lake by @danielbeach. How to learn Polars with ChatGPT? - An article that explains how to learn fundamental Polars concepts with ChatGPT by Suhith Illesinghe. Talks and videos Polars: Blazingly Fast DataFrames in Rust and Python ⏳ 37 min - Introduction to Polars by databricks. Polars: The Next Big Python Data Science Library... written in Rust? ⏳ 14 min - A short video tutorial to get started coding with Polars by @RobMulla. The Last Polars Dataframe vs. Pandas Dataframe Video You Should Ever See ⏳ 19 min - A video that compares Polars and Pandas data frames. The Best library for building Data Pipelines... ⏳ 12 min - A video that compares Pandas, Spark and Polars for working with data in Python by @RobMulla. Expressive & fast dataframes in Python with polars ⏳ 28 min - A video that reviews some alternatives to Pandas for Python and then demonstrates some Polars features by Juan Luis Cano Rodríguez. Polars Introduction for Python with a 128GB Ryzen 24-core Benchmark vs Pandas ⏳ 12 min - A video that provides an introduction to Polars for Python by @jeffheaton. Notebook used for the video in this github repo. Polars: The main alternative to pandas in Python!? ⏳ 57 min - A Polars tutorial series on Youtube by @martinbel. Notebooks and datasets used for the videos available in this github repo. Polars vs Pandas ⏳ 37 min - A detailed video on Youtube that compares Polars and Pandas by @hu-po. Polars: The Super Fast Dataframe Library for Python — Goodbye Pandas? - An article and a video ⏳ 19 min that explores some basic features of Polars by @ifrankandrade Manipulación de Datos con Polars en python ⏳ 51 min - A detailed tutorial video in Spanish that shows 20 Polars functions to perform 80% of the tasks of a data scientist by Naren Castellon. An opinionated introduction to Polars - Great Polars introduction slides from @krlng at PyCon 2023. Polars - make the switch to lightning-fast dataframes ⏳ 30 min - A talk that reports an experience switching from Pandas to Polars in a real-world ML project by @datenzauberai. Slides are available here. Polars vs Pandas | detailed test with explained results ⏳ 22 min - A video that presents 8 distinct tests which demonstrates differences between Pandas and Polars by @vb100. Associated github repo is here. Polars: A highly optimized dataframe library ⏳ 20 min - A video that presents some mains features of Polars by @mattharrison. How to update mass data using Polars DataFrame ⏳ 9 min - A video that presents the process of writing code to update mass columns across CSV or data files by @AmitXShukla. Notebook used for the video in this github repo. Using the Rust Polars DataFrame library in a CLI ⏳ 4 min - A video that shows how to integrate Polars in a commande line interface by @paiml. The Ultimate Guide to Data Wrangling with Python | Rust Polars Data Frame ⏳ 10 videos - A playlist of 10 videos (WIP) that equips you with all the necessary knowledge required to utilize Python Polars Data Frame by @AmitXShukla. Polars DataFrame ⏳ 41 min - A video that shows some basic manipulations with Polars and Python by @vedica1011. Notebook used for the video in this github repo. Why I switched grom Pandas to Polars ⏳ 53 min - A workshop that breaks down the 3 reasons why you could switched from Pandas to Polars by @bfeif. Notebook used for the video in this github repo. Delimiters in Python Polars ⏳ 15 min - A video that explains how to use delimiters in Python Polars by @CodeKlaudia. Intro to Polars ⏳ 7 videos - A playlist of 7 videos that introduces the basic concepts of Polars (DataFrames, filtering, splitting...) by Joram Mutenge. Machine Learning with Polars ⏳ 6 videos - A playlist of 6 videos that analyzing and cleaning data using Polars to train machine learning models by Joram Mutenge. Follow : Official pola-rs (@pola-rs) - Github organisation for Polars (Twitter: @DataPolars). Ritchie Vink (@ritchie46) - Author of Polars Stijn de Gooijer (@stinodego) - Member of Polars organisation Danny van Kooten (@dannyvankooten) - Member of Polars organisation Søren Havelund Welling (@sorhawell) - Member of Polars organisation Alexander Beedie (@alexander-beedie) - Contributor to Polars projects Marco Edward Gorelli (@MarcoGorelli) - Contributor to Polars projects Eitsupi (@eitsupi) - Contributor to R Polars project Etienne Bacher (@etiennebacher) - Contributor to R Polars project Contributing Damien Dotta (@ddotta) - Maintainer of Awesome Polars list Thanks goes to these contributors!
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
chiphuyen/just-pandas-thingsJupyter Notebook89701330
rust-nostr/nostrRustMakefileJavaScript2710580
kriasoft/react-starter-kitTypeScriptJavaScriptHTML22.2k04.2k0
maximecb/uvmRustCPython3390130
janhohenheim/foxtrotRustJavaScriptOther3160240
WardPearce/paasterSvelteTypeScriptPython3640150
rizemon/exploit-writing-for-oswe2720680
filipecalegario/awesome-generative-ai1.4k+13157+1
MakiNaruto/Automatic_ticket_purchasePythonJavaScript3k06640
gelstudios/gitfitiPython7.8k01.1k0