awslabs/deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

ScalaOtherunit-testingscalasparkdataquality
This is stars and forks stats for /awslabs/deequ repository. As of 02 May, 2024 this repository has 2964 stars and 501 forks.

Deequ - Unit Tests for Data Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. We are happy to receive feedback and contributions. Python users may also be interested in PyDeequ, a Python interface for Deequ. You can find PyDeequ on GitHub, readthedocs, and PyPI. Requirements and Installation Deequ depends on Java 8. Deequ version 2.x only runs with Spark 3.1, and vice versa. If you rely on a previous Spark version, please...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
ucb-bar/riscv-sodorScalaPythonMakefile60201510
datastax/spark-cassandra-connectorScalaJavaShell1.9k+29230
slick/slickScala2.6k+16150
typelevel/fs2ScalaOther2.3k05830
prakhar1989/docker-curriculumSCSSHTMLJavaScript5.3k02.1k0
crosstool-ng/crosstool-ngShellCM41.9k06160
six2dez/reconftwHTMLShellPython4.7k+18806+4
apple/swift-collectionsSwiftPythonC++3.1k+6249+1
mapbox/mapbox-maps-iosSwiftShellOther34801310
liuxinyu95/AlgoXYTeXPythonHaskell5.9k07560