alan-turing-institute/datadiff

Datadiff is diff for data

R
This is stars and forks stats for /alan-turing-institute/datadiff repository. As of 12 May, 2024 this repository has 24 stars and 2 forks.

Overview Tabular data sets are common, and many data processing tasks must be repeated on multiple similar data samples. In practice, however, there may be unexpected changes in structure across different batches of data, which are likely to break the analytical pipeline. Datadiff identifies structural differences between pairs of (related) tabular data sets and returns an executable summary (or "patch") which is both a description of the differences and a corrective transformation. In making comparisons,...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
tokuhirom/akazaRustPerlC200060
devongovett/glob-matchRust2810130
tomhrr/coshRustxBaseOther120020
emmett-framework/granianRustPythonOther9780340
Razikus/its-friday-k8s-admission-controllerRustSmartyPython60040
akaza-im/akazaRustPerlC200060
jcrodriguez1989/chatgptR2870390
aws-samples/amazon-sagemaker-statistical-simulation-rstudioRShellDockerfile7060
aws-samples/rstudio-on-sagemaker-workshopRShell6080
aws-samples/reinvent2020-aim404-productionize-r-using-amazon-sagemakerRShellDockerfile16070