data-profiling

As a user,

It would be nice to have the "Observed Value" Field be standardized to show percentages of "successful" validations, vs a mix of 0% / 100%. This causes confusion as there are different levels of validation outputs with different verbage (making someone not used to the expectations confused) I've given an example below in a screenshot for what I mean:

![image](https://user-images.g

Because some user has had problems configuring these services could be helpful to make some examples or videos about how to properly setup Optimus in this services.

This can be useful to get an overview of string structure of a columns
def patterns(self, input_cols, output_cols=None, mode=0):

See https://github.com/ironmussa/Optimus/blob/develop-3.0/optimus/engines/base/columns.py#L153 For more info about the param

The readme links to the 'CKAN standard model', with the URL: http://demo.ckan.org/api/3/action/package%5C_show?id=adur%5C_district%5C_spending

This is currently returning a 400 error:

"Bad request - Action name not known: package\\_show"

data-profiling

Here are 28 public repositories matching this topic...

pandas-profiling / pandas-profiling

great-expectations / great_expectations

Standardize Validation "Observed Value" outputs in HTML Documents

Documentation : BatchKwargs Examples and explaination

Allow result_format to be configurable at the run_validation_operator level

ironmussa / Optimus

Create examples for EMR, dataproc and databricks

Adding NLP functions

Give additional guidance to the user about how to configure env vars

ironmussa / Bumblebee

Show patterns in detail windows

psebenick / data-profiling

ing-bank / popmon

VIDA-NYU / datamart

ahmadassaf / openData-checker

Broken link in readme

hpcc-systems / DataPatterns

mtna / rds-r

gandalf1819 / NYCOpenData-Profiling-Analysis

darenasc / auto-fes

mtna / rds-js

rounayak / Data-Profiling-Tool

giagiannis / data-profiler

mtna / rds-js-examples

BJennWare / hitucc

statsim / profile

jmakeig / data-profile

wosaku / data-profiling-mask-analyzer

camillereaves / subreddit-crossposting

bballamudi / great_expectations

VIDA-NYU / sato

mzj14 / function-dependency-exploration

christianbors / OpenRefineQualityMetrics

bballamudi / Optimus

bballamudi / deequ

p-disha / NYC-Open-Dataset-Analysis

Improve this page

Add this topic to your repo