13 packages found

openmetadata-sqlalchemy-bigquery

SQLAlchemy dialect for BigQuery by OpenMetadata
  1. bigquery
  2. sqlalchemy
  3. data-catalog
  4. data-collaboration
  5. data-contracts
  6. data-discovery
  7. data-governance
  8. data-lineage
  9. data-observability
  10. data-profiling
  11. data-quality
  12. data-quality-checks
  13. data-science
  14. data-validation
  15. datadiscovery
  16. dataengineering
  17. dataquality
  18. dbt
  19. hacktoberfest
  20. metadata
  21. metadata-management
  22. snowflake
297 Contributors
0.2.2 | published 3 years ago | MIT

ydata-profiling

Generate profile report for pandas DataFrame
  1. pandas
  2. data-science
  3. data-analysis
  4. python
  5. jupyter
  6. ipython
  7. big-data-analytics
  8. data-exploration
  9. data-profiling
  10. data-quality
  11. deep-learning
  12. eda
  13. exploration
  14. exploratory-data-analysis
  15. hacktoberfest
  16. html-report
  17. jupyter-notebook
  18. machine-learning
  19. pandas-dataframe
  20. pandas-profiling
  21. statistics
116 Contributors
4.16.1 | published 3 weeks ago | MIT

desbordante

Science-intensive high-performance data profiler
  1. anomaly-detection
  2. correlations
  3. data-analytics
  4. data-cleaning
  5. data-cleansing
  6. data-engineering
  7. data-exploration
  8. data-mining
  9. data-mining-algorithms
  10. data-preprocessing
  11. data-profiling
  12. data-science
  13. data-wrangling
  14. exploratory-data-analysis
  15. feature-engineering
  16. feature-extraction
  17. feature-selection
  18. knowledge-discovery
  19. spreadsheets
  20. tabular-data
51 Contributors
2.3.2 | published 2 months ago | AGPL-3.0-only

great-expectations-cta

Always know what to expect from your data.
  1. data
  2. science
  3. testing
  4. pipeline
  5. quality
  6. dataquality
  7. validation
  8. datavalidation
  9. cleandata
  10. data-engineering
  11. data-profilers
  12. data-profiling
  13. data-quality
  14. data-science
  15. data-unit-tests
  16. datacleaner
  17. datacleaning
  18. dataunittest
  19. eda
  20. exploratory-analysis
  21. exploratory-data-analysis
  22. exploratorydataanalysis
  23. mlops
  24. pipeline-debt
  25. pipeline-testing
  26. pipeline-tests
390 Contributors
0.15.43 | published 2 years ago | Apache-2.0

great-expectations

Always know what to expect from your data.
  1. data
  2. science
  3. testing
  4. pipeline
  5. quality
  6. dataquality
  7. validation
  8. datavalidation
  9. cleandata
  10. data-engineering
  11. data-profilers
  12. data-profiling
  13. data-quality
  14. data-science
  15. data-unit-tests
  16. datacleaner
  17. datacleaning
  18. dataunittest
  19. eda
  20. exploratory-analysis
  21. exploratory-data-analysis
  22. exploratorydataanalysis
  23. mlops
  24. pipeline-debt
  25. pipeline-testing
  26. pipeline-tests
396 Contributors
1.4.0 | published 1 day ago | Apache-2.0

pandas-summary

An extension to pandas describe function.
  1. pandas
  2. data
  3. analysis
  4. machine
  5. learning
  6. dask
  7. data-exploration
  8. data-profiling
  9. data-quality
  10. data-quality-checks
  11. data-science
  12. data-visualization
  13. dataframes
  14. dataops
  15. explainable-ai
  16. matplotlib
  17. mlops
  18. pandas-summary
  19. plotly
  20. pytorch
  21. spark
  22. statistics
  23. tensorflow
  24. tracking
90 Contributors
0.2.0 | published 3 years ago | MIT

pandas-profiling

Deprecated 'pandas-profiling' package, use 'ydata-profiling' instead
  1. pandas
  2. data-science
  3. data-analysis
  4. python
  5. jupyter
  6. ipython
  7. big-data-analytics
  8. data-exploration
  9. data-profiling
  10. data-quality
  11. deep-learning
  12. eda
  13. exploration
  14. exploratory-data-analysis
  15. hacktoberfest
  16. html-report
  17. jupyter-notebook
  18. machine-learning
  19. pandas-dataframe
  20. pandas-profiling
  21. statistics
119 Contributors
3.6.6 | published 2 years ago | MIT

great-expectations-experimental

Always know what to expect from your data.
  1. data
  2. science
  3. testing
  4. pipeline
  5. quality
  6. dataquality
  7. validation
  8. datavalidation
  9. cleandata
  10. data-engineering
  11. data-profilers
  12. data-profiling
  13. data-quality
  14. data-science
  15. data-unit-tests
  16. datacleaner
  17. datacleaning
  18. dataunittest
  19. eda
  20. exploratory-analysis
  21. exploratory-data-analysis
  22. exploratorydataanalysis
  23. mlops
  24. pipeline-debt
  25. pipeline-testing
  26. pipeline-tests
388 Contributors
0.1.20240917055 | published 7 months ago | Apache-2.0

sweetviz

A pandas-based library to visualize and compare datasets.
  1. pandas
  2. data-science
  3. data-analysis
  4. python
  5. eda
  6. data-exploration
  7. data-profiling
  8. data-visualization
  9. exploration
  10. exploratory-data-analysis
  11. machine-learning
  12. pandas-dataframe
  13. statistics
10 Contributors
2.3.1 | published 1 year ago | Other

cleanlab

The standard package for data-centric AI, machine learning with label errors, and automatically finding and fixing dataset issues in Python.
  1. machine_learning
  2. data_cleaning
  3. confident_learning
  4. classification
  5. weak_supervision
  6. learning_with_noisy_labels
  7. unsupervised_learning
  8. datacentric_ai
  9. datacentric
  10. active-learning
  11. annotation
  12. data-centric-ai
  13. data-cleaning
  14. data-curation
  15. data-labeling
  16. data-profiling
  17. data-quality
  18. data-science
  19. data-validation
  20. dataops
  21. dataquality
  22. datasets
  23. exploratory-data-analysis
  24. labeling
  25. llms
  26. noisy-labels
  27. out-of-distribution-detection
  28. outlier-detection
  29. weak-supervision
54 Contributors
2.7.1 | published 2 months ago | Other
Showing 1 to 10 of 13 results