Reporium

Data-Centric-AI-Community/fg-data-profiling

ydata-profiling

One line of code for data quality profiling and exploratory data analysis of Pandas and Spark DataFrames.
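The "one line" claim can be illustrated with a minimal sketch. It assumes the package is installed under the import name `ydata_profiling` and exposes `ProfileReport` with a `to_file` method, as in published ydata-profiling releases; the helper name `profile_to_html`, its parameters, and the sample rows are hypothetical and not taken from this page.

```python
def profile_to_html(records, output_path="report.html", title="Profiling Report"):
    """Profile a list of row dicts and write the report as an HTML file.

    Hypothetical helper for illustration only; the import name
    `ydata_profiling` is an assumption about the installed package.
    """
    # Imports are deferred so this module loads even where the
    # libraries are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.DataFrame(records)
    # The advertised one-liner: wrap the DataFrame in a ProfileReport.
    report = ProfileReport(df, title=title)
    # Write a standalone HTML report to disk.
    report.to_file(output_path)
    return output_path


# Example call with made-up rows (needs pandas and ydata-profiling installed):
# profile_to_html([
#     {"age": 34, "city": "Lisbon", "income": 42000.0},
#     {"age": 29, "city": "Porto", "income": None},
# ])
```

The helper only wraps the library call; the report contents and any extra options depend on the installed version.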

Upstream: Data-Centric-AI-Community/fg-data-profiling

Builder

Data-Centric-AI-Community • individual

Stars

13,573

Using upstream star count

Forks

1,790

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jan 9, 2016

Project creation date

AI Dev Skills

Unmapped

Automated Reporting, Data Preprocessing, Data Quality Assessment, Data Validation, Data Visualization, Exploratory Data Analysis, Feature Engineering, Statistical Profiling

Tags

Automated Reporting, Data Preprocessing, Data Quality Assessment, Data Validation, Data Visualization, Exploratory Data Analysis, Feature Engineering, Statistical Profiling, AWS, Airflow, CLI Tool, Data Science, Database, Embeddings, Forked, Google Cloud, Healthcare AI, Jupyter, NumPy, Pandas, Python, Spark, Statistics

Taxonomy

AI Trends

Data-Centric AI, AutoML, MLOps

Category

Data Science & Analytics, RAG & Retrieval, MLOps & Infrastructure, Dev Tools & Automation, Cloud & Platforms, Industry: Healthcare

Deployment Context

Self-hosted, Cloud, On-premise, Jupyter Notebooks

Industries

FinTech, Healthcare, E-commerce, Manufacturing, Marketing Analytics, Research

Modalities

Tabular

Skill Areas

Exploratory Data Analysis, Data Quality Assessment, Statistical Profiling, Data Preprocessing, Feature Engineering, Data Validation, Automated Reporting, Data Visualization

Tag

AWS, Airflow, CLI Tool, Data Science, Database, Embeddings, Forked, Google Cloud, Healthcare AI, Jupyter, NumPy, Pandas, Python, Spark, Statistics

Use Cases

Automated Data Quality Reporting, Dataset Profiling, Data Pipeline Validation, Feature Distribution Analysis, Missing Value Detection, Correlation Analysis, Data Documentation Generation, ML Model Input Validation

Recent Activity

Updated 3 months ago

7 Days

0

30 Days

0

90 Days

0

chore(actions): fix permissions for token in merge dev and master (#1813)

Luís Portela Afonso • Mar 3, 2026

82479e9

chore(actions): migrate to data centric ai community (#1812)

Luís Portela Afonso • Mar 3, 2026

8dd7d0c

Quality

Quality
high
Maturity
production

Categories

RAG & Retrieval (Primary), ML Platform & Infrastructure, Data Science & Analytics, Healthcare & Biology, Other AI / ML, MLOps & Infrastructure, Dev Tools & Automation, Cloud & Platforms, Industry: Healthcare

PM Skills

Data & Evaluation, Product Discovery, Developer Platform

Languages

Python 100.0%

Timeline

Project created
Jan 9, 2016
Forked
Mar 22, 2026
Your last push
3 months ago
Upstream last push
1 month ago
Tracked since
Mar 3, 2026

Similar Repos

pgvector cosine similarity · $0
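The page states only that similar repos are ranked by pgvector cosine similarity. As a rough sketch of that measure, the pure-Python example below computes cosine similarity between embedding vectors and ranks candidates by it. How Reporium actually builds or stores its embeddings is not described here, so the function names, the toy vectors, and the `top_k` parameter are all hypothetical. In pgvector itself, the `<=>` operator returns cosine distance, which equals 1 minus the similarity computed below.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_similar(query, candidates, top_k=3):
    """Rank named candidate vectors by similarity to the query, best first.

    `candidates` maps a (hypothetical) repo name to its embedding vector.
    """
    scored = [(name, cosine_similarity(query, vec)) for name, vec in candidates.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy two-dimensional embeddings, invented for illustration only.
toy_repos = {
    "repo-a": [1.0, 0.0],
    "repo-b": [0.0, 1.0],
    "repo-c": [1.0, 1.0],
}
ranking = rank_similar([1.0, 0.1], toy_repos)
```

Real embeddings have hundreds of dimensions, and a database would normally do this ranking in SQL with an index rather than in application code.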
