Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • A awesome-python
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 13
    • Issues 13
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 317
    • Merge requests 317
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Packages and registries
    • Packages and registries
    • Package Registry
    • Infrastructure Registry
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • Vinta Chen
  • awesome-python
  • Merge requests
  • !2274

add Pandas-Profiling

  • Review changes

  • Download
  • Email patches
  • Plain diff
Open Alexandre Quemy requested to merge github/fork/aquemy/master into master Dec 16, 2022
  • Overview 0
  • Commits 1
  • Pipelines 0
  • Changes 1

What is this Python project?

pandas-profiling generates profile reports from a pandas DataFrame. The reports are like a richer version of DataFrame.describe with a lot of information that any data scientist is looking for: distribution description, missing values, type inference, zero, high correlation, skewness, etc.

The reports can be generated in PDF or HTML. They contain alerts that give a clear information on the issues with a given dataset.

The library can also generate comparison reports between two datasets, so it can be used to compare two versions of the same dataset, or some new data once a model is in production (sort of model monitoring but for data).

It is well maintained with active development, has 10k stars on GitHub and it is used by many companies, including large ones.

What's the difference between this Python project and similar ones?

As far as I know, there is no alternative library.

--

Anyone who agrees with this pull request could submit an Approve review to it.

Assignee
Assign to
Reviewers
Request review from
Time tracking
Source branch: github/fork/aquemy/master