Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • P pyod
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 144
    • Issues 144
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 16
    • Merge requests 16
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Packages and registries
    • Packages and registries
    • Package Registry
    • Infrastructure Registry
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • Yue Zhao
  • pyod
  • Merge requests
  • !102

Generate Synthesized Categorical Data

  • Review changes

  • Download
  • Email patches
  • Plain diff
Merged Yahya requested to merge github/fork/John-Almardeny/Generate_Categorical_Data into development May 22, 2019
  • Overview 3
  • Commits 4
  • Pipelines 0
  • Changes 3

All Submissions Basics:

Closes #101 (closed)

  • Have you followed the guidelines in our Contributing document?
  • Have you checked to ensure there aren't other open Pull Requests for the same update/change?
  • Have you checked all Issues to tie the PR to a specific one?

All Submissions Cores:

  • Have you added an explanation of what your changes do and why you'd like us to include them?
  • Have you written new tests for your core changes, as applicable?
  • Have you successfully ran tests with your changes locally?
  • Does your submission pass tests, including CircleCI, Travis CI, and AppVeyor?
  • Does your submission have appropriate code coverage? The cutoff threshold is 95% by Coversall.

New Model Submissions:

  • Have you created a _example.py in ~/examples/?
  • Have you lint your code locally prior to submission?

Description

The idea of the algorithm is to create a pool of features based on the number of features passed by user. This pool will be the base of generating all categorical data. Also, the user can specify the number of categories in the normal points and in the outliers. Added to that, the user can specify the number of informative features in the outlier points in which the higher the easier to detect and classify, whereas the lower the more redundant (non-informative and insignificant) the features in the outlier points which makes it more difficult to detect.

Required tests and example have been added.

Assignee
Assign to
Reviewers
Request review from
Time tracking
Source branch: github/fork/John-Almardeny/Generate_Categorical_Data