catboost

1.2.8last stable release 1 week ago
Complexity Score
N/A
Open Issues
N/A
Dependent Projects
226
Weekly Downloadsglobal
474,519

License

  • Apache-2.0
    • Yesattribution
    • Permissivelinking
    • Permissivedistribution
    • Permissivemodification
    • Yespatent grant
    • Yesprivate use
    • Permissivesublicensing
    • Notrademark grant

Downloads

Readme

Website | Documentation | Tutorials | Installation | Release Notes

CatBoost is a machine learning method based on gradient boosting over decision trees.

Main advantages of CatBoost:

  • Superior quality when compared with other GBDT libraries on many datasets.
  • Best in class prediction speed.
  • Support for both numerical and categorical features.
  • Fast GPU and multi-GPU support for training out of the box.
  • Visualization tools included.
  • Fast and reproducible distributed training with Apache Spark and CLI.

Get Started and Documentation

All CatBoost documentation is available here.

Install CatBoost by following the guide for the

  • Python package
  • R-package
  • Сommand line
  • Package for Apache Spark

Next you may want to investigate:

  • Tutorials
  • Training modes and metrics
  • Cross-validation
  • Parameters tuning
  • Feature importance calculation
  • Regular and staged predictions
  • CatBoost for Apache Spark videos: Introduction and Architecture

If you cannot open documentation in your browser try adding yastatic.net and yastat.net to the list of allowed domains in your privacy badger.

Catboost models in production

If you want to evaluate Catboost model in your application read model api documentation.

Questions and bug reports

  • For reporting bugs please use the catboost/bugreport page.
  • Ask a question on Stack Overflow with the catboost tag, we monitor this for new questions.
  • Seek prompt advice at Telegram group or Russian-speaking Telegram chat

Help to Make CatBoost Better

  • Check out open problems and help wanted issues to see what can be improved, or open an issue if you want something.
  • Add your stories and experience to Awesome CatBoost.
  • To contribute to CatBoost you need to first read CLA text and add to your pull request, that you agree to the terms of the CLA. More information can be found in CONTRIBUTING.md
  • Instructions for contributors can be found here.

News

Latest news are published on twitter.

Reference Paper

Anna Veronika Dorogush, Andrey Gulin, Gleb Gusev, Nikita Kazeev, Liudmila Ostroumova Prokhorenkova, Aleksandr Vorobev “Fighting biases with dynamic boosting”. arXiv:1706.09516, 2017.

Anna Veronika Dorogush, Vasily Ershov, Andrey Gulin “CatBoost: gradient boosting with categorical features support”. Workshop on ML Systems at NIPS 2017.

License

© YANDEX LLC, 2017-2024. Licensed under the Apache License, Version 2.0. See LICENSE file for more details.

Dependencies

No runtime dependency information found for this package.

CVE IssuesActive
0
Scorecards Score
4.50
Test Coverage
No Data
Follows Semver
No
Github Stars
8,355
Dependenciestotal
10
DependenciesOutdated
1
DependenciesDeprecated
0
Threat Modelling
No Data
Repo Audits
No Data

Learn how to distribute catboost in your own private PyPI registry

pip install catboost
Processing...
Done

109 Releases

PyPI on Cloudsmith

Getting started with PyPI on Cloudsmith is fast and easy.