Flexible

Supports regression, classification, ranking and user defined objectives.

Portable

Runs on Windows, Linux and OS X, as well as various cloud Platforms

Multiple Languages

Supports multiple languages including C++, Python, R, Java, Scala, Julia.

Battle-tested

Wins many data science and machine learning challenges. Used in production by multiple companies.

Distributed on Cloud

Supports distributed training on multiple machines, including AWS, GCE, Azure, and Yarn clusters. Can be integrated with Flink, Spark and other cloud dataflow systems.

Performance

The well-optimized backend system for the best performance with limited resources. The distributed version solves problems beyond billions of examples with same code.