Greedy Algorithm

At each decision point, it chooses the option that looks best right now, without worrying about how it affects future steps. It doesn't always gurantee the absolute best overall solution.

Why Greedy

Short_slighted

It optimizes locally (one node) without planning the global tree struture.

Contrast

A non-greedy approach might test all possible trees (exponential time) to find the absolute best, but that's impractical.

Greedy in GBDT and LightGBM

Each decision tree uses a greedy algorithm to grow
At each node, it picks the slit that best reduces the loss
LightGBM's leaf-wise growth is extra greedy: it only splits the leaf with the biggest potential gain, not all leaves at a level.

The initial feature split has a big influence on the tree's structure and predictions

The initial feature shapes the tree. Greedy Bias, Path dependency.

How to Avoid

Feature Enginnering Create better features to make the root split more meaningful.
Limit Tree Depth A shallow tree relies less on the root split, reducing its dominance.
Ensamble Methods A single tree's wrong split is corrected by the ensemble.
Randomness feature_fraction in LightGBM randomly consider only a subset of features at each split.
Regularization