It's not AGI - it's tree of thoughts, driven by some RL-derived heuristics.
I suppose what this type of approach provides is better prediction/planning by using more of what the model learnt during training, but it doesn't address the model being able to learn anything new.
It'll be interesting to see how this feels/behaves in practice.
I suppose what this type of approach provides is better prediction/planning by using more of what the model learnt during training, but it doesn't address the model being able to learn anything new.
It'll be interesting to see how this feels/behaves in practice.