DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints
DeepPlanning benchmark addresses limitations of current LLM planning assessments by introducing complex, real-world tasks requiring both global optimization and local constraint reasoning.
DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints
DeepPlanning benchmark addresses limitations of current LLM planning assessments by introducing complex, real-world tasks requiring both global optimization and local constraint reasoning.
