Qwen3-Max-Thinking state-of-the-art reasoning model at your fingertips!
mattshumer/
Reflection Llama-3.1 70B is trained with a new technique called Reflection-Tuning that teaches a LLM to detect mistakes in its reasoning and correct course. The model was trained on synthetic data.

© 2026 Deep Infra. All rights reserved.