[nexa] Learning to reason with LLMs