🚀 Training-free Efficient Reasoning Online Judge

Design and evaluate your training-free efficient reasoning methods for multi-branch reasoning

📝 Your Code

Available methods:
probe_new() - 开始探测一个新分支
  返回: (answer: str, index: int, is_finish: bool)
  answer: 当前probe得到的答案
  index: 分支索引(用于probe_more)
  is_finish: 该分支是否已完成

probe_more(index: int) - 继续探测指定分支
  返回: (answer: str, is_finish: bool)
  answer: 继续probe得到的答案
  is_finish: 该分支是否已完成

get_new_branch_final_answer() - 获取完整分支的最终答案
  返回: answer: str - 完整分支的最终答案

Your code should assign the final answer to result or answer
Configure parameter ranges to automatically evaluate and plot results.
X-axis: Average Cost, Y-axis: Accuracy
Use {param1} and {param2} as placeholders for parameters.
Example: n_samples = {param1}
Compare two parameter sweep algorithms side by side.
Both algorithms will be evaluated and plotted on the same chart for comparison.

Algorithm 1

Algorithm 2

📊 Results

Write your code and click "Evaluate" to see results here.
Loading example...
This shows example branch probe results from a sample question.