# GPT Pro Question List (2026-04-10)

## 1) Current execution status (completed)

- Changes were landed in the order given by `chinext50_recalibrate_guidance_for_codex_2026-04-09.md`:
  1. `backtest/frozen_walkforward.py` (candidate scoring, hard constraints, frontier fallback, violation-distance diagnostics)
  2. `config/regime.yaml` (default `candidate_selection` parameters and window success configuration)
  3. `pipelines/real_walkforward_report.py` (report semantics changed so that stitched frozen OOS is the primary comparison)
  4. Corresponding tests added and passing

## 2) Key code changes

- The report now carries three metric sets:
  - `default_strategy_full_sample_metrics`
  - `stitched_frozen_oos_metrics`
  - `baseline_full_sample_metrics`
- `comparison` is split into:
  - `comparison.stitched_oos_vs_baseline`
  - `comparison.default_vs_baseline`
- Legacy compatibility fields are retained and now map to the stitched branch:
  - `annual_return_delta`
  - `annual_return_delta_vs_baseline`
  - `max_drawdown_delta`
  - `max_drawdown_delta_vs_baseline`
  - `drawdown_ratio_vs_baseline`
  - `utility_delta_vs_baseline`
- New window-semantics metrics:
  - `primary_window_count` / `partial_window_count`
  - `primary_window_success_ratio` / `partial_window_success_ratio`
  - `window_success_rule`
- New artifact:
  - `stitched_frozen_oos_ledger.csv`

## 3) Test status

- Targeted: `py -m pytest -q tests/test_frozen_walkforward.py tests/test_real_walkforward_report_pipeline.py`
  - Result: `11 passed`
- Full suite: `py -m pytest -q tests`
  - Result: `81 passed`

## 4) Key metric comparison (old vs. new semantics)

### Old version (default vs baseline)

- Source: `outputs/real_walkforward_recalibrated_20260409_v2/real_walkforward_summary.json`
- `annual_return_delta = -0.079631`
- `drawdown_ratio_vs_baseline = 0.481396`
- `utility_delta_vs_baseline = -0.134472`
- `upside_capture = 0.284889`
- `hard_pass_window_ratio = 0.8`
- `selection_mode_distribution = {'constraint_score': 4, 'utility_fallback_no_hard_pass': 1}`

### New version (stitched + default split)

- Source: `outputs/real_walkforward_recalibrated_20260410_semantic_v1/real_walkforward_summary.json`
- `comparison.stitched_oos_vs_baseline.annual_return_delta = -0.095134`
- `comparison.stitched_oos_vs_baseline.drawdown_ratio_vs_baseline = 0.552714`
- `comparison.stitched_oos_vs_baseline.utility_delta_vs_baseline = -0.098477`
- `comparison.stitched_oos_vs_baseline.upside_capture = 0.340925`
- `comparison.default_vs_baseline.annual_return_delta = -0.079631`
- `frozen_walkforward.primary_window_success_ratio = 0.5`
- `frozen_walkforward.partial_window_success_ratio = 0.0`
- `frozen_walkforward.hard_pass_window_ratio = 0.6`
- `frozen_walkforward.selection_mode_distribution = {'constraint_score': 3, 'frontier_fallback_no_hard_pass': 2}`

## 5) Questions that need precise guidance from GPT Pro

1. Now that the report semantics are correct, should the second round first change `backtest/utility.py` (recalibrating the turnover penalty), or first do the policy mapping (`trend/chop/repair/euphoric_late`)?
2. Given the current stitched OOS performance (annual return still trails the baseline, and the drawdown ratio rose from 0.481396 to 0.552714), please recommend an order for parameter adjustments and a guardrail for each round, including stop conditions.
3. `primary_window_success_ratio=0.5` only just meets the minimum threshold, while `hard_pass_window_ratio=0.6` is on the low side. Which constraints or weights should be adjusted first to bring the hard-pass ratio back to >=0.8 without collapsing back to a single candidate?
4. Is it advisable at this stage to introduce candidate-level probability/robustness filtering (for example, a cross-window stability constraint) to reduce the number of frontier fallbacks?

## 6) Suggested response format for GPT Pro

- Organize the output in three tiers:
  1. Must change (blocking)
  2. Recommended (high impact)
  3. Optional (nice-to-have)
- For each recommendation, please include:
  - Target file
  - Exact parameters/formulas
  - Expected direction of impact
  - Verification metrics and thresholds
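
As an illustration of what a "verification metric and threshold" pair could look like in practice, the following is a minimal sketch of a guardrail check. It assumes the dotted metric names in section 4 correspond to nested keys in `real_walkforward_summary.json`; the helper names `lookup` and `check_guardrails` are hypothetical and not part of the repository. The two thresholds are the ones mentioned in question 3 (0.5 as the minimum for `primary_window_success_ratio`, 0.8 as the target for `hard_pass_window_ratio`).

```python
from typing import Any, Dict, List

# Thresholds taken from question 3 above; the dotted-path layout is an assumption.
GUARDRAILS: Dict[str, float] = {
    "frozen_walkforward.primary_window_success_ratio": 0.5,
    "frozen_walkforward.hard_pass_window_ratio": 0.8,
}


def lookup(summary: Dict[str, Any], dotted_key: str) -> Any:
    """Walk a nested dict along a dotted path (hypothetical helper)."""
    node: Any = summary
    for part in dotted_key.split("."):
        node = node[part]
    return node


def check_guardrails(
    summary: Dict[str, Any], guardrails: Dict[str, float] = GUARDRAILS
) -> List[str]:
    """Return one message per metric that falls below its minimum."""
    failures: List[str] = []
    for key, minimum in guardrails.items():
        value = lookup(summary, key)
        if value < minimum:
            failures.append(f"{key}={value} is below the minimum {minimum}")
    return failures


# Values reported in section 4 for the 20260410_semantic_v1 run.
summary = {
    "frozen_walkforward": {
        "primary_window_success_ratio": 0.5,
        "hard_pass_window_ratio": 0.6,
    }
}

failures = check_guardrails(summary)
for message in failures:
    print(message)
```

With the section 4 values, only the hard-pass ratio is flagged, since 0.5 meets its minimum exactly. In a real run the `summary` dict would presumably be loaded from the summary JSON with `json.load`, and each recommendation could contribute its own entry to `GUARDRAILS`.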