Files
linshen 2cdd095c2b docs(workspace): consolidate compare evaluation specs and acceptance evidence
- fold earlier planning notes into a single current-spec and archived history structure
- keep manual acceptance steps and real API samples aligned with the refactored analysis/result/compare model
- retain supporting workspace notes needed to review version-selection and evaluation behavior changes
2026-03-18 09:35:44 +08:00

40 lines
1019 B
JSON

{
"type": "result",
"score": {
"overall": 50,
"dimensions": [
{
"key": "goalAchievement",
"label": "目标达成度",
"score": 30
},
{
"key": "outputQuality",
"label": "输出质量",
"score": 60
},
{
"key": "constraintCompliance",
"label": "约束符合度",
"score": 40
},
{
"key": "promptEffectiveness",
"label": "提示词引导有效性",
"score": 40
}
]
},
"improvements": [
"在提示词中更明确地强调必须先通过提问澄清用户需求,再进行建议。",
"为 assistant 提供一个结构化的问题列表模板,用于收集用户团队规模、协作方式等关键信息。"
],
"summary": "助理未按提示词要求澄清用户需求就直接给出建议,未能有效遵循引导逻辑。",
"patchPlan": [],
"metadata": {
"model": "dashscope",
"timestamp": 1773729227193,
"duration": 9499
}
}