Alter-xyz 4339f79a55 feat: add translation A/B testing framework (#564)
Add tools for comparing translation quality between different models
(e.g., Sonnet vs Opus) or prompt variations. Useful for evaluating
translation improvements before deploying changes.

- run_test.py: Test runner with Dify API streaming
- compare.py: Generate similarity reports between variants
- Example spec and documentation included

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-authored-by: Claude <noreply@anthropic.com>
2025-11-28 16:02:13 +08:00
2025-11-27 07:25:53 -08:00
2025-11-27 06:48:35 -08:00
2025-11-26 04:46:31 -08:00
2025-05-29 11:21:19 +08:00
2025-11-26 04:46:31 -08:00
2025-11-26 04:46:31 -08:00
2025-03-11 15:26:25 +08:00
2025-03-14 22:11:49 +08:00
2025-03-11 15:26:25 +08:00
2025-04-24 17:16:59 +08:00
2025-04-02 11:35:34 +08:00
Description
No description provided
CC-BY-4.0 292 MiB
Languages
MDX 90.5%
Python 9.3%
HTML 0.2%