Fix DirectAttackSimulator randomization_seed inconsistency by ensuring deterministic template ordering #42203

Copilot · 2025-07-24T15:53:42Z

Problem

Users running SafetyEvaluation with direct_attack evaluator twice using the same randomization_seed were getting different query sets. They expected 200 matching queries but only got 100 matches, indicating non-deterministic behavior despite using identical seeds.

Root Cause

The issue was in AdversarialTemplateHandler._get_content_harm_template_collections() where templates were processed in non-deterministic order:

# Before (problematic)
for key, value in plist.items():  # Dictionary iteration order not guaranteed
    if value["category"] == template_category:
        # Process template...

This caused:

Templates retrieved from service could be stored in different dictionary orders between calls
Different template processing order led to different parameter zipping in AdversarialSimulator
Same randomization_seed produced different query sets across runs

Solution

Changed template processing to use sorted keys for deterministic ordering:

# After (fixed)
# Sort keys to ensure consistent ordering across different calls
# This ensures that templates are processed in the same order regardless of
# how they were retrieved from the service or stored in the dictionary
for key in sorted(plist.keys()):
    value = plist[key]
    if value["category"] == template_category:
        # Process template...

Impact

✅ Fixed randomization inconsistency: Same randomization_seed now produces identical query sets
✅ Zero breaking changes: Existing functionality preserved
✅ Minimal code change: Only 1 line modified with maximum impact
✅ Deterministic behavior: DirectAttackSimulator results are now reproducible

Testing

Added comprehensive test coverage:

test_template_handler_determinism.py: Validates template ordering consistency
test_direct_attack_determinism.py: Tests DirectAttackSimulator deterministic behavior

All tests confirm the fix resolves the issue:

Template ordering is deterministic regardless of service response order
Parameter zipping produces consistent results
Same randomization_seed produces identical query sets (100% match rate)

Example Usage

# This now works correctly:
outputs1 = await safety_evaluation(
    evaluators=[_SafetyEvaluator.DIRECT_ATTACK],
    target=target_function,
    randomization_seed=1
)

outputs2 = await safety_evaluation(
    evaluators=[_SafetyEvaluator.DIRECT_ATTACK], 
    target=target_function,
    randomization_seed=1  # Same seed
)

# Before fix: Only ~50% queries matched
# After fix: 100% queries match ✅

Closes issue where DirectAttackSimulator with identical randomization_seed produced different results.

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

…g deterministic template ordering Co-authored-by: slister1001 <103153180+slister1001@users.noreply.github.com>

Co-authored-by: slister1001 <103153180+slister1001@users.noreply.github.com>

Initial plan

6bf5ad5

Copilot AI assigned Copilot and slister1001 Jul 24, 2025

Copilot started work on behalf of slister1001 July 24, 2025 15:53 View session

Copilot AI and others added 2 commits July 24, 2025 16:13

Fix DirectAttackSimulator randomization_seed inconsistency by ensurin…

dfbc140

…g deterministic template ordering Co-authored-by: slister1001 <103153180+slister1001@users.noreply.github.com>

Add comprehensive tests for DirectAttackSimulator randomization_seed fix

305ed44

Co-authored-by: slister1001 <103153180+slister1001@users.noreply.github.com>

Copilot finished work on behalf of slister1001 July 24, 2025 16:20

Copilot AI requested a review from slister1001 July 24, 2025 16:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix DirectAttackSimulator randomization_seed inconsistency by ensuring deterministic template ordering #42203

Fix DirectAttackSimulator randomization_seed inconsistency by ensuring deterministic template ordering #42203

Uh oh!

Copilot AI commented Jul 24, 2025 •

edited

Loading

Uh oh!

Uh oh!

Fix DirectAttackSimulator randomization_seed inconsistency by ensuring deterministic template ordering #42203

Are you sure you want to change the base?

Fix DirectAttackSimulator randomization_seed inconsistency by ensuring deterministic template ordering #42203

Uh oh!

Conversation

Copilot AI commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Root Cause

Solution

Impact

Testing

Example Usage

Uh oh!

Uh oh!

Copilot AI commented Jul 24, 2025 •

edited

Loading