perf: lazy test materialization with two-phase discovery #4300

thomhurst · 2026-01-11T13:24:08Z

Summary

This PR implements lazy test materialization through a two-phase discovery architecture that significantly improves test discovery performance, especially when running filtered tests.

Key Changes

TestDescriptor struct - Lightweight test descriptor containing only filter-relevant metadata (class name, method name, categories, properties, dependencies) without full test materialization
ITestDescriptorSource interface - Enables test sources to provide fast descriptor enumeration via EnumerateTestDescriptors()
Source generator updates - Generates EnumerateTestDescriptors() implementation with pre-computed filter hints extracted at compile time
Two-phase discovery in pipeline - When filters are present:
1. Enumerate lightweight descriptors
2. Apply filter + expand dependencies at descriptor level
3. Materialize only matching tests
Descriptor-level dependency expansion - DependsOn property on descriptors enables dependency resolution before materialization, ensuring filtered tests include their dependencies

Performance Impact

When running filtered tests (e.g., --treenode-filter "/*/*/MyClass/MyTest"), only tests matching the filter (plus their dependencies) are fully materialized. This avoids the cost of:

Instantiating data source attributes
Evaluating data generators
Creating full TestMetadata objects

For unfiltered runs, the traditional collection path is used.

Backward Compatibility

Sources implementing only ITestSource continue to work (fallback to full materialization)
No breaking changes to public APIs

Test plan

All 290 engine tests pass (including FilteredDependencyTests)
All 372 source generator snapshot tests pass
Verified filtered test execution works correctly with dependencies
Run benchmarks to measure performance improvement

🤖 Generated with Claude Code

Add comprehensive design document for optimizing test discovery through lazy materialization. Key improvements: - Lightweight TestDescriptor struct for fast enumeration - Two-phase discovery: filter first, materialize matching tests only - Pre-computed filter hints at compile time - Deferred data source evaluation This architectural change targets ~20% improvement in single test execution time by avoiding eager materialization of tests that won't run due to filtering. Co-Authored-By: Claude Opus 4.5 <[email protected]>

…zation (Phase 1) Introduce lightweight test descriptor infrastructure to enable two-phase test discovery: 1. Fast enumeration via ITestDescriptorSource.EnumerateTestDescriptors() - Returns TestDescriptor structs with minimal identity/filter data - Pre-computed categories and properties for filtering - Lazy Materializer delegate for deferred full metadata creation 2. ITestSource updated with documentation for optional ITestDescriptorSource - Backward compatible: existing implementations work unchanged - New implementations can opt into optimization by implementing both TestDescriptor struct includes: - Test identity (TestId, ClassName, MethodName, FullyQualifiedName) - Location info (FilePath, LineNumber) - Filter hints (Categories, Properties) pre-extracted at compile time - HasDataSource flag for parameterized tests - RepeatCount from RepeatAttribute - Materializer delegate for lazy TestMetadata creation This is Phase 1 of the lazy test materialization redesign. Next: Update source generator to emit descriptors (Phase 2). Co-Authored-By: Claude Opus 4.5 <[email protected]>

Update TestMetadataGenerator to: - Generate ITestDescriptorSource implementation alongside ITestSource - Add EnumerateTestDescriptors() method for fast test enumeration - Pre-extract filter hints at compile time: - Categories from CategoryAttribute - Properties from PropertyAttribute - HasDataSource flag for parameterized tests - RepeatCount from RepeatAttribute Generated TestDescriptor includes: - Test identity (TestId, ClassName, MethodName, FullyQualifiedName) - Source location (FilePath, LineNumber) - Pre-computed filter arrays (no runtime attribute instantiation) - Materializer delegate referencing GetTestsAsync This enables two-phase discovery: 1. Fast enumeration via EnumerateTestDescriptors (filter before materialize) 2. Lazy materialization only for matching tests Co-Authored-By: Claude Opus 4.5 <[email protected]>

Add EnumerateDescriptors() and MaterializeFromDescriptorsAsync() to ITestDataCollector interface, enabling two-phase test discovery: 1. Phase 1: Fast enumeration via lightweight TestDescriptor structs 2. Phase 2: Lazy materialization only for tests that pass filtering Implementation: - AotTestDataCollector leverages ITestDescriptorSource when available - ReflectionTestDataCollector provides fallback descriptor enumeration - Both collectors support streaming materialization from descriptors This prepares the infrastructure for filter-before-materialize optimization. Co-Authored-By: Claude Opus 4.5 <[email protected]>

When filters with extractable hints are present, use two-phase discovery: 1. Enumerate lightweight TestDescriptors (no delegate/array allocations) 2. Filter descriptors using pre-computed hints from source generator 3. Only materialize TestMetadata for tests that pass filtering This optimization avoids creating full TestMetadata objects (with all their delegates, arrays, and allocations) for tests that won't run due to filters. Key changes: - Add CouldDescriptorMatch() to FilterHints for descriptor-level filtering - Update AotTestDataCollector to use two-phase discovery when available - Fallback to traditional collection for legacy sources or no filter hints Co-Authored-By: Claude Opus 4.5 <[email protected]>

…sion Adds DependsOn property to TestDescriptor containing pre-extracted dependency information from DependsOnAttribute. This enables dependency expansion during two-phase discovery, ensuring that when filtering tests, their dependencies are also materialized. Changes: - Add DependsOn string array to TestDescriptor - Extract DependsOn attributes at compile time in source generator - Expand dependencies transitively at descriptor level before materialization - Restore method-name filtering in CouldDescriptorMatch This fixes the issue where filtering to a single test would not include its dependencies (e.g., filtering to "DependentTest" would not include "BaseTest" that it depends on). Co-Authored-By: Claude Opus 4.5 <[email protected]>

thomhurst · 2026-01-11T13:28:05Z

Summary

Implements lazy test materialization through a two-phase discovery architecture (descriptors → filter → materialize) to improve performance when running filtered tests.

Critical Issues

None found ✅

Suggestions

1. Missing DependsOn information in ReflectionTestDataCollector descriptors

In ReflectionTestDataCollector.EnumerateDescriptors(), the generated descriptors have empty DependsOn arrays. This means dependency expansion in two-phase discovery won't work correctly for reflection mode.

Why this matters: The two-phase discovery in AotTestDataCollector.CollectTestsWithTwoPhaseDiscoveryAsync() relies on descriptor.DependsOn to expand dependencies before materialization. Without this, filtered tests won't include their dependencies in reflection mode.

Suggested fix: Either extract DependsOn values, document that dependency expansion is source-gen-only, or add a TODO comment explaining the limitation.

2. Consider ConfigureAwait consistency

In AotTestDataCollector.CollectTestsWithTwoPhaseDiscoveryAsync(), the code uses ConfigureAwait(false) but the method signature doesn't accept a CancellationToken parameter itself.

TUnit Rules Compliance

✅ Rule 1 - Dual-Mode: Both AotTestDataCollector and ReflectionTestDataCollector updated
✅ Rule 2 - Snapshot Testing: All 100 .verified.txt files updated correctly
✅ Rule 3 - No VSTest: No VSTest dependencies added
✅ Rule 4 - Performance First: This is a performance optimization PR
✅ Rule 5 - AOT Compatibility: Proper annotations on new methods

Verdict

✅ APPROVE - No critical issues

Excellent performance optimization that maintains backward compatibility. The architecture is clean, implementation is thorough, and all snapshot tests updated properly.

- Combine enumeration, indexing, and filtering into a single pass - Eliminate separate iteration over allDescriptors list - Add hasDependencies flag to skip dependency expansion when not needed - Use for-loop instead of foreach for DependsOn array iteration - Reduce memory allocations by avoiding intermediate list Co-Authored-By: Claude Opus 4.5 <[email protected]>

thomhurst · 2026-01-11T13:34:27Z

Summary

This PR implements lazy test materialization through a two-phase discovery architecture that defers creation of full TestMetadata objects until after filtering, improving performance when running filtered tests.

Critical Issues

None found - all TUnit critical rules are properly followed.

Suggestions

Dependency parsing uses IndexOf/Substring repeatedly. For complex dependency graphs, consider pre-parsing to tuples, though current approach is reasonable given dependencies are rare.
Property extraction assumes values do not contain equals signs. Consider documenting this limitation or using a more robust separator, though PropertyAttribute values are typically simple.
Reflection mode cannot pre-extract filter hints efficiently (documented limitation). Consider adding comment that it still avoids data source materialization which is the bigger win.

TUnit Rules Compliance

Dual-Mode: Both source-gen and reflection implement ITestDescriptorSource correctly
Snapshot Testing: All 372 .verified.txt files updated properly
AOT Compatible: Proper DynamicallyAccessedMembers annotations present
Performance First: Excellent - readonly structs, compile-time hints, static delegates
No VSTest: Uses Microsoft.Testing.Platform exclusively

Architecture Highlights

Backward compatible fallback for ITestSource without ITestDescriptorSource
Descriptor-level dependency expansion before materialization
Single-pass enumeration with indexing and filtering
Conditional optimization only when filterHints.HasHints is true

Verdict

Approve - Excellent performance optimization with proper dual-mode support, AOT compatibility, and backward compatibility.

- Filter out assemblies that don't transitively reference TUnit.Core - Skip TUnit framework assemblies (TUnit.Core, TUnit.Engine, etc.) - Build reverse dependency graph to find all assemblies that could contain tests - Reduces assembly registrations from 26 to 1 in typical test projects This optimization avoids loading and scanning assemblies that cannot possibly contain TUnit tests (e.g., AutoFixture, Polly, Humanizer). Co-Authored-By: Claude Opus 4.5 <[email protected]>

thomhurst · 2026-01-11T13:59:44Z

Summary

This PR implements lazy test materialization through a two-phase discovery architecture to improve test discovery performance when running filtered tests.

Critical Issues

None found ✅

Observations

Excellent Design ✅

Dual-mode implementation - Both source-gen (ITestDescriptorSource.EnumerateTestDescriptors) and reflection (ReflectionTestDataCollector.EnumerateDescriptors) paths are implemented, adhering to TUnit's dual-mode requirement.
Snapshot tests properly updated - All 97 .verified.txt files correctly show the new EnumerateTestDescriptors() method being generated. No .received.txt files committed.
AOT compatibility maintained - Proper [DynamicallyAccessedMembers] annotations present in reflection code paths.
No blocking async - All async operations use proper async/await patterns. No .Result, .Wait(), or .GetAwaiter().GetResult().
No VSTest dependencies - Exclusively uses Microsoft.Testing.Platform.
Performance-first design:
- Lightweight readonly struct TestDescriptor minimizes allocations
- Pre-computed filter hints (categories, properties, dependencies) extracted at compile time
- Two-phase discovery avoids materializing tests that won't run
- Dependency expansion at descriptor level before materialization
- Sequential processing for small test sets (from PR perf: use sequential processing for small test sets #4299)
String escaping - Proper escaping in generated code (EscapeString handles backslashes, quotes, newlines, etc.)
Backward compatibility - Sources implementing only ITestSource continue to work via fallback to traditional collection path.

Code Quality

Clean separation of concerns between descriptor enumeration and materialization
Well-documented with XML comments explaining the two-phase approach
Efficient dependency resolution using dictionaries for O(1) lookups
Smart optimization: only expands dependencies if any matching descriptor has them

Verdict

✅ APPROVE - No critical issues. This is a well-designed performance optimization that follows all TUnit mandatory rules.

Add new public API entries for: - TestDescriptor struct - lazy materialization container - ITestDescriptorSource interface - source generation contract Co-Authored-By: Claude Opus 4.5 <[email protected]>

thomhurst · 2026-01-11T14:19:56Z

Summary

This PR implements lazy test materialization through a two-phase discovery architecture that improves test discovery performance when running filtered tests.

Critical Issues

None found ✅

Observations

TUnit Rules Compliance

✅ Dual-Mode Implementation - The PR correctly implements the feature in both:

Source generator mode: TUnit.Core.SourceGenerator/Generators/TestMetadataGenerator.cs generates ITestDescriptorSource implementation with EnumerateTestDescriptors()
Reflection mode: TUnit.Engine/Discovery/ReflectionTestDataCollector.cs implements EnumerateDescriptors() and MaterializeFromDescriptorsAsync()

✅ Snapshot Testing - All .verified.txt files updated correctly (372 source generator snapshots). No .received.txt files committed.

✅ Public API Changes - New public types properly tracked:

TestDescriptor struct (TUnit.Core:11711-11870)
ITestDescriptorSource interface (TUnit.Core/Interfaces/SourceGenerator:11624-11676)
API snapshots updated for all target frameworks

✅ AOT Compatible - Proper [DynamicallyAccessedMembers] annotations on reflection code:

CreateReflectionMaterializer (TUnit.Engine/Discovery/ReflectionTestDataCollector.cs:12289-12295)
MaterializeSingleTestAsync (TUnit.Engine/Discovery/ReflectionTestDataCollector.cs:12297-12316)

✅ Performance First - Excellent optimization:

Descriptors are lightweight structs
Filter hints pre-computed at compile time (categories, properties, dependencies)
Two-phase discovery only triggered when filters present
Dependency expansion at descriptor level (before materialization)

Architecture Strengths

Backward Compatible - Sources implementing only ITestSource continue to work via fallback path
Smart Filtering - The CouldDescriptorMatch method (TUnit.Engine/Services/MetadataFilterMatcher.cs:12359-12384) correctly filters by class/method name before materialization
Dependency Handling - Pre-extracted DependsOn property enables dependency resolution at descriptor level (TUnit.Engine/Building/Collectors/AotTestDataCollector.cs:11960-12015)
Single-Pass Enumeration - Clever optimization in CollectTestsWithTwoPhaseDiscoveryAsync that indexes AND filters descriptors in one pass (TUnit.Engine/Building/Collectors/AotTestDataCollector.cs:11923-11958)

Minor Observations

Assembly Loader Optimization (TUnit.Core.SourceGenerator/CodeGenerators/AssemblyLoaderGenerator.cs:11149-11245) - Nice addition to only register assemblies that reference TUnit.Core, reducing reflection overhead
Reflection Mode Limitation - As documented, reflection mode cannot pre-extract filter hints without instantiating attributes. This is an acceptable trade-off and properly documented.
Test Coverage - PR description mentions "All 290 engine tests pass" and "All 372 source generator snapshot tests pass". The unchecked benchmark item is reasonable for a separate follow-up.

Verdict

✅ APPROVE - No critical issues

This is a well-architected performance optimization that:

Follows all TUnit critical rules
Maintains dual-mode compatibility
Uses modern C# features appropriately
Is properly documented
Has comprehensive test coverage

The implementation demonstrates careful attention to performance, backward compatibility, and TUnit's architectural constraints.

thomhurst and others added 6 commits January 11, 2026 12:40

thomhurst had a problem deploying to Pull Requests January 11, 2026 13:24 — with GitHub Actions Failure

thomhurst had a problem deploying to Pull Requests January 11, 2026 13:30 — with GitHub Actions Failure

thomhurst had a problem deploying to Pull Requests January 11, 2026 13:56 — with GitHub Actions Failure

test: update public API snapshots for TestDescriptor

3cd46c2

Add new public API entries for: - TestDescriptor struct - lazy materialization container - ITestDescriptorSource interface - source generation contract Co-Authored-By: Claude Opus 4.5 <[email protected]>

thomhurst had a problem deploying to Pull Requests January 11, 2026 14:17 — with GitHub Actions Failure

thomhurst merged commit 643e39f into main Jan 11, 2026
9 of 13 checks passed

thomhurst deleted the feature/lazy-test-materialization branch January 11, 2026 14:44

thomhurst mentioned this pull request Jan 11, 2026

chore(deps): update tunit to 1.9.68 #4301

Merged

1 task

This was referenced Jan 12, 2026

Chore(deps): Bump TUnit from 1.9.2 to 1.9.68 InfiniLore/InfiniBlazor#237

Merged

Chore(deps): Bump TUnit from 1.9.2 to 1.9.68 code-of-chaos/cs-code_of_chaos-extensions#49

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

perf: lazy test materialization with two-phase discovery #4300

perf: lazy test materialization with two-phase discovery #4300

Uh oh!

thomhurst commented Jan 11, 2026

Uh oh!

thomhurst commented Jan 11, 2026

Uh oh!

thomhurst commented Jan 11, 2026

Uh oh!

thomhurst commented Jan 11, 2026

Uh oh!

thomhurst commented Jan 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

perf: lazy test materialization with two-phase discovery #4300

perf: lazy test materialization with two-phase discovery #4300

Uh oh!

Conversation

thomhurst commented Jan 11, 2026

Summary

Key Changes

Performance Impact

Backward Compatibility

Test plan

Uh oh!

thomhurst commented Jan 11, 2026

Summary

Critical Issues

Suggestions

1. Missing DependsOn information in ReflectionTestDataCollector descriptors

2. Consider ConfigureAwait consistency

TUnit Rules Compliance

Verdict

Uh oh!

thomhurst commented Jan 11, 2026

Summary

Critical Issues

Suggestions

TUnit Rules Compliance

Architecture Highlights

Verdict

Uh oh!

thomhurst commented Jan 11, 2026

Summary

Critical Issues

Observations

Excellent Design ✅

Code Quality

Verdict

Uh oh!

thomhurst commented Jan 11, 2026

Summary

Critical Issues

Observations

TUnit Rules Compliance

Architecture Strengths

Minor Observations

Verdict

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants