Linux Target pids should work with existing and future tasks within processes #209
Conversation
ProcFS on Linux allows us to enumerate processes and tasks; however, we currently only offer process enumeration. Add iter_proc_tasks() so callers can iterate over all the tasks within a process. Signed-off-by: Beau Belgrave <[email protected]>
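As a rough illustration of what task enumeration over ProcFS involves, here is a minimal sketch using a hypothetical standalone helper (the crate's actual iter_proc_tasks() signature may differ):

```rust
use std::fs;
use std::io;

/// List the task (thread) IDs of a process by reading /proc/<pid>/task.
/// Hypothetical helper for illustration; not the real iter_proc_tasks() API.
fn proc_task_ids(pid: u32) -> io::Result<Vec<u32>> {
    let mut tids = Vec::new();
    for entry in fs::read_dir(format!("/proc/{}/task", pid))? {
        let entry = entry?;
        // Each directory entry under /proc/<pid>/task is named after a TID.
        if let Some(tid) = entry.file_name().to_str().and_then(|s| s.parse().ok()) {
            tids.push(tid);
        }
    }
    Ok(tids)
}
```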
The perf_event_open() syscall can target specific tasks, but it will not automatically track new tasks, nor will it enumerate the existing tasks. Add methods to enumerate the current tasks for target_pids and include these when enabling ring buffers. Add the inherit flag to tell perf_event_open() to carry the perf_event settings through to child tasks. Signed-off-by: Beau Belgrave <[email protected]>
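To make the inherit behavior concrete, here is a minimal sketch of opening a per-task event with inherit set, assuming the perf-event-open-sys crate for bindings (one-collect has its own plumbing, and the real code also redirects output to a leader ring buffer via group_fd; this standalone example uses a dummy software event):

```rust
use perf_event_open_sys as sys;

// Open a perf event targeting one task on one CPU. With `inherit` set, the
// kernel applies the same attributes to children the task later creates via
// fork()/clone(), which is what keeps future threads covered.
fn open_inherited_event(pid: libc::pid_t, cpu: libc::c_int) -> std::io::Result<libc::c_int> {
    let mut attrs = sys::bindings::perf_event_attr::default();
    attrs.size = std::mem::size_of::<sys::bindings::perf_event_attr>() as u32;
    attrs.type_ = sys::bindings::PERF_TYPE_SOFTWARE;
    attrs.config = sys::bindings::PERF_COUNT_SW_DUMMY as u64;
    attrs.set_disabled(1); // enable later, once everything is wired up
    attrs.set_inherit(1);  // carry settings through to child tasks

    // group_fd = -1 here; the PR instead passes a leader fd so events from
    // these tasks land in the leader's ring buffer.
    let fd = unsafe { sys::perf_event_open(&mut attrs, pid, cpu, -1, 0) };
    if fd < 0 {
        return Err(std::io::Error::last_os_error());
    }
    Ok(fd)
}
```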
brianrob left a comment:
LGTM. A couple of questions below.
```rust
/* Find all unique tasks IDs */
for pid in pids.drain(..) {
    tasks.insert(pid);
}
```
brianrob:
If we insert pid here, then we're going to end up with duplicate pids when this function returns, no?
Beau Belgrave (author):
tasks is a HashSet, so we will only keep unique task IDs. I wanted to ensure we did not have duplicates: for example, if /proc/<pid>/task is given a task rather than a process, it will also include parent tasks. The HashSet prevents duplicates, and pids is drained fully, so we end up with a unique set after this function (pids drains into the HashSet, which then repopulates the drained pids vec).
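In isolation, the drain-then-repopulate pattern looks like this (a minimal sketch, not the exact code from the PR):

```rust
use std::collections::HashSet;

// Drain `pids` into a HashSet so only unique task IDs remain, then move
// them back into the Vec. Order is not preserved, but duplicates are gone.
fn dedup_pids(pids: &mut Vec<i32>) {
    let tasks: HashSet<i32> = pids.drain(..).collect();
    pids.extend(tasks);
}
```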
brianrob:
You're right - I missed that it calls drain to empty them out.
```rust
    &self.leader_ids,
    &mut self.ring_bufs,
    &common,
    None)?;
```
brianrob:
I see that we're not keeping track of the fds that get created here since we're not passing a Vec<PerfDataFile> to add_cpu_bufs. Presumably this is because we're redirecting these events to the leader buffers that we actually look at. When a fork/clone operation occurs, does this redirection automatically happen since we're setting the inherit flag? I'm assuming so, since that's the only thing I can see that would allow these events to flow from new tasks created after we call build.
Beau Belgrave (author):
Yep, inherit will do the redirection within the kernel for fork/clone. Unfortunately, we have to keep the FD in memory for perf_events to keep the redirected event active (I wish this wasn't the case), so we end up with a lot of FDs. Ideally, we'd only need one FD per CPU and have perf_event ref-count the redirected ring buffers, but that's not how it currently works. I'll be poking around with the tracing kernel folks to see if we can make this better over time.
brianrob left a comment:
LGTM. Thanks!
With microsoft/one-collect#209, we now allow specifying a particular process to be traced.
Closes #192.
perf_event_open() requires an FD per PID + CPU + tracepoint combination, which results in a lot of FDs, so this change also raises the default rlimit for open files to the maximum to ensure we can have as many files open as possible.
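A sketch of what raising the limit involves, assuming the libc crate (the PR's actual implementation may differ):

```rust
// Raise the soft RLIMIT_NOFILE limit to the hard maximum so the process can
// hold one FD per PID/CPU/tracepoint combination.
fn raise_nofile_limit() -> std::io::Result<()> {
    unsafe {
        let mut rl: libc::rlimit = std::mem::zeroed();
        if libc::getrlimit(libc::RLIMIT_NOFILE, &mut rl) != 0 {
            return Err(std::io::Error::last_os_error());
        }
        rl.rlim_cur = rl.rlim_max; // soft limit up to the hard limit
        if libc::setrlimit(libc::RLIMIT_NOFILE, &rl) != 0 {
            return Err(std::io::Error::last_os_error());
        }
    }
    Ok(())
}
```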
I've tested scenarios where existing threads are already running, and we get data back for them correctly.
I've also tested starting collection on a process without any extra threads and then having new threads spawn.
In both scenarios, the new threads' data shows up via nettrace within PerfView.