Want to figure out critical algorithm of Detect layer

## ❔Question
Hi, 
I want to figure out the intuition of bbox detection.
In yolov3, we can find that the output can be write by these:
![image](https://user-images.githubusercontent.com/7379039/88074490-77b8c080-cb45-11ea-86c0-001e23a7b3f9.png)
![image](https://user-images.githubusercontent.com/7379039/88074530-87d0a000-cb45-11ea-9f4d-65c1ba0740f9.png)

So, in yolov5,
I look into the src code: https://github.com/ultralytics/yolov5/blob/1e95337f3aec4c12244802bb6e493b07b27aa795/models/yolo.py#L21-L38
And try to formularize it:
![image](https://user-images.githubusercontent.com/7379039/88074806-e5fd8300-cb45-11ea-8028-87cabe354cbe.png)

Am I right?


	def forward(self, x):
	# x = x.copy() # for profiling
	z = [] # inference output
	self.training \|= self.export
	for i in range(self.nl):
	bs, _, ny, nx = x[i].shape # x(bs,255,20,20) to x(bs,3,20,20,85)
	x[i] = x[i].view(bs, self.na, self.no, ny, nx).permute(0, 1, 3, 4, 2).contiguous()

	if not self.training: # inference
	if self.grid[i].shape[2:4] != x[i].shape[2:4]:
	self.grid[i] = self._make_grid(nx, ny).to(x[i].device)

	y = x[i].sigmoid()
	y[..., 0:2] = (y[..., 0:2] * 2. - 0.5 + self.grid[i].to(x[i].device)) * self.stride[i] # xy
	y[..., 2:4] = (y[..., 2:4] * 2) ** 2 * self.anchor_grid[i] # wh
	z.append(y.view(bs, -1, self.no))

	return x if self.training else (torch.cat(z, 1), x)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Want to figure out critical algorithm of Detect layer #471

❔Question

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Want to figure out critical algorithm of Detect layer #471

Description

❔Question

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions