Skip to content

Incoherence in the pooled_output definition in ElectraBackbone and BertBackbone #1358

@mbrhd

Description

@mbrhd

Describe the bug

There is an incoherence in the definition of the pooled_output in ElectraBackbone and BertBackbone vs AlbertBackbone and FNet.

  • In ElectraBackbone and BertBackbone the pooled_output is defined as the pooling of the cls token before the dense layer.
  • In AlbertBackbone and FNet the pooled_output is defined as the output of dense layer which takes the cls token from the sequence output.

Expected behavior

The pooled_output should have one definition or follow the original implementation.

Additional context

The original implementation of Bert, Fnet, Albert

Would you like to help us fix it?
I would like to work on this issue.

Metadata

Metadata

Assignees

Labels

type:BugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions