Using only the encoder part of a T5 model
I want to build a classification model that needs only the encoder part of a language model. I have tried BERT, RoBERTa, and XLNet, and so far I have been successful.
I now want to test the encoder part on...
ls_grep
Votes: 0
Answers: 1
Vision transformer binary classifier is only predicting one class
I wrote code for a vision transformer to classify mammograms as benign or malignant. After training for 30 epochs, however, the model predicts only one class (benign). All the final predicti...
James Albert
Votes: 0
Answers: 0
Why does DETR need an empty class?
DETR sets a "Background" class, which means no object. Why does it need this class?
wwzzyy
Votes: 0
Answers: 2
Is it possible to access the embedding layer of a Hugging Face transformer?
I want to use a pretrained Hugging Face transformer language model as the encoder in a sequence-to-sequence model.
The task is grammatical error correction, so both input and output come from the same ...
Aleksander Misztal
Votes: 0
Answers: 2