BIOT¶
biotbraindecode/biot-pretrained-six-datasets-18chs (HuggingFace Hub)BIOT is a transformer-based foundation model for biosignal classification. The pretrained checkpoint was trained on six EEG datasets (PREST, SHHS, CHB-MIT, IIIC Seizure, TUAB, TUEV) at 200 Hz using 18 bipolar channels derived from the Temporal Central Parasagittal (TCP) montage.
In NeuralBench, a Conv1d channel projection maps from the recording’s
arbitrary channel count to the 18 expected channels, allowing BIOT to be
evaluated on datasets with diverse montages without manual channel selection.
Pretraining data overlap¶
Warning
The biot-pretrained-six-datasets-18chs checkpoint was pretrained on six
EEG datasets including TUAB and TUEV (Section 3.6 of the paper).
These are used both in pretraining and as NeuralBench evaluation datasets,
so results on the affected tasks may be inflated.
Pretraining dataset |
NeuralBench task |
NeuralBench study |
|---|---|---|
TUAB |
|
Lopez2017 |
TUEV |
|
Harati2015 |
Known limitations¶
Fixed spatial embeddings – The pretrained positional embeddings are tied to the 18 TCP channels. The
Conv1dchannel projection learns a linear mapping into this space, but there is no guarantee that the resulting channels align with the original TCP bipolar semantics.