cambridgeltl/SapBERT-from-PubMedBERT-fulltext

Name: cambridgeltl/SapBERT-from-PubMedBERT-fulltext
Author: cambridgeltl

feature-extraction

Stale · by cambridgeltl · 1.8M · 68 · updated 2 years ago

Python

datasets: UMLS

README

license: apache-2.0
language: en
tags: biomedical, lexical semantics, bionlp, biology, science, embedding, entity linking
datasets: UMLS

[news] A cross-lingual extension of SapBERT will appear in the main conference of ACL 2021!
[news] SapBERT will appear in the conference proceedings of NAACL 2021!

SapBERT-PubMedBERT

SapBERT by Liu et al. (2020). Trained with UMLS 2020AA (English only), using microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract-fulltext as the base model.

Expected input and output…

HuggingFace: https://huggingface.co/cambridgeltl/SapBERT-from-PubMedBERT-fulltext
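
The listing tags this model for feature extraction, embedding, and entity linking. The sketch below shows one way those pieces might fit together: embed entity names, then link a mention to the closest concept by cosine similarity. Several things here are assumptions rather than details from the listing: taking the first-token ([CLS]) vector as the name embedding (the README excerpt is truncated before it describes expected input and output), the `max_length=25` setting, the helper names `embed_names`, `cosine`, and `link`, and the toy concept identifiers.

```python
import math
from typing import Dict, List, Sequence

MODEL_ID = "cambridgeltl/SapBERT-from-PubMedBERT-fulltext"


def embed_names(names: List[str]) -> List[List[float]]:
    """Embed entity names with the model (downloads weights on first call).

    Assumption: the first-token ([CLS]) vector of the last layer serves as
    the name embedding; the listing does not confirm this.
    """
    # Imported lazily so the rest of this sketch runs without these packages.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModel.from_pretrained(MODEL_ID)
    model.eval()
    batch = tokenizer(
        names,
        padding=True,
        truncation=True,
        max_length=25,  # assumed cap; entity names are short
        return_tensors="pt",
    )
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state  # (batch, tokens, dim)
    return hidden[:, 0, :].tolist()


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def link(mention_vec: Sequence[float],
         concept_vecs: Dict[str, Sequence[float]]) -> str:
    """Return the concept id whose vector is most similar to the mention."""
    return max(concept_vecs, key=lambda cid: cosine(mention_vec, concept_vecs[cid]))


# Toy 2-d vectors standing in for real embeddings (hypothetical values).
toy_concepts = {"concept_a": [1.0, 0.0], "concept_b": [0.0, 1.0]}
best = link([0.9, 0.1], toy_concepts)
```

With real data, the toy vectors would be replaced by the output of `embed_names` for a mention and for a list of concept names; a brute-force scan like `link` is only practical for small vocabularies.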

Source attribution

HuggingFace — cambridgeltl/SapBERT-from-PubMedBERT-fulltext

Related resources

gbyuvd/chemselfies-base-bertmlm

by gbyuvd

Model

fill-mask

This model is a lightweight model pre-trained on SELFIES (Self-Referencing Embedded Strings) representations of molecules. It is trained on 2.7M unique and valid molecules taken from COCONUTDB and ChemBL34, with 7.3M total generated masked examples.

13 · 7 months ago

Python

gbyuvd/synthaccess-chemselfies

by gbyuvd

Model

text-classification

ChemFIE-SA is a BERT-like sequence classifier for predicting synthesis accessibility given a SELFIES string of a compound, fine-tuned from gbyuvd/chemselfies-base-bertmlm on DeepSA's expanded dataset from Wang et al. 2023.

9 · 1 year ago

Python

Dr-BERT/DrBERT-4GB-CP-PubMedBERT

by Dr-BERT

Model

fill-mask

In recent years, pre-trained language models (PLMs) have achieved the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general-domain data, specialized ones have emerged to handle specific domains more effectively.

357 · 2 years ago

Python

gbyuvd/drugtargetpred-chemselfies

by gbyuvd

Model

text-classification

This model is a BERT-like sequence classifier for 221 human protein drug targets, fine-tuned from gbyuvd/chemselfies-base-bertmlm on a dataset derived from ChemBL34 (Zdrazil et al. 2023). It predicts potential drug targets using chemical structures represented as SELFIES (Self-Referencing Embedded…

9 · 1 year ago

Python

AIRI-Institute/moderngena-base

by AIRI-Institute

Model

ModernGENA base

ModernGENA is a DNA foundation model based on ModernBERT (a modernized BERT-style encoder architecture) adapted for genomic sequence modeling. ModernGENA base is the 377M-parameter version introduced in the paper Back to BERT in 2026: ModernGENA as a Strong, Efficient Baseline for…

495 · 1 month ago

Dr-BERT/DrBERT-7GB

by Dr-BERT

Model

fill-mask

1.4K · 2 years ago

Python