Keylab/COMO
COMO (Closed-loop Optical Molecule recOgnition) is a deep learning framework for Optical Chemical Structure Recognition (OCSR). It recognizes chemical structure diagrams from images and predicts SMILES strings with atom-level 2D coordinates and bond matrices.
README
license: cc-by-nc-4.0 libraryname: pytorch tags: chemistry cheminformatics optical-chemical-structure-recognition ocsr molecule-recognition smiles transformer swin-transformer minimum-risk-training molecular-graph datasets: Keylab/COMO metrics: exactmatch tanimotosimilarity COMO: Closed-Loop Optical Molecule Recognition COMO (Closed-loop Optical Molecule recOgnition) is a deep learning framework for Optical Chemical Structure Recognition (OCSR). It recognizes chemical structure diagrams from…
- HuggingFace
- https://huggingface.co/Keylab/COMO
Source attribution
- HuggingFace — Keylab/COMO
Related resources
sagawa/ReactionT5v1-forward
by sagawaThis is a ReactionT5 pre-trained to predict the products of reactions.
A library for building, manipulating, analyzing and automatic design of molecules, including a genetic algorithm.
Drugs must satisfy stringent criteria for both efficacy and safety. This model predicts the likelihood of FDA approval for small-molecule drugs, represented using SMILES (Simplified Molecular Input Line Entry System) strings.
Dr-BERT/DrBERT-7GB
by Dr-BERTIn recent years, pre-trained language models (PLMs) achieve the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general domain data, specialized ones have emerged to more effectively treat specific domains.
In recent years, pre-trained language models (PLMs) achieve the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general domain data, specialized ones have emerged to more effectively treat specific domains.
This model is a BERT-like sequence classifier for 221 human protein drug targets, fine-tuned from gbyuvd/chemselfies-base-bertmlm on a dataset derived ChemBL34 (Zdrazil et al. 2023). It predicts potential drug targets using chemical structures represented as SELFIES (Self-Referencing Embedded…