Llemma

General Science Models

Open language model for mathematics (7B and 34B parameters), trained on Proof-Pile-2; outperforms Minerva at equal parameter count on the MATH benchmark and is capable of tool use and formal theorem proving in Lean without further finetuning (EleutherAI, ICLR 2024)

Repository: github.com/eleutherai/math-lm

Source attribution

Awesome AI for Science — github.com/eleutherai/math-lm

Related resources

TimesFM (Google Research)

Tool

General Science Models

Pretrained time series foundation model for long-horizon forecasting across diverse scientific domains including climate variables, biomedical signals, and physical observations; decoder-only Transformer architecture with strong zero-shot generalization (19.8K+ stars, Apache 2.0, 2024-2025)

20.1K stars · 5 days ago

Python

Apache-2.0

Galactica

Tool

General Science Models

Large language model for science

Intern-S1

Tool

General Science Models

Open-source scientific multimodal foundation model built on a 235B MoE LLM and 6B vision encoder, continually pretrained on 5T tokens including 2.5T scientific-domain tokens, with strong results across chemistry, materials, life science, and earth science benchmarks (2025)

MinervaAI

Tool

General Science Models

Mathematical reasoning

Chronos (Amazon Science, NeurIPS 2024)

Tool

General Science Models

Pretrained time series foundation model for zero-shot forecasting across diverse scientific and real-world domains; tokenizes continuous time series into discrete bins to train transformer language models on large-scale corpora, achieving strong zero-shot generalization and competitive performance with task-specific supervised models on climate, energy, and health benchmarks (5.3K+ stars, Apache 2.0, 2024-2026)
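The tokenization idea in the Chronos description (scale a continuous series, then map each value to a discrete bin so a language model can treat it as a token sequence) can be sketched roughly as follows. This is a minimal illustration, not the Chronos implementation: the function names `tokenize_series` and `detokenize`, the bin count, and the `[-5, 5]` range are all hypothetical choices made for the example.

```python
from typing import List, Tuple


def tokenize_series(values: List[float], num_bins: int = 10,
                    low: float = -5.0, high: float = 5.0) -> Tuple[List[int], float]:
    """Scale a series by its mean absolute value, then map each point to a bin index.

    Hypothetical helper for illustration; bin count and range are assumptions.
    """
    if not values:
        return [], 1.0
    # Mean scaling: divide by the mean absolute value (use 1.0 for an all-zero series).
    scale = sum(abs(v) for v in values) / len(values) or 1.0
    width = (high - low) / num_bins
    tokens = []
    for v in values:
        scaled = v / scale
        # Clamp to the covered range so every point lands in a valid bin.
        clamped = min(max(scaled, low), high)
        tokens.append(min(int((clamped - low) / width), num_bins - 1))
    return tokens, scale


def detokenize(tokens: List[int], scale: float, num_bins: int = 10,
               low: float = -5.0, high: float = 5.0) -> List[float]:
    """Map bin indices back to bin centers and undo the scaling."""
    width = (high - low) / num_bins
    return [(low + (t + 0.5) * width) * scale for t in tokens]


tokens, scale = tokenize_series([1.0, 2.0, 3.0])
print(tokens, scale)              # [5, 6, 6] 2.0
print(detokenize(tokens, scale))  # [1.0, 3.0, 3.0]
```

The round trip is lossy: 2.0 and 3.0 fall into the same bin and both decode to that bin's center. A finer bin grid reduces this quantization error at the cost of a larger vocabulary.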