AIAny - TabFM 1.0.0 (PyTorch)

Tabular workflows usually rely on per-dataset training, feature engineering, and hyperparameter sweeps. TabFM flips that paradigm: it reframes tabular prediction as an in‑context learning problem so you can provide training rows as context and get predictions on test rows in a single forward pass — no dataset-specific training required.

Key Capabilities

Architecture: alternating column attention (Set Transformer-style Fourier-feature cell embeddings and induced attention), row-level compression with CLS tokens and RoPE, then a 24-block causal ICL transformer over compressed row embeddings. Key hyperparameters include 256 embedding dim, 3 column-attention blocks, 3 row-attention blocks, 24 ICL blocks, SwiGLU activations, and 32 Fourier frequencies.
Zero-shot by design: works out-of-the-box for classification (up to 10 classes) and regression on mixed numeric/categorical columns, handling DataFrame or numpy inputs without hyperparameter tuning.
Training priors: pre-trained on hundreds of millions of synthetic datasets generated from structural causal models (SCMs) to capture diverse tabular relationships without using proprietary real-world tables.
Production conveniences: PyTorch and JAX/Flax backends available; Hugging Face weights and an scikit-learn-compatible wrapper; Google announced integration (e.g., BigQuery AI.PREDICT) to simplify SQL-based inference.

Who it's for and tradeoffs

Great fit if you need quick, high-quality baseline predictions on heterogeneous tabular datasets without spending time on model selection or hyperparameter tuning, or if you want an easy-to-run, scikit-learn-style inference flow. It also serves as a strong research baseline for zero-shot tabular ICL approaches.

Look elsewhere if your task has >10 target classes, requires training-time fine-tuning for domain-specific performance, needs handling of extremely wide tables (well beyond ~500 features), or demands a commercial license for the provided weights — the released weights use a TabFM Non-Commercial License v1.0 while source code is Apache-2.0. Performance and fairness on specific real-world domains and minority subgroups are not fully characterized; memory usage grows with the number of training rows because all rows are passed as context.

TabFM 1.0.0 (PyTorch)

Introduction

Key Capabilities

Who it's for and tradeoffs

Information

Categories

Tags

More Items

Rampart

BugTraceAI-CORE-Ultra-27B-Q6

Qwopus-3.6-35B-A3B-Coder-MTP-GGUF