PITTI - Article - SlimTrainer and Adalite

SlimTrainer and Adalite

Artificial Intelligence,Information Processing | Computing

Date : 2023-10-02

Description

SlimTrainer and Adalite allow for full parameter 16-bit finetuning of language models up to 7B on a single 24GB GPU. The optimizer uses the backpropagation fusing technique from LOMO, but uses a custom optimizer instead of using simple SGD. The small batch size and extreme memory requirements extensive exploration of potential optimizer variants, resulting in a custom optimizer, Adalite, based on Adafactor and LAMB.

GitHub repository below

How hard does Art need to be ?

Evaluation of Sports Performance: Cognitive Biases, Vectors an...

Recently on :

Artificial Intelligence

Information Processing | Computing

PITTI - 2026-03-05

Scaling Trust : a Missing Piece in Multi-Agent Worlds

Humanity’s ability to build complex civilizations relies on an "invisible infrastructure" - the shared culture, institutions, a...

PITTI - 2026-01-14

Cultural, Ideological and Political Bias in LLMs

Transcription of a talk given during the work sessions organized by Technoréalisme on December 9, 2025, in Paris. The talk pres...

WEB - 2025-11-13

Measuring political bias in Claude

Anthropic gives insights into their evaluation methods to measure political bias in models.

WEB - 2025-10-09

Defining and evaluating political bias in LLMs

OpenAI created a political bias evaluation that mirrors real-world usage to stress-test their models’ ability to remain objecti...

WEB - 2025-07-23

Preventing Woke AI In Federal Government

Citing concerns that ideological agendas like Diversity, Equity, and Inclusion (DEI) are compromising accuracy, this executive ...

more articles on
-
Artificial Intelligence

We care about your privacy so we do not store nor use any cookie unless it is stricly necessary to make the website to work

Got it

Learn more