Apertus: a fully open, transparent, multilingual language model

snikta@programming.dev · 1 month ago

Apertus: a fully open, transparent, multilingual language model

Sonalder@lemmy.ml · edit-2 1 month ago

Open Source is a way to create but is not limited to only software but many different things. LLMs are software. Most open source LLMs are using Open washing to label themselves as Open Source, however it is not. The importance in Open Source is being able to study how it was made and most of open models have closed training data-sets and training method. Apertus is trully open in the sense that they published Open Data and full training details.

You have the right to be bother by “AI” but let Open Source enthousiasts being… well, enthousiasts when in a field of Open Washing someone created something trully Open Source to the point of sharing it in an Open Source community on a FOSS plateform.

Edit : Minor corrections

Zerush@lemmy.ml · edit-2 4 days ago

You can find it in HuggingFace.

Apertus is designed with transparency at its core, thereby ensuring full reproducibility of the training process. Alongside the models, the research team has published a range of resources: comprehensive documentation and source code of the training process and datasets used, model weights including intermediate checkpoints – all released under a permissive open-source license, which also allows for commercial use. The terms and conditions are available via Hugging Face. Apertus was developed with due consideration to Swiss data protection laws, Swiss copyright laws, and the transparency obligations under the EU AI Act. Particular attention has been paid to data integrity and ethical standards: the training corpus builds only on data which is publicly available. It is filtered to respect machine-readable opt-out requests from websites, even retroactively, and to remove personal data, and other undesired content before training begins.

You can use it here (optional free account).

Review:

Apertus truly delivers on its transparency promises, representing one of the most open and transparent LLM projects to date. The philosophy has been “open at every level,” backed by concrete actions that set new standards for AI transparency.

Apertus: a fully open, transparent, multilingual language model

Apertus: a fully open, transparent, multilingual language model

Key features