farm · PyPI

Toolkit for finetuning and evaluating transformer based language models

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

(Framework for Adapting Representation Models)

What is it?

FARM makes cutting edge Transfer Learning for NLP simple. It is a home for all species of pretrained language models (e.g. BERT) that can be adapted to different down-stream tasks. The aim is to make it simple to perform document classification, NER and question answering, for example, using the one language model. The standardized interfaces for language models and prediction heads allow flexible extension by researchers and easy adaptation for practitioners. Additional experiment tracking and visualizations support you along the way to adapt a SOTA model to your own NLP problem and have a very fast proof-of-concept.

Core features

Easy adaptation of language models (e.g. BERT) to your own use case
Fast integration of custom datasets via Processor class
Modular design of language model and prediction heads
Switch between heads or just combine them for multitask learning
Smooth upgrading to new language models
Powerful experiment tracking & execution
Simple deployment and visualization to showcase your model

Resources

Installation

Recommended (because of active development):

git clone https://github.com/deepset-ai/FARM.git
cd FARM
pip install -r requirements.txt
pip install --editable .

If problems occur, please do a git pull. The –editable flag will update changes immediately.

From PyPi:

pip install farm

Basic Usage

1. Train a downstream model

FARM offers two modes for model training:

Option 1: Run experiment(s) from config

https://raw.githubusercontent.com/deepset-ai/FARM/master/docs/img/code_snippet_experiment.png

Use cases: Training your first model, hyperparameter optimization, evaluating a language model on multiple down-stream tasks.

Option 2: Stick together your own building blocks

https://raw.githubusercontent.com/deepset-ai/FARM/master/docs/img/code_snippet_building_blocks.png

Usecases: Custom datasets, language models, prediction heads …

Metrics and parameters of your model training get automatically logged via MLflow. We provide a public MLflow server for testing and learning purposes. Check it out to see your own experiment results! Just be aware: We will start deleting all experiments on a regular schedule to ensure decent server performance for everybody!

2. Run Inference (API + UI)

Run docker-compose up
Open http://localhost:3000 in your browser

One docker container exposes a REST API (localhost:5000) and another one runs a simple demo UI (localhost:3000). You can use both of them individually and mount your own models. Check out the docs for details.

Upcoming features

More pretrained models XLNet, XLM …
SOTA adaptation strategies (Adapter Modules, Discriminative Fine-tuning …)
Enabling large scale deployment for production
Additional Visualizations and statistics to explore and debug your model

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.8.0

Jun 10, 2021

0.7.1

Mar 31, 2021

0.7.0

Feb 22, 2021

0.6.2

Jan 20, 2021

0.6.1

Jan 12, 2021

0.6.0

Dec 30, 2020

0.5.0

Oct 30, 2020

0.4.9

Sep 21, 2020

0.4.8

Sep 14, 2020

0.4.7

Aug 27, 2020

0.4.6

Jul 10, 2020

0.4.5

Jun 24, 2020

0.4.4

Jun 18, 2020

0.4.3

Apr 29, 2020

0.4.2

Apr 2, 2020

0.4.1

Feb 3, 2020

0.3.2

Nov 28, 2019

0.3.1

Nov 4, 2019

0.3.0

Oct 28, 2019

0.2.2

Oct 14, 2019

0.2.1

Oct 10, 2019

This version

0.2.0

Aug 19, 2019

0.1.2

Jul 29, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

farm-0.2.0.tar.gz (68.9 kB view hashes)

Uploaded Aug 19, 2019 Source

Hashes for farm-0.2.0.tar.gz

Hashes for farm-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`3a044615b12982cfb78222f5f367c2321d641684dcb8de04b7bd31e3654a4b87`
MD5	`7442b08b4fd4d89d56fb744d1bda1b8e`
BLAKE2b-256	`9ab3b35c26aea084d7998d403061c4c139c7f024952dec39854da8e1e2435cdf`