Active Learning

Using active learning to speed up molecular scoring

Local development

Prerequisites

Conda

Create Environment

The following commands will setup an environment where you can run and test the application locally:

git clone [email protected]:jonswain/active-learning.git
cd active_learning
conda env create -f env.ml
conda activate active-learning
code .

Procedure

Active learning is used when we have some sort of scoring function that is too computationally expensive to label the full library of compounds. A machine learning model is trained on a subset of the data and used to score all compounds from within the library. The compounds with the best scores from the ML are labelled using the more expensive function, and the labelled data is pooled and used to train a new machine learning model. This cycle is repeated until a finish criteria is met.

Data

The SMILES data was borrowed from Thompson Sampling by Pat Walters

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
data		data
.gitignore		.gitignore
README.md		README.md
active_learning.ipynb		active_learning.ipynb
env.yml		env.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Active Learning

Local development

Prerequisites

Create Environment

Procedure

Data

About

Releases

Packages

Languages

jonswain/active-learning

Folders and files

Latest commit

History

Repository files navigation

Active Learning

Local development

Prerequisites

Create Environment

Procedure

Data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages