We are actively developing our finetuning pipelines. Please email info@tamarind.bio if any of the below use cases sound interesting to you, and we can work with you to determine the best approach given your available data.
We offer protein language model finetuning using ESM2 for both regression and classification. Simply upload a CSV containing a column for your protein sequences and a column for your property of interest, and we’ll create a custom model. This model will then appear in your My Models tab, and you can use it for inference with your new protein sequences.
For antibody datasets with a heavy and light chain, we offer an Antibody finetuning tool, originally trained on specificity data, which uses embeddings from Antiberty for regression and classification. Select the columns for your sequence, property, and job name, and we’ll create a custom model for your use case.
It’s difficult for current state of the art models to capture differences in point mutations. We recommend ALDE if you have some point mutations and want to find what others to test.