Sebastian Franz issues

Results 36 issues of


                                            Sebastian Franz

Create tutorial how-to use a custom embedder

It would be nice to have a tutorial how to use custom embedders with biotrainer. This way, new protein language models can be used directly in biotrainer without having to...

documentation

[ppi] Interaction mode not compatible with all protocols yet

The ppi interaction mode is not yet compatible with all protocols yet. `sequence_to_class` have been tested throughout. Other per-sequence protocols should work as well. However, for per-residue tasks (`residue_to_class`), changes...

enhancement

Support multiple hyperparameters for hold_out cross validation

After the cross_validation PR will be merged, parameter search for nested cross validation will be enabled. It would be nice to extend this behaviour also to hold_out cross validation. A...

enhancement

good first issue

Add random comparison baseline

As a researcher, it would be nice to have an automatic random baseline as a comparison for every run. This could be included in the final test metrics: `test set...

enhancement

good first issue

BatchNorm1D does not work with batches of size 1

The LightAttention model used for residues_to_class protocol uses BatchNorm1D. However, if using a batch size of 1 is not possible with BatchNorm1D. Because a batch size of 1 is an...

bug

wontfix

Config - Embeddings - Targets pipeline is inefficient

Currently, at first the config file is loaded (but not completely sanity checked yet, for example biotrainer does not care if the input files actually exist, so embeddings might be...

refactoring

Add residue_to_value protocol

Implement residue to value protocol: ```text residue_to_value --> Predict a value V for each residue encoded in D dimensions in a sequence of length L. Input BxLxD --> output BxLxV...

enhancement

Support more visualization platforms

Many machine learning researchers are using different platforms to visualize their parameters and model output and training. At the moment, we are only supporting tensorboard. It would be possible to...

enhancement

good first issue

Implement embeddings calculation "on the fly"

Currently, embeddings must be pre-calculated before the training process starts. For some users, it might be beneficial to calculate the embeddings on the fly, especially if they only require low...

enhancement

Embeddings do not get re-computed automatically

Currently, if a user already ran biotrainer and thus there already exist pre-computed embeddings, these embeddings do not get re-computed if the sequences.fasta file has changed. This might be a...

bug

wontfix