Dilyara Bareeva
Make sure repeated runs of each metric produce identical values, i.e. metric evaluation is deterministic.
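A determinism check along these lines could be added to the test suite. This is a minimal sketch, not quanda API: `seed_everything` and `noisy_metric` are illustrative names, and a real test would seed torch's RNGs as well.

```python
import random


def seed_everything(seed: int) -> None:
    """Seed all relevant RNGs so metric re-runs are reproducible."""
    random.seed(seed)
    # In the real library, torch RNGs must be seeded too:
    # torch.manual_seed(seed); torch.cuda.manual_seed_all(seed)


def noisy_metric(scores):
    """Toy stand-in for a metric with internal randomness."""
    perturbed = [s + random.gauss(0.0, 0.01) for s in scores]
    return sum(perturbed) / len(perturbed)


# Two runs from the same seed must agree exactly.
seed_everything(42)
first = noisy_metric([0.1, 0.5, 0.9])
seed_everything(42)
second = noisy_metric([0.1, 0.5, 0.9])
assert first == second
```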
### Implemented changes
- fix most errors (@annahedstroem, please check the remaining errors)
- small edits in CONTRIBUTING.md
- branched from mypy-static-type-checker
### Implemented changes
- Added a **quanda** link to README.md
- Changed the pytest.ini configuration to ignore `FutureWarning`, due to failing tests
Currently, the process of training models and setting up benchmarks is overly complicated: we have to configure them separately in the benchmark classes, the tests, and the training scripts. I'm proposing a unified approach...
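One possible shape for such a unified setup, sketched below purely for illustration (all names here are hypothetical, not existing quanda classes): a single config object that benchmark classes, tests, and training scripts all consume.

```python
from dataclasses import dataclass, field


@dataclass
class TrainingConfig:
    """Hypothetical single source of truth for model/benchmark setup."""
    model_name: str
    dataset: str
    batch_size: int = 32
    lr: float = 1e-3
    seed: int = 42
    extra: dict = field(default_factory=dict)


def build_benchmark(cfg: TrainingConfig) -> dict:
    """Toy stand-in: benchmarks, tests and training scripts would all
    derive their settings from the same config instead of duplicating
    them in three places."""
    return {
        "model": cfg.model_name,
        "dataset": cfg.dataset,
        "train_kwargs": {"batch_size": cfg.batch_size, "lr": cfg.lr},
        "seed": cfg.seed,
    }


cfg = TrainingConfig(model_name="resnet18", dataset="mnist")
bench = build_benchmark(cfg)
assert bench["train_kwargs"]["batch_size"] == 32
```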
**Describe the bug**

When running the tests on **cuda**, I get the error:

```
>   raise NotImplementedError(error_msg)
E   NotImplementedError: Automatic batch size search is not supported for multi-GPU setting.
```
...
Recent PyTorch versions warn when non-weight objects are loaded with `torch.load` (newer releases make `weights_only=True` the default). Fix this.
- Shall we just suppress the warnings and set `weights_only=False`?
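A minimal sketch of the trade-off, assuming a PyTorch version that has the `weights_only` argument: plain tensor state loads cleanly with `weights_only=True`, whereas setting `weights_only=False` re-opens arbitrary pickle loading just to silence the warning. The helper name is illustrative, not quanda API.

```python
import os
import tempfile

import torch


def save_and_reload_state(state: dict) -> dict:
    """Round-trip a checkpoint through torch.save/torch.load.

    A pure tensor state_dict loads fine with weights_only=True; only
    checkpoints containing arbitrary pickled objects would force
    weights_only=False (or an allow-list via
    torch.serialization.add_safe_globals on newer versions).
    """
    path = os.path.join(tempfile.mkdtemp(), "ckpt.pt")
    torch.save(state, path)
    # Explicit weights_only=True avoids the FutureWarning and is safe
    # here because the checkpoint contains only tensors.
    return torch.load(path, weights_only=True)


reloaded = save_and_reload_state({"weight": torch.ones(2, 2)})
assert torch.equal(reloaded["weight"], torch.ones(2, 2))
```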
As a first step in incorporating text_classification into quanda, we want to adjust the `ClassDetection` metric to support text classification.
- this clashes with the state-loading functions pre-defined in the `Benchmark` base class's download
- Accessing benchmark models wrapped in a Lightning module is awkward right now.
  - Problem A: nested layer naming. We should create a layer `prefix` string, so that the...
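The prefix idea could look roughly like this. A pure-Python sketch: the `"model."` prefix and both helper names are hypothetical, standing in for however the Lightning wrapper actually nests the model.

```python
def resolve_layer_name(name: str, prefix: str = "model.") -> str:
    """Map a user-facing layer name to its nested name inside a
    wrapper module (e.g. a LightningModule holding the network under
    the attribute `model`). Hypothetical helper, not quanda API."""
    return prefix + name


def strip_prefix(wrapped_name: str, prefix: str = "model.") -> str:
    """Inverse mapping: recover the user-facing layer name."""
    if wrapped_name.startswith(prefix):
        return wrapped_name[len(prefix):]
    return wrapped_name


# Layer names as they would appear when iterating the wrapped module:
wrapped = ["model.conv1", "model.fc.weight_layer"]
assert [strip_prefix(n) for n in wrapped] == ["conv1", "fc.weight_layer"]
assert resolve_layer_name("conv1") == "model.conv1"
```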
Checkpoints should be saved as separate files in the file storage and accessed separately from the rest of the benchmark dictionary.
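One way this could look, with file layout and key names purely illustrative (JSON stands in for the actual `.pt` checkpoint files): each checkpoint goes to its own file, and the benchmark dictionary stores only the paths, so checkpoints can be fetched and loaded independently of the rest of the metadata.

```python
import json
import os
import tempfile


def save_benchmark(bench: dict, checkpoints: list, out_dir: str) -> str:
    """Write each checkpoint to its own file; the benchmark dict keeps
    only the relative paths, so a checkpoint can be accessed without
    deserializing the whole benchmark blob."""
    os.makedirs(out_dir, exist_ok=True)
    paths = []
    for i, ckpt in enumerate(checkpoints):
        rel = f"checkpoint_{i}.json"  # stand-in for a torch .pt file
        with open(os.path.join(out_dir, rel), "w") as f:
            json.dump(ckpt, f)
        paths.append(rel)
    meta = {**bench, "checkpoint_paths": paths}
    meta_path = os.path.join(out_dir, "benchmark.json")
    with open(meta_path, "w") as f:
        json.dump(meta, f)
    return meta_path


meta_file = save_benchmark(
    {"name": "toy"}, [{"step": 0}, {"step": 1}], tempfile.mkdtemp()
)
with open(meta_file) as f:
    loaded = json.load(f)
assert loaded["checkpoint_paths"] == ["checkpoint_0.json", "checkpoint_1.json"]
```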