Sarah Lutteropp

Results 281 comments of Sarah Lutteropp

Hi Hajk, you're welcome. :-) I do not know about other cases where these name confusions can happen - I'm just a computer scientist (only recently started with bioinformatics) and...

current state in trying to plot things, turns out the BICs are too closely lying together. Instead of plotting the BIC scores, it likely makes more sense to print absolute...

![Screenshot from 2020-11-30 22-50-25](https://user-images.githubusercontent.com/1059869/100669956-75ba0600-335e-11eb-8996-c45c9471ca84.png) Yes... this looks slightly better, but still not useful... switching to relative BIC difference instead of absolute difference here. Also, likely a histogram works better for...

Here with relative BIC differences, values smaller than zero meaning a BIC improvement ![Screenshot from 2020-11-30 22-57-53](https://user-images.githubusercontent.com/1059869/100670749-9cc50780-335f-11eb-8965-08adcf5d3be1.png) Slightly more useful, but still... histogram is likely better here.

Maybe for the BIC score, what we really are interested in are the counts of these situations happening: - NetRAX (starting from best raxml-ng tree) BIC was less-or-equal (better) than...

I got the BIC score plots to look like this now ![SimulationType CELINE_SamplingType PERFECT_SAMPLING_1000_msasize_LikelihoodType BEST_bic_stats](https://user-images.githubusercontent.com/1059869/100674523-90dc4400-3365-11eb-980c-6fa487be6e99.png) ![SimulationType CELINE_SamplingType PERFECT_SAMPLING_1000_msasize_LikelihoodType BEST_bic_plot](https://user-images.githubusercontent.com/1059869/100674525-9174da80-3365-11eb-9d1a-184e22bad109.png)

For relative RF distance, I currently have such plots: ![SimulationType CELINE_SamplingType PERFECT_SAMPLING_1000_msasize_LikelihoodType BEST_rfdist_stats](https://user-images.githubusercontent.com/1059869/100677201-d4857c80-336a-11eb-9b1a-1e397432a35f.png) ![SimulationType CELINE_SamplingType PERFECT_SAMPLING_1000_msasize_LikelihoodType BEST_rfdist_plot](https://user-images.githubusercontent.com/1059869/100677203-d5b6a980-336a-11eb-95bb-306259984ae4.png) A set of histograms would definitely fit better here.

Observations from this experiment: - NetRAX does similarly well as raxml-ng. - On all datasets, raxml-ng inferred no near-zero-branches in its ML tree. - The BIC score does not see...

I am also not 100% excluding the possibility of a bug in the partitioned MSA creation (when giving it multiple trees) with SeqGen. I will do an alternative experiment to...

@celinescornavacca ``` double bic(double logl, double k, double n) { return -2 * logl + k * log(n); } ``` - logl = the loglikelihood of the network. - k...