tortoise.cpp issues

Profile the gpt-2 module and the tortoise-tts gpt-2 module and try to improve the gpt-2 module's performance

This task is blocked by the gpt-2 forward pass test being added since this could introduce regressions. https://github.com/balisujohn/tortoise.cpp/issues/5 The task is as follows: measure the runtime of the autoregressive model...

balisujohn

good first issue

Implement the CLVP module

CLVP is a non-essential component of tortoise-tts for filtering for latents that will will yield good generation quality if given to the diffusion model. This task is to export CLVP...

balisujohn

why not vulkan

3

that way you dont waste time implementing cpu,cuda,amd,metal,intel

Kreijstal

AMD GPUs

5

Hi, I saw this project currently only supports Cuda. I was wondering if it might be possible to use [HIPIFY](https://github.com/ROCm/HIPIFY) to make it work on AMD GPUs. Do you know...

fakerybakery

Make the tokenizer match the tortoise-tts Tokenizer exactly

If people are interested in contributing to tortoise.cpp, a great first task would be getting the tokenizer to always match the tokenization tortoise-tts uses. The tokenizer I'm using in tortoise.cpp...

balisujohn

good first issue

Fix Windows 10 error

1

The "#include numeric" (would add less and greater than but markdown removes them) is necessary for including std::partial_sum() (at line 5219, or, in this fork, 5220). This will fix an...

N0CTRON

latest ggml version sync

10

This is an attempt to rebase on the latest commit of ggml master branch. My primary goal behind it is to add Vulkan/OpenCL support as I only have AMD GPUs....

dridri

That's a turtle, not a tortoise

2

Everybody realises that's a picture of a turtle? right? :)

logikstate

Converting .pth files

5

Hello, I found a fine-tuned model to handle French : https://huggingface.co/Snowad/French-Tortoise , but the files are in Torch format. Is there any way to convert it to ggml's format ?

dridri

Optimize GPT2 inference: Remove redundant `autoregressive_latent_graph` and enable streaming output

3

Thank you for this excellent implementation. I'd like to suggest an optimization that could significantly speed up inference and enable streaming output. Currently, there are two GPT2 graphs: 1. autoregressive:...

candlewill

tortoise.cpp
tortoise.cpp copied to clipboard

Metadata

Profile the gpt-2 module and the tortoise-tts gpt-2 module and try to improve the gpt-2 module's performance

Implement the CLVP module

why not vulkan

AMD GPUs

Make the tokenizer match the tortoise-tts Tokenizer exactly

Fix Windows 10 error

latest ggml version sync

That's a turtle, not a tortoise

Converting .pth files

Optimize GPT2 inference: Remove redundant `autoregressive_latent_graph` and enable streaming output

← Metadata

Owner

Metadata

tortoise.cpp tortoise.cpp copied to clipboard

Metadata

← Metadata

Owner

Metadata

tortoise.cpp
tortoise.cpp copied to clipboard