Yuki Arimo
Yuki Arimo
I want to use this locally with Ollama and SD (or anything else). Is it possible?
I've tried: ``` # build using Emscripten git clone https://github.com/ggerganov/whisper.cpp cd whisper.cpp mkdir build-em && cd build-em emcmake cmake .. make -j # copy the produced page to your HTTP...
I have a JSONL dataset like this: ``` {"text": "This is raw text in 2048 tokens I want to feed in"}, {"text": "This is next line, tokens are also 2048"}...
Can somebody please explain how the Moondream model works? For example, if my context window is 1024 tokens, and the model uses 724 for the image, how? The image is...
The current model is working only with 378x378 image resolution. Is it possible to make it recognize images with higher resolution to extract more details?
### 🐛 Describe the bug I ran the following code: ``` git clone https://github.com/myshell-ai/MeloTTS.git cd MeloTTS pip install -e . python -m unidic download cd melo python preprocess_text.py --metadata all.list...
Hello. I downloaded all files. And all of them are just a randomly answers in JSON format. So, I want to train my own tensorflow.js model using this database! But,...
### Describe the bug I have manually split my .wav audio file into chunks of 10 seconds in 75 files and written transcriptions for them. However, when I tried to...
``` (tiny-audio-diffusion) yuki@yuki tiny-audio-diffusion % python train.py exp=drum_diffusion trainer.gpus=1 datamodule.dataset.path=/Users/yuki/Downloads/tiny-audio-diffusion/samples GPU available: True (mps), used: True TPU available: False, using: 0 TPU cores IPU available: False, using: 0 IPUs HPU...