### Is there an existing issue for this?

- [X] I have searched the existing issues

### Current Behavior

Most articles are long enough to exceed the maxtoken limit.

### Expected Behavior

_No response_

### Steps To Reproduce

Extract keywords from a longer news article...
```python
import openai

# Point the client at a local OpenAI-compatible endpoint
openai.api_key = "7c7aa4a3549f5"
openai.api_base = "http://localhost:8090/v1"

completion = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Hello world"}])
print(completion.choices[0].message.content)
```
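One possible workaround, sketched here as an assumption rather than anything from the report: split the article into chunks so that each request stays under the model's token limit and extract keywords per chunk. The chunk size, prompt wording, and helper name are made up for illustration; the client setup is carried over from the snippet above.

```python
import openai

openai.api_key = "7c7aa4a3549f5"              # same placeholder key as above
openai.api_base = "http://localhost:8090/v1"  # same local endpoint as above

def extract_keywords_chunked(article, chunk_chars=6000):
    """Ask for keywords chunk by chunk so no single request exceeds max tokens."""
    results = []
    for start in range(0, len(article), chunk_chars):
        chunk = article[start:start + chunk_chars]
        completion = openai.ChatCompletion.create(
            model="gpt-3.5-turbo",
            messages=[{"role": "user",
                       "content": "Extract the keywords from this news passage:\n" + chunk}])
        results.append(completion.choices[0].message.content)
    return results
```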
### How are you running AnythingLLM?

Docker (remote machine)

### What happened?



### Are there known steps to reproduce?

Some settings:

**The steps to run AnythingLLM**

export...
I've called the same knowledge base both through the web UI and through the API, but the API calls answer poorly. The question is "李俊杰做了什么研究?" ("What research has 李俊杰 done?"). The RAG on the web side...
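For reference, a minimal sketch of what the API-side call being compared here might look like, assuming the same OpenAI-compatible client setup as the snippet earlier in this thread; the base URL, key, and model name are placeholders, not the reporter's actual configuration.

```python
import openai

openai.api_key = "7c7aa4a3549f5"              # placeholder key
openai.api_base = "http://localhost:8090/v1"  # placeholder OpenAI-compatible endpoint

completion = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",                    # placeholder model / workspace name
    messages=[{"role": "user", "content": "李俊杰做了什么研究?"}])
print(completion.choices[0].message.content)
```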
> I also ran into this problem, and I installed the matching version.
>
> 
>
> 
>
> 
>
> _Originally posted by @zxjhellow2 in [#1327](https://github.com/Dao-AILab/flash-attention/issues/1327#issuecomment-2774631611)_

The installed wheel is flash_attn-2.6.3+cu118torch2.4cxx11abiTRUE-cp310-cp310-linux_x86_64.whl.
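As a sanity-check sketch (not part of the quoted report): the wheel name encodes CUDA 11.8, torch 2.4, the CXX11 ABI, and CPython 3.10, so one quick way to see whether the local environment matches what the wheel expects is:

```python
# Compare the local environment against what the wheel name advertises:
# flash_attn-2.6.3+cu118torch2.4cxx11abiTRUE-cp310-cp310-linux_x86_64.whl
import sys
import torch

print(sys.version_info[:2])             # expected: (3, 10) -> cp310
print(torch.__version__)                # expected: 2.4.x   -> torch2.4
print(torch.version.cuda)               # expected: 11.8    -> cu118
print(torch._C._GLIBCXX_USE_CXX11_ABI)  # expected: True    -> cxx11abiTRUE

import flash_attn
print(flash_attn.__version__)           # expected: 2.6.3
```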