VectorHub icon indicating copy to clipboard operation
VectorHub copied to clipboard

Article: Semantic Chunking

Open Ashish-Abraham opened this issue 1 year ago • 1 comments

This content explores semantic chunking in depth. I have shown how to implement semantic chunking from scratch using 3 methods: embedding similarity, hierarchical clustering, and LLMs. This article also includes RAG evaluation using each approach on different standard embedding models and datasets.

Ashish-Abraham avatar Sep 30 '24 18:09 Ashish-Abraham

Thanks Ashish! I think that this might be ready for a style review, can you check @robertdhayanturner ?

svonava avatar Oct 04 '24 21:10 svonava

hi @Ashish-Abraham ! I'm in the process of editing your PR. Nice work, btw! At this point, I'll handle the editing entirely, and just ask for your input in comments. It's faster if we communicate in comments and then I'll make changes in the document as necessary.

robertdhayanturner avatar Oct 07 '24 17:10 robertdhayanturner

@Ashish-Abraham the article's ready for you to check. Thx!

robertdhayanturner avatar Oct 15 '24 23:10 robertdhayanturner

Good to go!

Ashish-Abraham avatar Oct 16 '24 07:10 Ashish-Abraham