Jesse Zhang comments

Results 28 comments of


                                            Jesse Zhang

[WIP] Refactor Text Splitters to be composable, customizable, and less dependent on prompt

> I need to do a bit more thinking on how much to expose the PromptHelper (do we want to expose it as a customizable class at all if we...

[WIP] Refactor Text Splitters to be composable, customizable, and less dependent on prompt

Check out `GPTTreeIndex` for the new pattern of how a default text splitter is set. If it looks good, I'll extend to all indices.

[WIP] Refactor Text Splitters to be composable, customizable, and less dependent on prompt

Done! Thanks for the review of that! We can revamp PromptHelper in the future.

Refactor download_loader to how loaders are downloaded.

This looks awesome! Thanks for all the improvements. The 600 req/hr limit should be fine since we're caching everything. The website, for example, needs an API key, which I believe...

Add similarity scores and metadata to Source Nodes

Included one more enhancement: - New web reader using BeautifulSoup - Allows for more fine-grained scraping of specific sites. Included an extractor for Substack posts

Add similarity scores and metadata to Source Nodes

> Thanks for the changes! Some comments below. Also high-level note, there's really like 2-3 subchanges in here 🙂 - next time would be good to split it up a...

Added pptx parser and image captioner

> nice!! is this going to get added to `DEFAULT_FILE_EXTRACTOR` in file/base.py as well? Yep good point, forgot about that at the end -- added

New eval for math contests (AMC 10/12)

> Please add some examples of evals to the respective field in the PR description. Just edited the original post!

Jesse Zhang

More flexible answer generation

More flexible answer generation

[WIP] Refactor Text Splitters to be composable, customizable, and less dependent on prompt

[WIP] Refactor Text Splitters to be composable, customizable, and less dependent on prompt

[WIP] Refactor Text Splitters to be composable, customizable, and less dependent on prompt

Refactor download_loader to how loaders are downloaded.

Add similarity scores and metadata to Source Nodes

Add similarity scores and metadata to Source Nodes

Added pptx parser and image captioner

New eval for math contests (AMC 10/12)