Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

【New Features】Chunking method available? #3

Open
Kunlun-Zhu opened this issue May 27, 2024 · 4 comments
Open

【New Features】Chunking method available? #3

Kunlun-Zhu opened this issue May 27, 2024 · 4 comments
Labels
enhancement New feature or request

Comments

@Kunlun-Zhu
Copy link

To my best understanding.

The retriever only returns the doc ID without the chunking method for each document.

I would also suggest API usage for chatGPT, Gemini, Claude, etc in the generator.

@DaoD DaoD added the enhancement New feature or request label May 27, 2024
@DaoD
Copy link
Collaborator

DaoD commented May 27, 2024

@ignorejjj check this

@ignorejjj
Copy link
Member

The retriever will retrieve similar items (including ID and text) from the document corpus. As I understand it, document chunking is employed during corpus construction and does not need to be returned by the retriever.

For the generator, due to various limitations of the black-box model (can't return logits, requiring API costs), we did not implement it initially. To ensure completeness, we plan to implement mainstream API-based models, such as ChatGPT within the next few weeks.

If I have misunderstood anything, please feel free to make suggestions!

@Kunlun-Zhu
Copy link
Author

Thanks for the reply, looking forward to new updates.

@DaoD DaoD reopened this May 28, 2024
@linchen111
Copy link

hope that I can use this to chunk my html ,hhh

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants