Context length and slow GPT-4

At this point we rely on the 8k context model in production; though if you have 32k access on your own API key, you're welcome to use that.

When you blow out the prompt with large files, we do indeed use embeddings to pick the most relevant parts of the files. Sometimes you'll see the UI offer other options too. When your conversation gets too long, we recursively summarize it so the model retains some of the earlier context.
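For the curious, the embedding-based selection can be sketched roughly like this. This is a toy illustration, not our actual implementation: the `embed` function here is a bag-of-words stand-in for a real embedding model, and the chunking/scoring parameters are made up for the example.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy stand-in for a real embedding model: a bag-of-words vector.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def top_chunks(query: str, file_text: str, chunk_size: int = 40, k: int = 2):
    # Split the file into fixed-size chunks, score each against the query,
    # and keep only the k most relevant ones for the prompt.
    words = file_text.split()
    chunks = [" ".join(words[i:i + chunk_size])
              for i in range(0, len(words), chunk_size)]
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]
```

The real thing uses learned embeddings and smarter chunk boundaries, but the shape is the same: chunk, score against the query, keep the top few.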

(FWIW, in our experience the 32k model doesn't really look at things that are more than 8k tokens back in its context window.)