Hi everyone. Here are more details on usage:
- We show you what is reported by AI providers based on your consumption.
- High token usage means higher consumption. Also heavier models cost more.
- One request can be 100 tokens or 1 million. which can be < $0.01 or > $100.
- Make sure to use focused and shorter chats as over many tool calls and follow up requests the token usage accumulates.
- We are improving Auto models, they will get better as well.
- Thinking models usually do not perform so much better than non-thinking models but they consume more tokens.
- Avoid attaching files, rules, MCPs when not necessary as they add up context usage.
More on token usage, how it affects your consumption and how to optimize it: