Claude Users Report Rapid Quota Depletion: Anthropic Engineer Confirms "Peak Hour" Resource ManagementOver the past week, several Claude users have noticed a significant change in how their usage limits are calculated. Many reported that the standard 5-hour rolling quota is exhausting much faster than usual, hindering long-term projects and continuous workflows that previously ran smoothly.
Confirmation from Anthropic
While Anthropic has not issued a formal press release, Thariq Shihipar, an engineer at the company, clarified on social media that these observations are not a glitch. The shift is part of a strategic resource management plan designed to maintain system stability during periods of extreme demand.
Key details of the temporary adjustment include:
Dynamic Quotas: Claude has implemented a new calculation for the 5-hour usage cap specifically during "Peak Hours." This affects all user tiers, including Free, Pro, and Max subscribers.
The Peak Window: The high-traffic period is identified as weekdays from 5:00 AM to 11:00 AM Pacific Time (PT). During this window, message limits will deplete faster than usual.
Targeted Impact: Approximately 7% of users are significantly affected, most of whom are on the Pro subscription plan.
Weekly Limits Unchanged: Anthropic reassured users that the total aggregate usage quota per week remains the same.
The Workaround
Shihipar admitted that this dynamic scaling might cause confusion. He recommended that users with token-heavy tasks such as large file analysis or complex coding should schedule their work outside of peak hours. Doing so allows for longer, more stable sessions. Anthropic is currently scaling its infrastructure to meet growing demand and promises to provide periodic updates.
The reason Claude (especially the Claude 3.5/4 model) consumes more resources than competitors is due to its extremely large context window. When a large number of users simultaneously input large code files or projects during peak hours in the US, it overloads GPU clusters, requiring resource rationing to prevent system crashes.
Notably, the unaffected group is high-end enterprise customers, who typically have dedicated capacity. This may be a quiet signal from Anthropic encouraging heavy users to shift from individual subscriptions to more stable team platforms.
Even large companies cannot keep up with the exponentially growing demand for data centers. Anthropic's admission of an impact on its Pro customers (7%) reflects that even paying users may experience resource rationing, a trend we'll see more frequently in premium AI services this year.
We're seeing increased use of AI agents performing automated background tasks. These agents run continuously and consume significant tokens. Dynamic quota adjustment is therefore the method Anthropic uses to maintain a balance between "human users" and "automated agents" during peak hours.
Android 17 Beta 3 is Here Universal Windowing and the Return of the Wi-Fi Toggle.
Source: Business Insider
Claude Users Report Rapid Quota Depletion: Anthropic Engineer Confirms "Peak Hour" Resource ManagementOver the past week, several Claude users have noticed a significant change in how their usage limits are calculated. Many reported that the standard 5-hour rolling quota is exhausting much faster than usual, hindering long-term projects and continuous workflows that previously ran smoothly.
Confirmation from Anthropic
While Anthropic has not issued a formal press release, Thariq Shihipar, an engineer at the company, clarified on social media that these observations are not a glitch. The shift is part of a strategic resource management plan designed to maintain system stability during periods of extreme demand.
Key details of the temporary adjustment include:
Dynamic Quotas: Claude has implemented a new calculation for the 5-hour usage cap specifically during "Peak Hours." This affects all user tiers, including Free, Pro, and Max subscribers.
The Peak Window: The high-traffic period is identified as weekdays from 5:00 AM to 11:00 AM Pacific Time (PT). During this window, message limits will deplete faster than usual.
Targeted Impact: Approximately 7% of users are significantly affected, most of whom are on the Pro subscription plan.
Weekly Limits Unchanged: Anthropic reassured users that the total aggregate usage quota per week remains the same.
The Workaround
Shihipar admitted that this dynamic scaling might cause confusion. He recommended that users with token-heavy tasks such as large file analysis or complex coding should schedule their work outside of peak hours. Doing so allows for longer, more stable sessions. Anthropic is currently scaling its infrastructure to meet growing demand and promises to provide periodic updates.
The reason Claude (especially the Claude 3.5/4 model) consumes more resources than competitors is due to its extremely large context window. When a large number of users simultaneously input large code files or projects during peak hours in the US, it overloads GPU clusters, requiring resource rationing to prevent system crashes.
Notably, the unaffected group is high-end enterprise customers, who typically have dedicated capacity. This may be a quiet signal from Anthropic encouraging heavy users to shift from individual subscriptions to more stable team platforms.
Even large companies cannot keep up with the exponentially growing demand for data centers. Anthropic's admission of an impact on its Pro customers (7%) reflects that even paying users may experience resource rationing, a trend we'll see more frequently in premium AI services this year.
We're seeing increased use of AI agents performing automated background tasks. These agents run continuously and consume significant tokens. Dynamic quota adjustment is therefore the method Anthropic uses to maintain a balance between "human users" and "automated agents" during peak hours.
Android 17 Beta 3 is Here Universal Windowing and the Return of the Wi-Fi Toggle.
Source: Business Insider
Comments
Post a Comment