Agnes AI Access Plans, API Keys, RPM Limits, and Subscription Quotas FAQ
Effective date: 2026-06-22

Version: Current public reference version

This document explains the current Agnes AI access plans, API key types, RPM limits, and subscription quotas. The values below are current public reference values as of 2026-06-22. They are operational limits, not permanent guarantees. Agnes AI may adjust these limits based on infrastructure capacity, service stability, product updates, abuse prevention, or commercial policy changes.

1. Overview
Agnes AI currently supports three main access types:

| Access type | Description |
|---|---|
| Free / default users | Users without a Token Plan subscription or Enterprise certification. |
| Enterprise-certified users | Users who have completed Enterprise certification and receive higher baseline access limits. |
| Token Plan users | Users who subscribe to a Token Plan and receive significantly higher model access limits and subscription quotas. |

A user can:

- Use Free / default keys.
- Complete Enterprise certification and use Enterprise-certified keys.
- Subscribe to a Token Plan and use Token Plan keys.
- Be both a Token Plan user and an Enterprise-certified user.
2. Current Access Limits
2.1 Free / Default Users
Free / default users currently have the following access limits:

| Model type | Current limit |
|---|---|
| Text models | 20 actual RPM |
| Image models | Resolution-specific RPM limits apply |
| Video models | 20 actual RPM |
2.2 Enterprise-Certified Users
Enterprise-certified users currently have higher baseline access limits:

| Model type | Current limit |
|---|---|
| Text models | 40 actual RPM |
| Image models | Higher resolution-specific RPM limits apply |
| Video models | 40 actual RPM |
2.3 Token Plan Users
Token Plan users currently have significantly higher access limits:

| Model type | Current limit |
|---|---|
| Text models | 1,000 actual RPM |
| Image models | Higher 1K and 2K image RPM limits apply |
| Video models | 100 actual RPM |
3. Token Plan Subscription Quotas
Token Plan users also receive subscription quotas. These quotas are separate from RPM limits.

| Plan | agnes-2.0-flash | agnes-image-2.0/2.1-flash | agnes-video-v2.0 |
|---|---|---|---|
| Starter | 1,500 requests per 5 hours; 15,000 requests per week | 4,000 images per day | 500 seconds per day |
| Plus | 7,500 requests per 5 hours; 75,000 requests per week | 4,000 images per day | 500 seconds per day |
| Pro | 30,000 requests per 5 hours; 300,000 requests per week | 4,000 images per day | 500 seconds per day |
4. RPM Limits vs Subscription Quotas
RPM limits and subscription quotas are different.

RPM Limits
RPM means requests per minute. RPM limits control how many requests can be processed within one minute. For example:

- Free / default users have 20 actual RPM for text models.
- Enterprise-certified users have 40 actual RPM for text models.
- Token Plan users have 1,000 actual RPM for text models.
Subscription Quotas
Subscription quotas control how much usage is available within a specific time window. For example:

- Starter users can use 1,500 agnes-2.0-flash requests per 5 hours and 15,000 requests per week.
- Plus users can use 7,500 requests per 5 hours and 75,000 requests per week.
- Pro users can use 30,000 requests per 5 hours and 300,000 requests per week.
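The relationship between the 5-hour and weekly quotas can be checked with a little arithmetic. The sketch below copies the agnes-2.0-flash values from the quota table; the dictionary and function names are illustrative only and are not part of any Agnes AI SDK.

```python
# agnes-2.0-flash request quotas, copied from the Token Plan quota table.
TEXT_QUOTAS = {
    "Starter": {"per_5h": 1_500, "per_week": 15_000},
    "Plus": {"per_5h": 7_500, "per_week": 75_000},
    "Pro": {"per_5h": 30_000, "per_week": 300_000},
}

HOURS_PER_WEEK = 7 * 24


def windows_to_exhaust_week(plan: str) -> float:
    """Number of fully used 5-hour windows that add up to the weekly quota."""
    quota = TEXT_QUOTAS[plan]
    return quota["per_week"] / quota["per_5h"]


def sustained_requests_per_hour(plan: str) -> float:
    """Average hourly rate that spreads the weekly quota over the whole week."""
    return TEXT_QUOTAS[plan]["per_week"] / HOURS_PER_WEEK


for plan in TEXT_QUOTAS:
    print(
        plan,
        windows_to_exhaust_week(plan),
        round(sustained_requests_per_hour(plan), 1),
    )
```

For every plan the weekly quota equals ten fully used 5-hour windows, so a workload that consumes the full 5-hour allowance back to back would reach the weekly quota well before the week ends.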
Both Limits Apply
A user must stay within both:

- The RPM limit.
- The subscription quota.
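A client can estimate locally whether one more request fits under both limits before sending it. The following is a minimal sketch, not the service's own accounting: it assumes rolling windows, whereas this document does not state whether the 5-hour and weekly windows are rolling or fixed. The class and method names are hypothetical.

```python
from collections import deque


class LimitTracker:
    """Client-side estimate of whether a request fits under RPM and quota limits.

    Assumption: every window is treated as a rolling window. The real reset
    behavior may differ.
    """

    def __init__(self, rpm_limit: int, quota_per_5h: int, quota_per_week: int):
        # Pairs of (window length in seconds, maximum requests in that window).
        self.windows = [
            (60, rpm_limit),
            (5 * 3600, quota_per_5h),
            (7 * 24 * 3600, quota_per_week),
        ]
        self.sent = deque()  # timestamps of recorded requests, oldest first

    def _count_since(self, now: float, length: float) -> int:
        return sum(1 for t in self.sent if t > now - length)

    def allowed(self, now: float) -> bool:
        """True only if the request stays under every limit at the same time."""
        return all(
            self._count_since(now, length) < limit
            for length, limit in self.windows
        )

    def record(self, now: float) -> None:
        """Remember a sent request and forget ones older than the longest window."""
        self.sent.append(now)
        longest = max(length for length, _ in self.windows)
        while self.sent and self.sent[0] <= now - longest:
            self.sent.popleft()


# Starter plan values from this document: 1,000 text RPM, 1,500 / 5 h, 15,000 / week.
tracker = LimitTracker(rpm_limit=1_000, quota_per_5h=1_500, quota_per_week=15_000)
print(tracker.allowed(now=0.0))
```

Because all keys of one type share a pool, a tracker like this would need to cover every key of that type, not a single key.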
5. API Key Types
Each access type has its own API key type. Current API key types include:

| API key type | Who can use it | Limit behavior |
|---|---|---|
| Free / default key | All users | Shares the Free / default RPM pool |
| Enterprise-certified key | Enterprise-certified users | Shares the Enterprise-certified RPM pool |
| Token Plan key | Token Plan subscribers | Shares the Token Plan RPM and subscription quota pool |
6. API Key Limit Pooling
Limits are applied by key type, not by each individual key. Creating multiple keys of the same type does not increase the total available RPM or quota.

Example 1: Multiple Free Keys
If a Free user creates two Free keys, both keys share the same Free / default RPM pool. The user does not receive a separate 20 RPM limit for each key.

| Key | Key type | Limit pool |
|---|---|---|
| Key A | Free / default key | Free / default RPM pool |
| Key B | Free / default key | Free / default RPM pool |
Example 2: Multiple Token Plan Keys
If a Token Plan user creates multiple Token Plan keys, those keys share the same Token Plan RPM and subscription quota pool.

| Key | Key type | Limit pool |
|---|---|---|
| Key A | Token Plan key | Token Plan RPM and subscription quota pool |
| Key B | Token Plan key | Token Plan RPM and subscription quota pool |
Example 3: User With Free, Enterprise, and Token Plan Keys
A user may have:

- 2 Free / default keys.
- 1 Enterprise-certified key.
- 2 Token Plan keys.

In this case, limits apply as follows:

| Key group | How limits apply |
|---|---|
| 2 Free / default keys | Share one Free / default RPM pool |
| 1 Enterprise-certified key | Uses the Enterprise-certified RPM pool |
| 2 Token Plan keys | Share one Token Plan RPM and subscription quota pool |
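The pooling rule in Example 3 can be expressed as a grouping by key type. This is an illustrative sketch: the key names and the short type labels (`free`, `enterprise`, `token_plan`) are made up for the example, and the RPM values are the text-model limits listed earlier in this document.

```python
from collections import defaultdict

# Text-model RPM per key type, from the access limit tables in this document.
TEXT_RPM_BY_KEY_TYPE = {"free": 20, "enterprise": 40, "token_plan": 1_000}


def pools_for(keys):
    """Group key names by key type; every group shares one limit pool."""
    pools = defaultdict(list)
    for name, key_type in keys:
        pools[key_type].append(name)
    return dict(pools)


# Hypothetical key names matching Example 3.
keys = [
    ("free-a", "free"),
    ("free-b", "free"),
    ("ent-a", "enterprise"),
    ("tp-a", "token_plan"),
    ("tp-b", "token_plan"),
]

pools = pools_for(keys)
for key_type, names in pools.items():
    rpm = TEXT_RPM_BY_KEY_TYPE[key_type]
    print(f"{key_type}: {len(names)} key(s) share {rpm} text RPM")
```

The number of keys in a group never multiplies the limit; only the number of distinct key types determines how many pools a user has.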
7. Can One User Have Multiple Access Types?
Yes. A user can have multiple access types at the same time. For example:

| User status | Available key types |
|---|---|
| Free user | Free / default keys |
| Enterprise-certified user | Free / default keys and Enterprise-certified keys |
| Token Plan user | Free / default keys and Token Plan keys |
| Token Plan + Enterprise-certified user | Free / default keys, Enterprise-certified keys, and Token Plan keys |
8. Are Limits Shared Across Different Key Types?
No. Limits are shared only within the same key type. For example, if a user has both Free keys and Token Plan keys:

| Key type | Limit pool |
|---|---|
| Free / default keys | Share the Free / default RPM pool |
| Token Plan keys | Share the Token Plan RPM and subscription quota pool |
9. What Does “Actual RPM” Mean?
“Actual RPM” refers to the effective number of requests that can be processed per minute. Some systems may use higher request admission thresholds for traffic control, buffering, or abuse prevention. However, the values listed in this document represent the effective public RPM reference limits.

10. Image Model Limits
Image model limits are resolution-specific. Higher resolutions may have lower RPM limits because they consume more infrastructure resources. For example:

- 1K image generation may support higher RPM.
- 2K image generation may support lower RPM than 1K.
- 3K and 4K image generation may have stricter RPM limits.

For details, refer to MODEL_CATALOG.md.
11. Video Model Limits
Video models may be limited by both request frequency and generated video duration. For Token Plan users, the current agnes-video-v2.0 quota is:
| Plan | Video quota |
|---|---|
| Starter | 500 seconds per day |
| Plus | 500 seconds per day |
| Pro | 500 seconds per day |

The current video model RPM limits are:

| User type | Video model RPM |
|---|---|
| Free / default users | 20 actual RPM |
| Enterprise-certified users | 40 actual RPM |
| Token Plan users | 100 actual RPM |
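Because the Token Plan video quota is measured in generated seconds, the number of clips available per day depends on clip length. The sketch below is simple arithmetic on the 500-seconds-per-day figure; the clip lengths are hypothetical examples, not documented output durations of agnes-video-v2.0.

```python
# Daily agnes-video-v2.0 quota for all Token Plan tiers, from the table above.
VIDEO_QUOTA_SECONDS_PER_DAY = 500


def clips_per_day(clip_seconds: int) -> int:
    """Whole clips of the given length that fit into the daily video quota."""
    if clip_seconds <= 0:
        raise ValueError("clip_seconds must be positive")
    return VIDEO_QUOTA_SECONDS_PER_DAY // clip_seconds


# Hypothetical clip lengths, chosen only for illustration.
for seconds in (5, 8, 10):
    print(f"{seconds}s clips: {clips_per_day(seconds)} per day")
```

For clip lengths like these, the daily duration quota is reached long before the 100 RPM limit becomes the constraint.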
12. What Happens If a User Exceeds the RPM Limit?
If a user exceeds the RPM limit, the request may be rejected or rate-limited, and the user may receive a 429 error. To resolve this, the user can:
- Reduce request frequency.
- Add retry and backoff logic.
- Use a higher access tier if eligible.
- Subscribe to a Token Plan for higher RPM limits.
- Complete Enterprise certification for higher baseline limits.
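Retry and backoff logic can be sketched independently of any particular client library. The helper below assumes a caller-supplied `send` function that returns a `(status, body)` pair and that a rate-limited request is reported with status 429. Both are assumptions made for illustration; the actual Agnes AI response format is not described in this document.

```python
import random
import time


def call_with_backoff(send, max_attempts=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep):
    """Call send() until it returns a non-429 status or attempts run out.

    send is a hypothetical callable returning (status, body). The delay
    doubles after each 429, is capped at max_delay, and gets random jitter
    so that parallel clients do not retry in lockstep.
    """
    status, body = None, None
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        if attempt < max_attempts - 1:
            delay = min(max_delay, base_delay * 2 ** attempt)
            sleep(delay + random.uniform(0, delay / 2))
    # Still rate-limited after the final attempt; let the caller decide.
    return status, body
```

Retries only help with short RPM bursts. If the subscription quota is exhausted, retrying will keep failing until the quota resets, so a capped number of attempts is the safer default.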
13. What Happens If a Token Plan User Exceeds the Subscription Quota?
If a Token Plan user exceeds the subscription quota, additional requests may be rejected until the quota resets. For example:

- If the 5-hour text request quota is exhausted, the user must wait for the next 5-hour window.
- If the weekly text request quota is exhausted, the user must wait until the weekly quota resets.
- If the daily image or video quota is exhausted, the user must wait until the daily quota resets.
14. Does Creating More API Keys Increase the Limit?
No. Creating more API keys under the same key type does not increase the total available RPM or quota. For example:

- 1 Free key and 5 Free keys share the same Free / default RPM pool.
- 1 Token Plan key and 5 Token Plan keys share the same Token Plan RPM and subscription quota pool.
- 1 Enterprise-certified key and 5 Enterprise-certified keys share the same Enterprise-certified RPM pool.
15. Recommended Usage by User Type
Free / Default Users
Free / default access is suitable for:

- Basic testing.
- Lightweight personal usage.
- Initial API exploration.
- Low-frequency integrations.
Enterprise-Certified Users
Enterprise-certified access is suitable for:

- Business users.
- Internal testing.
- Higher baseline usage.
- Teams that need higher limits than Free / default users.
Token Plan Users
Token Plan access is suitable for:

- Production usage.
- Higher-frequency API calls.
- Agent workflows.
- Multi-user applications.
- Applications that require higher throughput.
- Applications that use image or video generation frequently.
16. Best Practices
To avoid rate limit or quota issues, developers should:

- Use the correct API key type for the workload.
- Avoid using Free / default keys for production workloads.
- Add retry and exponential backoff for 429 errors.
- Monitor request frequency.
- Monitor daily, 5-hour, and weekly usage.
- Separate environments with different keys, such as development, staging, and production.
- Avoid creating multiple keys for the purpose of bypassing limits.
- Use Token Plan keys for high-frequency or production traffic.
- Use Enterprise-certified keys when higher baseline access is required.
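One way to monitor request frequency is to pace requests on the client so that bursts never exceed the RPM limit. The throttle below is a minimal sketch under the assumption that the limit behaves like a rolling 60-second window; the class is hypothetical and does not reflect how Agnes AI measures RPM internally.

```python
from collections import deque


class RpmThrottle:
    """Tells the caller how long to wait so a rolling 60 s window stays under the limit."""

    def __init__(self, rpm_limit: int):
        self.rpm_limit = rpm_limit
        self.sent = deque()  # timestamps of requests in the last 60 seconds

    def wait_time(self, now: float) -> float:
        """Seconds to wait before another request fits into the window."""
        # Forget requests that have left the 60-second window.
        while self.sent and self.sent[0] <= now - 60:
            self.sent.popleft()
        if len(self.sent) < self.rpm_limit:
            return 0.0
        # The oldest request must age out before a new one fits.
        return self.sent[0] + 60 - now

    def record(self, now: float) -> None:
        self.sent.append(now)


# Free / default text limit from this document: 20 actual RPM.
throttle = RpmThrottle(rpm_limit=20)
print(throttle.wait_time(now=0.0))
```

In real use the caller would pass `time.monotonic()` as `now`, sleep for the returned duration, and then call `record` after sending. One throttle instance should be shared by all keys of the same type, since they draw from the same pool.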
17. FAQ
Q1: Can a Free user create multiple API keys?
Yes. A Free user can create multiple Free / default keys. However, all Free / default keys share the same Free / default RPM pool. Creating more Free keys does not increase the user’s RPM limit.

Q2: Can a Token Plan user still use Free keys?
Yes. A Token Plan user may still have and use Free / default keys. Free / default keys use the Free / default RPM pool. Token Plan keys use the Token Plan RPM and subscription quota pool.

Q3: Can a Token Plan user also become Enterprise-certified?
Yes. A Token Plan user can also complete Enterprise certification. In this case, the user may have access to:

- Free / default keys.
- Enterprise-certified keys.
- Token Plan keys.
Q4: Do Enterprise-certified users automatically get Token Plan quotas?
No. Enterprise certification and Token Plan subscription are separate. Enterprise certification provides higher baseline RPM limits. Token Plan provides higher RPM limits and subscription quotas. A user must subscribe to a Token Plan to receive Token Plan quotas.

Q5: Are Free key limits and Token Plan key limits shared?
No. Free keys and Token Plan keys use separate limit pools. A Free key does not consume the Token Plan quota. A Token Plan key does not consume the Free / default RPM pool.

Q6: Are multiple Token Plan keys counted separately?
No. Multiple Token Plan keys under the same user share the same Token Plan RPM and subscription quota pool. Creating additional Token Plan keys does not increase the total available quota.

Q7: Are image quotas counted by request or by generated image?
For Token Plan users, image quota is currently counted by generated images. The current quota for agnes-image-2.0/2.1-flash is 4,000 images per day.
Resolution-specific RPM limits may also apply.
Q8: Are video quotas counted by request or by duration?
For Token Plan users, video quota is currently counted by generated video duration. The current quota for agnes-video-v2.0 is 500 seconds per day.
Video model RPM limits may also apply.