Follow new updates and improvements to Venice.ai.
June 9th, 2025
API
App
Last week, our engineering team focused heavily on platform stability and reliability along with preparing a number of new features to release to our beta test group. We look forward to sharing those out over the coming week.
App
Images will now render in the UI as they’ve completed vs. waiting for all four to complete for display.
Added (i) info buttons to text chats to permit a view of the details of the message (like temperature and top-p).
Added sign-up link in the left navigation menu when signed out.
Added UI elements that display upscale/enhance images in the chat history. Upscales > 1x will still be automatically downloaded.
Added a UI timer to the upscaler message UI to provide some sense of how long the process has been running.
Clarified wording on error messages related to capacity on upscale / enhanced images.
Fixed a rendering bug related to escaped dollar signs on code blocks in chat.
Fixed model specific settings not appropriately applying when selecting “Use these settings”
Fixed an issue with default settings not resetting across all models as expected.
Fixed a bug related to the “download all images” button missing on new chats.
Deprecated the code specific section of the web app to streamline the web UI.
API
Updated upscale/enhance API to provide more clear error messages when images are outside the bounds of supported sizes.
Added support for top_logprobs
parameter as part of the chat completions API.
Optimized the performance of the API keys list endpoint used in the app UI and available in the API particularly benefiting users with substantial API traffic.
June 2nd, 2025
API
App
Given the short week, the team’s engineering efforts this week was primarily focused behind the scenes on reducing model error rates, and on a set of new features that are coming soon.
App
Added a hover action state on images.
Fixed a bug preventing the mobile share screen from appearing when with sharing an image on mobile devices.
Fixed a bug that prevented horizontal scrolling when viewing complex LaTeX formulas wider than the viewport.
Fixed a bug that would cause images to stop displaying after viewing the (i) info details.
Disabled negative prompt on HiDream.
Added support for multiple image processing to Qwen VL.
Added new Character setting to hide model reasoning / thinking output for relevant models on character conversations.
Updated “Reasoning” text to reference “Thinking” for character conversations.
May 27th, 2025
API
App
Models
Bugfixes / Misc
Features
Token
Characters
It’s been a busy week for our team with 3 major releases, and a slew of updates and bug fixes for you:
Simple Mode for Pro Users
Venice… Enhance any Image
New Image Model: HiDream
We're about to roll out the biggest upgrade to the Venice's user experience since our launch: Simple Mode.
Simple Mode provides a streamlined experience by automatically routing your prompts, eliminating the need to manually select a chat type or model (if you ask for an image, it will just know it, without any conception of "Image Mode").
We’ve also added the concept of "image context", which allows you to build images through subsequent prompts. We believe for new users, this will dramatically simplify the user experience, making Venice feel and behave more like other generative AI platforms.
How can you help?
1. Go to your App settings and toggle on Simple Mode. You will see that the model dropdown, and mode selector, will disappear.
2. Chat with Venice and try different prompts in natural language. "Tell me about France" or "Create image of France" etc. Text, web search, image generation, all of it should work automatically.
3. Send your experiences and feedback to support@venice.ai. Please include screenshots and videos, or share your chat for things that don't seem to work as expected.
We appreciate you taking the time and look forward to hearing your feedback.
Pro users can now upscale and enhance any image. Click on the paperclip or drag and drop an image into the image generation chat.
Upscale & Enhance system represents a significant leap beyond traditional pixel-by-pixel methods:
• Use Upscale for accurate enlargements: Use the Upscale feature when you want to maintain the exact style and composition of your image while increasing resolution.
• Use Enhance for creative enhancements: Use the Enhance feature when you want to add details and refinements beyond what's in the original image.
Read more about our latest model in our announcement tweet.
Key Features Prompt adherence - Achieves industry-leading scores on GenEval and DPG benchmarks, outperforming all other open-source models.
Text placement - Effective and accurate text generation and placement, making it useful for graphic design purposes.
Commercial-friendly - Generated images can be freely used for personal projects, scientific research, and other applications.
Added an image generation loading state to make it more clear that the UI is waiting on inference.
Added a setting to enable display of message timestamps in the App Settings - Featurebase
Fix crash when editing a chat message and using Space key. - Featurebase
Updated the error when documents too large to be processed to be human readable.
Refactored context filtering logic to prevent Your message is too long for the selected model
errors when dealing with chat conversations with large histories.
Ensure that white space submitted in system prompts sent from the app are preserved all the way to the LLM.
Improved error messages when there are issues communicating with Venice’s inference servers.
Update the image generation prompt enhancer to ensure messages are constrained to fit within the image prompt limits.
Fixed a regression in streaming responses when invoking tool calling. Added test coverage to avoid future regressions.
Enabled support for response_format
to 405B and Deepseek. Ensure thinking models don't inject <think> tags into the response when response format is enabled.
Return 504 timeouts when non streaming requests timeout. Recommend use of streaming responses when inference is expected to run for longer durations.
Remove default `max_tokens` on API calls, permitting inference to use the full context of the LLM unless the user specifies a specific max_tokens
Introduced a character avatar popover on larger screens, displaying detailed character information and a quick chat option.
Added display of character creation date and creator information on public character profiles.
Introduced clearer visual grouping and dividers in character menus.
Added a "View & Rate Character" menu item for improved access to rating features
Updated the token dashboard to fix a few UX issues when used via mobile wallet.
May 19th, 2025
API
App
Characters
The big release over the last week was the launch of Venice Search V2 - a complete overhaul on how our search function operates. This was implemented for both our App and API users. Venice search is now:
Smarter
Now uses AI to generate search queries based on chat context rather than directly searching input text. This results in more contextually relevant information being injected to the conversation, and better overall responses.
Cleaner
Only displays sources actually referenced in the response, using superscripts. These reference the citations provided below the search.
Broader
We inject a greater number of results with additional information per result into the context.
Released Venice Search V2.
When switching models, the Top-P and Temperature settings will now automatically default to the optimal setting for that specific model. Additionally, a UI element was added to show what that default for the model is. This should remedy issues with temperatures changing as users move through models resulting in potential gibberish in responses.
Adjust the “image prompt enhancer” to keep its responses below the character limit for image generation.
Add a link to the hugging face model card from within the Image Detail view.
Add a w/ web search
banner to responses that have included web search.
When using shorten or elaborate, the current selected model will be used for the response, vs. the model that the original message was generated from.
Using the space bar will now trigger the “accept” button within confirmation screens.
Released Venice Search V2.
Added support for purchase of API credits with Crypto via Coinbase Commerce.
Add support for strip_thinking_response
for reasoning models. This will suppress the <think></think>
blocks server side, preventing them from reaching the client. Works in tandem with /no_think
on the Qwen3 models. API docs have been updated for the parameter, and the model feature suffix docs have also been updated. Satisfies this Featurebase.
Add support for disable_thinking
for reasoning models. This will add /no_think
in the background, and enable strip_thinking_response
- API docs have been updated and the model feature suffix docs have been updated.
Add support for enable_web_citations
- This will instruct the LLM to reference the citations it used generating its responses when Web Search is enabled. API docs have been updated and the model feature suffix docs have been updated.
Remove 4x option and show "max" in its place. This will leverage the above change on the API to allow images that can't 4x upscale to be uploaded. This will still block images that are > 4096 x 4096 since the scale can't be less than 1.
When upscaling, if scale is set to 4, dynamically reset it so that the maximum final output size is always less than the max pixel size of our upscaler.
Added a model compatibility mapper for gpt-4.1
to map to Venice Large / Qwen 3 235B
.
API Key Creation is now rate limited to 20 new keys per minute with a total of 500 keys per user.
Added a limit to character names to prevent issues within the UI.
Fixed up character display for characters with excessive display information that was previously breaking the page layout.
When using the auto-generate character feature, a confirmation box will be presented first to avoid overwriting existing details on accident.
May 12th, 2025
API
App
Models
Token
Characters
It’s been a big week of updates with many additional updates and enhancements coming over the next few weeks. Please check out our beta group on Discord if you’d like to participate in early testing of our new features.
Thank you to the community for all the helpful feedback after our new model paradigm launched. We will always refine our offering and your opinions are immensely helpful. We've decided to make a couple model changes.
Llama4 Maverick (aka Venice Large) is being retired May 12th
Zuck failed us on this one and it needs to go. It has been replaced with the new Qwen3 235B as our Venice Large model. The Venice beta users have been enjoying it, almost universally prefering it to Maverick.
Llama 3.2 3B (aka Venice Small) is also being retired.
This model had a good run, but a plainly superior option exists now. The new Qwen3 4B will replace it as our Venice Small model. Beta users also very positive on this one. Both of the retiring models will remain in the app+api for 2 weeks under their own names. Maverick will then be taken out to pasture, removed from both app+api. Llama 3.2 3B will leave the app, and remain in API for some time.
Deepseek retirement has been postponed to May 30th
Additionally, we’ve heard your feedback RE: Deepseek’s retirement and we’re thinking through options. The retirement for Deepseek has now been moved to May 30th and we’ll provide another update before then.
Inpainting Deprecation
We are re-engineering Venice’s in-painting feature set to better serve the use cases we’ve now seen from our users. We are going to deprecate the current version from the app and the API next Monday while we work on the new release.
In the interim, we encourage users to experiment with Venice’s “Enhance image” feature which can create neat re-creations of images.
We've released an update that should alleviate grammatical errors and missing characters from longer conversations, most notably on 405B. If you continue to see those issues, please use the report conversation feature. Thank you for the existing reports -- they were very helpful in tracking down the issue.
Updated the Report Conversation
feature to allow for self reported categorization of the issues. This helps our team identify trends and issues with models faster.
Added a Reasoning toggle for reasoning models that support enabling or disabling thinking responses.
Added a warning within the chat input for users who have increased temperature into bounds known to create gibberish / garbage responses.
Updated the Venice system prompt to reduce likelihood of Venice referencing details about itself in responses unless prompted about Venice.
Streamlined the share chat functionality to immediately copy the share URL to the clipboard vs. requiring a second click.
Updated the UI to disable the upscale / enhance button if both upscale was turned off and enhance was disabled.
Updated the UI to only copy the user’s prompt when copying prompt + images messages.
Updated the UI to view image options when viewing image variants in grid format.
Fixed a bug where non-pro users were unable to upload documents or images for analysis.
Fixed a bug when editing messages containing code blocks that would result in certain characters being improperly escaped.
Ensure full EXIF / ICC profile is maintained when using the upscale / enhance feature. Fixes this Featurebase request which had two [1] [2] new reports.
Security Notice - Fixed a bug reported via our bug bounty program that permitted API keys marked as inference only be able to manipulate the API key admin endpoint. This would have permitted these inference only keys to add, or remove other API keys. Please validate active API keys created between April 22nd, 2025 and May 7th, 2025 to ensure their validity.
Explorer Tier Deprecation - As Venice continues its growth, we’re seeing our API usage reaching all-time highs. Following our announcement last month, we have changed our Pro account API access.
Previously, Pro users had unlimited access to our Explorer Tier API with lower rate limits. We have now deprecated the Explorer Tier, and all new Pro subscribers will automatically receive a one-time $10 API credit when they upgrade to Pro –double the credit amount compared to competitors.
This credit provides substantial capacity for testing and small applications, with seamless pathways to scale via VVV staking or direct USD payments for larger implementations. This change reflects our API’s maturation from its beta to the enterprise-ready service that developers are increasingly building on the API.
Ensure full EXIF / ICC profile is maintained when using the upscale / enhance feature. Fixes this Featurebase request which had two [1] [2] new reports.
Add support for OpenAI Embedding names to the Embeddings API via Model Compatibility Mapper.
We've released an update that should alleviate grammatical errors and missing characters from longer conversations, most notably on 405B. If you continue to see those issues, please use the report conversation feature. Thank you for the existing reports -- they were very helpful in tracking down the issue.
Add support for JSON payload to Upscale / Enhance API - API docs are updated - Postman example.
Fixed a bug that caused the created
field on the OpenAI compatible image generations API to ensure it's coming back as an int and not a float.
Fixed a bug causing model_feature_suffix
features from properly updating their respective flags. Added additional test coverage to ensure this avoids a regression.
Updated the token dashboard coloring.
Redirect identifiable mobile wallets to the token dashboard when accessing https://venice.ai and hide the PWA installation modal.
Updated Character UI with imports and rating stats on the primary character cards.
Added a UI feature to show which source character a user’s character was cloned from.
May 5th, 2025
API
App
Models
Characters
Venice simplified its model selection with a curated list of LLMs, categorized into five distinct models: Venice Uncensored, Venice Reasoning, Venice Small, Venice Medium, and Venice Large. You can find more details in our blog post here.
The new models include the Dolphin Mistral 24B Venice Edition, Venice's most uncensored model ever, and Llama 4 Maverick, a vision-enabled model with a 256K token context window. Several legacy models, including DeepSeek R1, Llama 3.3 70B, and Dolphin 72B, will be retired from the chat interface by May 30. The changes aim to reduce model redundancy, improve user experience, and increase infrastructure scalability.
All current models remain available through the Venice API.
Implemented a substantial revision to search behavior to ensure search results are more effectively integrated into the context.
Added support for “Enhance Only” mode via the app. This permits the endpoint to be used solely for enhance without changing the output resolution of the image:
Added a prompt for users to permit the browser to persist local storage when their browser storage is becoming full.
Fixed a scrolling bug for users with character chats per this Featurebase.
Added some guidance to the app suggesting using descriptive prompts or the enhance prompt feature when using the Venice SD35 image model.
Added in-app guidance when the Temperature setting has been set very high to indicate the LLM may return Gibberish.
Added a subscription renewal flow for Crypto user’s who wish to renew their subscription.
Fixed a bug where upscale / enhance requests could return blank / black images.
Adjusted the pre-processing for in-painting to increase reliability of generation.
Fixed a bug where the Input History Navigation
setting in App settings was not properly controlling the feature behavior per this Featurebase.
Improved character search UI.
Update UI to permit free and anonymous Venice users to see the characters detail modal.
Added pricing information to the /models
endpoint per this request from Featurebase. API docs have been updated.
Increased Token per Minute (TPM) rate limits on medium and large models given Maverick can produce a large number of tokens quickly. API docs have been updated.
Added support for a 1x scale
parameter to the upscale / enhancement API endpoint. This permits the endpoint to be used solely for enhance without changing the output resolution of the image. Solves this Featurebase. API docs have been updated.
Added a new API route to export billing usage data. API docs have been updated.
Added support for the logprobs
parameter on our /chat/completions
API. API docs have been updated.
Added a UI to the API settings page to export billing history from the UI.
Added support for fractional scale
parameters to the upscale / enhancement API endpoint.
Updated the API to require application/json
headers on JSON related endpoints.
Return additional detail in error message if a model can not be found to assist user’s in debugging the issue.
Added support for Tools / Function Calling to Maverick.
Launched an /embeddings
endpoint in beta for Venice beta testers. API docs have been updated.
April 28th, 2025
API
App
Last week, we complete the migration of our backend infrastructure to our next generation platform. This transition reduces latency and cost and enables support for new types of inference (including video). We look forward to the next round of feature work this new infrastructure will support.
Added support for bulk chat deletion. Select the three dots to the right of the chats header to find the option.
Swapped out browser confirmation windows with native confirmation windows to allow them to be displayed on mobile crypto wallet browsers (like Coinbase Wallet).
Updated our Wallet Connect packages to fix a bug that was causing a notice to pop up prompting to change wallet networks.
Added additional delay to the character search to avoid re-rendering while search results are in flight.
Updated navigation on character pages making them easier to navigate on mobile.
Added support to allow for the “None” image style to be added to favorites.
Fixed a bug that was causing scroll issues in the left nav bar.
Venice's new upscale enhance mode is now available through the existing upscale API.
Added an OpenAI compatible image generation endpoint. This endpoint supports fewer configuration parameters than our full image endpoint but will work with OpenAI compatible image generation libraries without modification. Docs are updated.
Added support for json_object
type under response_format
for LLM API requests. Docs are updated.
Changed default image model to venice-sd35
to match app default.
Return more helpful errors when LLM API requests fail to execute on the runners. This should make debugging complex json_schemas easier to understand.
April 22nd, 2025
App
Features
Characters
It’s been a big week of updates at Venice and we’re excited to share these platform updates with you. The major changes this week include:
Venice Image Engine V2
Editing of Conversation History
Launch of the Character rating system
Migration of all app inference to our new backend
Venice Image Engine v2 is now live and represents a comprehensive overhaul of our image generation infrastructure, delivering the highest quality results across a range of styles.
Venice Image Engine v2 consists of two major components:
1. Venice SD35 (New Default model): Custom-configured Stable Diffusion 3.5 engine powered by a Comfy UI workflow backend.
2. Upscale & Enhance (Pro only): Completely new upscaling architecture with proprietary enhancement that creatively in-fills new pixels at the higher resolution.
Venice implemented substantial improvements to the base Stable Diffusion 3.5 architecture:
Natural language understanding:
Venice SD35 processes conversational language more effectively, so you can describe images as you would to a human artist instead of using awkward keyword lists.
Custom image generation pipeline:
A specialized image generation Comfy UI workflow on the backend that delivers superior results through optimized processing techniques that standard implementations don't provide.
Let’s compare our new default image model Venice SD35 to our previous default image model Fluently:
![]() | ![]() |
![]() | ![]() |
![]() | ![]() |
The new Upscale & Enhance system represents a significant leap beyond traditional pixel-by-pixel methods:
Use Upscale for accurate enlargements:
Use the Upscale feature when you want to maintain the exact style and composition of your image while increasing resolution.
Use Enhancer for creative enhancements:
Use the Enhance feature when you want to add details and refinements beyond what's in the original image. Adjust the creativity setting based on how much creative liberty you want the AI to take. For consistent results we suggest loading in your original prompt in the prompt field.
Combine for professional outputs:
Combine both approaches by first generating with Venice SD35, then upscaling, and finally applying a subtle enhancement for the highest quality results. Let’s compare zoomed-in screenshots for the following image.
You can read more about this new feature in this Venice blog post.
Added character ratings to public Venice characters. Now, Venice users can help curate the best characters on the platform by adding their ratings and perspective to each character.
Closing a character modal with unsaved changes will now prompt the user to avoid losing unsaved work.
The save button in the character edit modal is now sticky at the bottom of the window.
Character chats are now consolidated into the character section of the left nav bar.
Launched the ability for Pro members to edit history in their conversations. This was one of the most requested items on Featurebase with 371 votes. Huge shout out to our beta testing team who worked with us to perfect this feature.
Migrated all chat inference from the Venice app to use Venice’s new backend infrastructure. This change should result in increased performance and maintainability as we continue to scale the business.
Folders / Characters in the left nav bar are sorted by recent activity.
Increased the max response token limit on reasoning models 4x to avoid messages being cut off when reasoning extends for long periods of time / high token counts.
Updated our Markdown rendering to better format messages containing $ signs. This fixes a bug where certain responses from the LLMs would be inappropriately displayed as LaTeX formatted text.
Added a setting to the App settings panel to remove the warning on external links:
Added a progress bar to the UI when deleting chat history — this makes it more clear what’s going for users with large histories they are deleting.
April 14th, 2025
App
Models
Bugfixes / Misc
Over the last week, the Venice engineering team has been focused on substantial infrastructure overhaul to our backend inference APIs to improve performance and reliability and support additional scale. Additionally, the team has been working on a comprehensive model curation and testing framework, and preparing to launch a revised image generation infrastructure.
These features will manifest themselves into user visible updates over the coming weeks and we are excited release these updates.
Revised our Character moderation system to reduce time to approvals.
Models
Added support for web search to Llama 3B.
App
Support horizontal scroll for Mathjax containers on mobile.
Updated code block and inline code styling for improved readability.
Fixed a bug that was causing white space to be stripped from uploaded documents prior to LLM processing.
April 6th, 2025
API
App
Models
Mistral 24B is now the default Venice model for all users, bringing multi-modal capabilities to every one. Llama 70B has been updated to a Pro model.
Launched a preview of our new image inference infrastructure to our beta testers.
Image styles within the image settings are now searchable.
Added Conversation Forking to allow splitting conversations in new directions per this Featurebase request.
Added a “Jump to Bottom” mechanism per the Featurebase request.
Updated the app background to prevent a white flicker when loading the app.
Support pressing Enter to save the name when re-naming a conversation.
Update token dashboard to show 2 digits of prevision for staked VVV.
Updated authentication logic to programmatically reload authentication tokens if they time out due to the window being backgrounded.
Prevent the sidebar from automatically opening when the window is small.
Fixed a bug with text settings not properly saving.
Update image variant generation to more gracefully handle error cases.
Launched a new backend service to support the growth of Venice — migrated authentication, image generation and upscale to that service. This new service should be more performant and provide Venice a better platform to scale our user continued user growth on.
Added the nextEpochBegins
key to the api_keys/rate_limits
endpoint. Docs have been updated. Solves this Featurebase request.
Added response_format
support to Qwen VL in the API.
Fixed a bug where messages to Mistral including a reasoning_content
null parameter would throw an error.