May 19th, 2025

API

App

Characters

Venice.ai Change Log - May 13th - May 19th, 2025

The big release over the last week was the launch of Venice Search V2 - a complete overhaul on how our search function operates. This was implemented for both our App and API users. Venice search is now:

Smarter
Now uses AI to generate search queries based on chat context rather than directly searching input text. This results in more contextually relevant information being injected to the conversation, and better overall responses.

Cleaner
Only displays sources actually referenced in the response, using superscripts. These reference the citations provided below the search.

Broader
We inject a greater number of results with additional information per result into the context.


App

  • Released Venice Search V2.

  • When switching models, the Top-P and Temperature settings will now automatically default to the optimal setting for that specific model. Additionally, a UI element was added to show what that default for the model is. This should remedy issues with temperatures changing as users move through models resulting in potential gibberish in responses.

  • Adjust the “image prompt enhancer” to keep its responses below the character limit for image generation.

  • Add a link to the hugging face model card from within the Image Detail view.

  • Add a w/ web search banner to responses that have included web search.

  • When using shorten or elaborate, the current selected model will be used for the response, vs. the model that the original message was generated from.

  • Using the space bar will now trigger the “accept” button within confirmation screens.

API

  • Released Venice Search V2.

  • Added support for purchase of API credits with Crypto via Coinbase Commerce.

  • Add support for strip_thinking_response for reasoning models. This will suppress the <think></think> blocks server side, preventing them from reaching the client. Works in tandem with /no_think on the Qwen3 models. API docs have been updated for the parameter, and the model feature suffix docs have also been updated. Satisfies this Featurebase.

  • Add support for disable_thinking for reasoning models. This will add /no_think in the background, and enable strip_thinking_response - API docs have been updated and the model feature suffix docs have been updated.

  • Add support for enable_web_citations - This will instruct the LLM to reference the citations it used generating its responses when Web Search is enabled. API docs have been updated and the model feature suffix docs have been updated.

  • Remove 4x option and show "max" in its place. This will leverage the above change on the API to allow images that can't 4x upscale to be uploaded. This will still block images that are > 4096 x 4096 since the scale can't be less than 1.

  • When upscaling, if scale is set to 4, dynamically reset it so that the maximum final output size is always less than the max pixel size of our upscaler.

  • Added a model compatibility mapper for gpt-4.1 to map to Venice Large / Qwen 3 235B.

  • API Key Creation is now rate limited to 20 new keys per minute with a total of 500 keys per user.

Characters

  • Added a limit to character names to prevent issues within the UI.

  • Fixed up character display for characters with excessive display information that was previously breaking the page layout.

  • When using the auto-generate character feature, a confirmation box will be presented first to avoid overwriting existing details on accident.