This document was translated from Chinese by AI and has not yet been reviewed.
An Assistant lets you personalize the selected model with prompt presets, parameter presets, and other settings, so the model behaves more in line with your expectations.
The System Default Assistant provides a general parameter preset (no prompt). You can use it directly, or find a preset that suits you on the Agents page.
An Assistant is the parent of Topics: multiple topics (i.e., conversations) can be created under a single assistant, and all topics share the assistant's parameter settings, prompt presets, and other model settings.
New Topic creates a new topic within the current assistant.
Upload Image or Document: uploading images requires a model that supports them; uploaded documents are automatically parsed into text and provided to the model as context.
Web Search requires configuring the web search settings first. Search results are returned to the model as context. For details, see Web Search Mode.
Knowledge Base enables the knowledge base feature. For details, see the Knowledge Base Tutorial.
MCP Server enables the MCP server feature. For details, see the MCP Usage Tutorial.
Generate Image is not displayed by default. For models that support image generation (e.g., Gemini), it must be enabled manually before images can be generated.
Select Model switches the rest of the conversation to the specified model while preserving the context.
Quick Phrases requires setting up common phrases in the settings first; they can then be invoked here for direct input, and variables are supported.
Clear Messages deletes all content under the current topic.
Expand enlarges the input box for entering longer text.
Clear Context truncates the context available to the model without deleting any content, meaning the model "forgets" the previous conversation.
Estimated Token Count displays the estimated token usage. The four values are Current Context Count, Maximum Context Count (∞ indicates unlimited context), Current Input Box Message Character Count, and Estimated Token Count.
Translate translates the content of the current input box into English.
Model settings are synchronized with the Model Settings parameters in the assistant settings. For details, see Assistant Settings.
Message Separator: uses a divider to separate the message body from the action bar.
Use Serif Font: toggles the font style. You can now also change fonts via custom CSS.
Display Line Numbers for Code: shows line numbers in code blocks when the model outputs code snippets.
Collapsible Code Blocks: when enabled, long code snippets are automatically collapsed.
Code Block Word Wrap: when enabled, lines of code that are too long (exceeding the window width) are automatically wrapped.
Auto-collapse Thinking Content: when enabled, models that support thinking automatically collapse the thinking process once it is finished.
Message Style: switches the conversation interface between bubble style and list style.
Code Style: switches the display style of code snippets.
Mathematical Formula Engine: KaTeX renders faster because it is specifically optimized for performance; MathJax renders more slowly but is more comprehensive, supporting more mathematical symbols and commands.
Message Font Size: adjusts the font size of the conversation interface.
Display Estimated Token Count: shows the estimated number of tokens the text in the input box will consume (not the actual context token consumption; for reference only).
Paste Long Text as File: when a long block of text is pasted into the input box from elsewhere, it is automatically displayed as a file to reduce interference with subsequent input.
Markdown Render Input Messages: when disabled, only model replies are rendered as Markdown, not the messages you send.
Triple Space for Translation: pressing the spacebar three times in a row after typing a message in the input box translates the content into English. Note: this overwrites the original text.
Target Language: sets the target language for the input box translation button and the triple-space translation feature.
In the assistant interface, right-click the assistant you want to configure and choose the corresponding setting from the menu.
Name: a custom assistant name for easy identification.
Prompt: the prompt text. You can refer to the prompt writing style on the Agents page when editing it.
Default Model: pins a default model to this assistant. When the assistant is added from the Agents page or duplicated, it starts with this model. If this is not set, the initial model is the global default (i.e., the Default Assistant Model).
Auto Reset Model: when enabled, if you switch to another model during a conversation, creating a new topic resets that topic to the assistant's default model. When disabled, a new topic inherits the model used in the previous topic.
For example, if the assistant's default model is gpt-3.5-turbo and you create Topic 1 under this assistant, then switch to gpt-4o during the conversation in Topic 1:
If Auto Reset Model is enabled: Topic 2 will default to gpt-3.5-turbo.
If it is disabled: Topic 2 will default to gpt-4o.
Temperature: controls the degree of randomness and creativity in the text generated by the model (default value 0.7). Specifically:
Low temperature values (0-0.3):
Output is more deterministic and focused.
Suitable for scenarios requiring accuracy, such as code generation and data analysis.
Tends to select the most probable words for output.
Medium temperature values (0.4-0.7):
Balances creativity and coherence.
Suitable for daily conversations and general writing.
Recommended for chatbot conversations (around 0.5).
High temperature values (0.8-1.0):
Produces more creative and diverse output.
Suitable for creative writing, brainstorming, etc.
May reduce text coherence.
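To make the parameter concrete, the sketch below shows how temperature is typically passed to an OpenAI-compatible chat endpoint. The URL, key, and model name are placeholders, and Cherry Studio sends this field for you based on the assistant settings; this is only an illustration of where the value ends up.

```python
import requests

API_URL = "https://api.example.com/v1/chat/completions"  # placeholder endpoint
API_KEY = "sk-xxxx"                                       # placeholder key

payload = {
    "model": "gpt-3.5-turbo",  # any chat model offered by your provider
    "messages": [{"role": "user", "content": "Write a short product slogan."}],
    "temperature": 0.7,        # 0-0.3 focused, 0.4-0.7 balanced, 0.8-1.0 creative
}

resp = requests.post(API_URL, json=payload,
                     headers={"Authorization": f"Bearer {API_KEY}"})
print(resp.json()["choices"][0]["message"]["content"])
```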
Top P (Nucleus Sampling): the default value is 1. Smaller values make the output more monotone and easier to understand; larger values give the model a wider, more diverse vocabulary to reply with.
Nucleus sampling influences the output by controlling the probability threshold used when selecting candidate words:
Smaller values (0.1-0.3):
Only consider words with the highest probability.
Output is more conservative and controllable.
Suitable for scenarios like code comments and technical documentation.
Medium values (0.4-0.6):
Balances vocabulary diversity and accuracy.
Suitable for general conversations and writing tasks.
Larger values (0.7-1.0):
Considers a wider range of vocabulary.
Produces richer and more diverse content.
Suitable for creative writing and other scenarios requiring diverse expression.
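As a rough illustration of the mechanism (not Cherry Studio's actual implementation), nucleus sampling keeps only the most probable candidate words whose cumulative probability reaches Top P and then samples from that reduced set. The toy function below, with made-up probabilities, shows how a smaller value narrows the choices.

```python
import random

def nucleus_sample(token_probs: dict, top_p: float) -> str:
    """Toy nucleus sampling: keep the most probable tokens until their
    cumulative probability reaches top_p, then sample from that set."""
    ranked = sorted(token_probs.items(), key=lambda kv: kv[1], reverse=True)
    kept, cumulative = [], 0.0
    for token, prob in ranked:
        kept.append((token, prob))
        cumulative += prob
        if cumulative >= top_p:
            break
    tokens, weights = zip(*kept)
    return random.choices(tokens, weights=weights, k=1)[0]

# Made-up next-word probabilities: with top_p=0.3 only "cat" can be chosen,
# while top_p=1.0 keeps every candidate in play.
probs = {"cat": 0.5, "dog": 0.3, "bird": 0.15, "fish": 0.05}
print(nucleus_sample(probs, top_p=0.3), nucleus_sample(probs, top_p=1.0))
```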
Context Window: the number of messages to keep in the context. A larger value means a longer context and more token consumption:
5-10: Suitable for general conversations.
10+: For complex tasks that require longer memory (e.g., generating long texts step by step from an outline, where the generated context must stay logically coherent).
Note: More messages consume more tokens.
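Conceptually, the context window setting caps how many past messages are sent with each request; a simplified sketch of the idea (illustration only, not Cherry Studio's actual code) looks like this:

```python
def build_context(history: list, context_window: int) -> list:
    """Keep only the most recent `context_window` messages for the next request.
    A non-positive value stands in for the unlimited (∞) case here."""
    if context_window <= 0:
        return history
    return history[-context_window:]

history = [{"role": "user", "content": f"message {i}"} for i in range(20)]
print(len(build_context(history, context_window=10)))  # -> 10
```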
Enable Message Length Limit (MaxToken): the maximum number of tokens for a single response. In large language models, max tokens is a crucial parameter that directly affects the quality and length of the model's generated responses.
For example, when testing whether a model is reachable after entering a key in Cherry Studio, if you only need to know whether the model replies at all and do not care about the content, you can set MaxToken to 1.
The MaxToken limit for most models is 32k tokens, but some support 64k or even more; check the model's documentation for details.
The specific setting depends on your needs, but you can also refer to the following suggestions.
Suggestions:
General Chat: 500-800
Short Text Generation: 800-2000
Code Generation: 2000-3600
Long Text Generation: 4000 and above (requires model support)
Generally, the model's response stays within the MaxToken limit, but it may be truncated (e.g., when writing long code) or come out incomplete; in special cases, adjust the value flexibly according to the actual situation.
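For instance, the connectivity test described above amounts to a request with max_tokens set to 1 against an OpenAI-compatible endpoint (URL, key, and model name below are placeholders, for illustration only):

```python
import requests

API_URL = "https://api.example.com/v1/chat/completions"  # placeholder endpoint
API_KEY = "sk-xxxx"                                       # placeholder key

payload = {
    "model": "gpt-3.5-turbo",
    "messages": [{"role": "user", "content": "ping"}],
    "max_tokens": 1,  # we only care whether a reply comes back, not its content
}

resp = requests.post(API_URL, json=payload,
                     headers={"Authorization": f"Bearer {API_KEY}"})
print("model reachable" if resp.ok else f"request failed: {resp.status_code}")
```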
Streaming Output (Stream): streaming output transmits and processes data as a continuous stream rather than sending everything at once, so content can be processed and displayed as soon as it is generated, greatly improving responsiveness.
In a client like Cherry Studio, this essentially means a "typewriter effect."
When disabled (non-streaming): the model generates the full response and outputs it in one go (imagine receiving a complete message on WeChat).
When enabled: output arrives character by character; you can think of it as the model sending you each character as soon as it is generated, until the response is complete.
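Under the hood, streaming means the client reads the response incrementally instead of waiting for the complete body. OpenAI-compatible endpoints typically stream server-sent events; the sketch below (placeholder URL and key, illustration only) prints each piece as it arrives, producing the typewriter effect.

```python
import json
import requests

API_URL = "https://api.example.com/v1/chat/completions"  # placeholder endpoint
API_KEY = "sk-xxxx"                                       # placeholder key

payload = {
    "model": "gpt-3.5-turbo",
    "messages": [{"role": "user", "content": "Tell me a short story."}],
    "stream": True,  # ask the server to send tokens as they are generated
}

with requests.post(API_URL, json=payload, stream=True,
                   headers={"Authorization": f"Bearer {API_KEY}"}) as resp:
    for line in resp.iter_lines():
        if not line or not line.startswith(b"data: "):
            continue
        chunk = line[len(b"data: "):]
        if chunk == b"[DONE]":
            break
        delta = json.loads(chunk)["choices"][0]["delta"].get("content", "")
        print(delta, end="", flush=True)  # the "typewriter effect"
```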
Custom Parameters: adds extra parameters to the request body, such as presence_penalty and other fields. Most users will not need this.
Parameters such as top_p, max_tokens, and stream mentioned above are examples of such parameters.
Input format: Parameter Name — Parameter Type (text, number, etc.) — Value. Reference documentation: Click to go.
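As an illustration, a custom parameter is simply one more field added to the request body. Assuming an OpenAI-compatible provider that supports it, an entry with name presence_penalty, type number, and value 0.6 would end up like this:

```python
# Standard request body fields...
payload = {
    "model": "gpt-3.5-turbo",
    "messages": [{"role": "user", "content": "Hello"}],
}

# ...plus the custom parameter (name: presence_penalty, type: number, value: 0.6).
# The provider must actually support the field for it to have any effect.
payload["presence_penalty"] = 0.6
```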
The Agents page is an assistant marketplace where you can select or search for desired model presets. Click on a card to add the assistant to your conversation page's assistant list.
You can also edit and create your own assistants on this page.
Click My, then Create Agent, to start building your own assistant.
The drawing feature currently supports painting models from DMXAPI, TokenFlux, AiHubMix, and SiliconFlow. You can register an account at SiliconFlow and add it as a provider to use this feature.
For questions about parameters, hover your mouse over the ? icon in the corresponding area to view a description.
Note: Gemini image generation must be used in the chat interface because Gemini performs multi-modal interactive image generation and does not support parameter adjustment.
Cherry Studio's translation feature provides you with fast and accurate text translation services, supporting mutual translation between multiple languages.
The translation interface mainly consists of the following components:
Source Language Selection Area: selecting Any Language lets Cherry Studio automatically identify the source language and translate it.
Target Language Selection Area: a dropdown menu for selecting the language you want the text translated into.
Settings Button: opens the Default Model Settings.
Scroll Synchronization: a toggle that enables scroll sync (scrolling either pane scrolls the other).
Text Input Box (Left): enter or paste the text you need to translate.
Translation Result Box (Right): displays the translated text.
Copy Button: click to copy the translation result to the clipboard.
Translate Button: click to start the translation.
Translation History (Top Left): click to view past translation records.
Select Target Language: choose the desired language in the Target Language Selection Area.
Input or Paste Text: enter or paste the text to be translated in the left text input box.
Start Translation: click the Translate button.
View and Copy Results: the translation appears in the right result box; click the copy button to copy it to the clipboard.
Q: What to do about inaccurate translations?
A: While AI translation is powerful, it's not perfect. For professional fields or complex contexts, manual proofreading is recommended. You may also try switching different models.
Q: Which languages are supported?
A: Cherry Studio translation supports multiple major languages. Refer to Cherry Studio's official website or in-app instructions for the specific supported languages list.
Q: Can entire files be translated?
A: The current interface primarily handles text translation. For document translation, please use Cherry Studio's conversation page to add files for translation.
Q: How to handle slow translation speeds?
A: Translation speed may be affected by network connection, text length, or server load. Ensure stable network connectivity and be patient.
For knowledge base usage, please refer to the Knowledge Base Tutorial in the advanced tutorials.
Quick Assistant is a convenient tool provided by Cherry Studio that allows you to quickly access AI functions in any application, enabling instant operations like asking questions, translation, summarization, and explanations.
Open Settings: Navigate to Settings -> Shortcuts -> Quick Assistant.
Enable Switch: Find and toggle on the Quick Assistant button.
Set Shortcut Key (Optional):
Default shortcut for Windows: Ctrl + E
Default shortcut for macOS: ⌘ + E
Customize your shortcut here to avoid conflicts or match your usage habits.
Activate: Press your configured shortcut key (or default shortcut) in any application to open Quick Assistant.
Interact: Within the Quick Assistant window, you can directly perform:
Quick Questions: Ask any question to the AI.
Text Translation: Input text to be translated.
Content Summarization: Input long text for summarization.
Explanation: Input concepts or terms requiring explanation.
Close: Press ESC or click anywhere outside the Quick Assistant window.
Shortcut Conflicts: Modify shortcuts if defaults conflict with other applications.
Explore More Functions: beyond the documented features, Quick Assistant may support operations such as code generation and style transfer; keep exploring as you use it.
Feedback & Improvements: Report issues or suggestions to the Cherry Studio team via feedback.
This page only introduces the interface functions. For configuration tutorials, please refer to the Provider Configuration tutorial in the basic tutorials.
In Cherry Studio, a single service provider supports multi-key round-robin usage, where the keys are rotated from first to last in a list.
Add multiple keys separated by English commas, as shown in the example below:
sk-xxxx1,sk-xxxx2,sk-xxxx3,sk-xxxx4
You must use English (half-width) commas.
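Cherry Studio handles the rotation for you; conceptually, round-robin use of several keys works like the sketch below (illustration only, not the actual implementation):

```python
from itertools import cycle

# Keys as entered in the provider settings, separated by English commas.
keys = "sk-xxxx1,sk-xxxx2,sk-xxxx3,sk-xxxx4".split(",")
key_cycle = cycle(keys)

# Each request takes the next key in order and wraps back to the first.
for request_number in range(6):
    print(f"request {request_number} uses {next(key_cycle)}")
```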
When using built-in service providers, you generally do not need to fill in the API address. If you need to modify it, please strictly follow the address provided in the official documentation.
If the address provided by the service provider is in the format https://xxx.xxx.com/v1/chat/completions, you only need to fill in the root address part (https://xxx.xxx.com).
Cherry Studio automatically appends the remaining path (/v1/chat/completions). Failing to enter the address as required may prevent the provider from working correctly.
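In other words, you enter only the root address and the client appends the fixed path; roughly (illustration only):

```python
root_address = "https://xxx.xxx.com"  # what you enter in the API address field
full_endpoint = root_address.rstrip("/") + "/v1/chat/completions"  # what is actually requested
print(full_endpoint)  # https://xxx.xxx.com/v1/chat/completions
```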
Typically, clicking the Manage button in the bottom-left corner of the provider configuration page automatically fetches all models supported by that provider; click the + sign next to a model in the fetched list to add it to your model list.
Click the check button next to the API Key input box to test if the configuration is successful.
After successful configuration, make sure to turn on the switch in the upper right corner, otherwise, the service provider will remain disabled, and the corresponding models will not be found in the model list.
When the assistant does not have a default assistant model configured, the model selected by default for new conversations is the one set here. The model used for prompt optimization and word-selection assistant is also configured in this section.
After each conversation, a model is called to generate a topic name for the dialog. The model set here is the one used for naming.
The translation feature in input boxes for conversations, painting, etc., and the translation model in the translation interface all use the model set here.
The model used for the quick assistant feature. For details, see Quick Assistant.
On this page, you can set the software's color theme, page layout, or customize CSS for personalized settings.
Here you can set the default interface color mode (light mode, dark mode, or follow system).
These settings are for the layout of the chat interface.
When this setting is enabled, clicking on the assistant's name will automatically switch to the corresponding topic page.
When enabled, the creation time of the topic will be displayed below it.
This setting allows flexible customization of the interface. For specific methods, please refer to Custom CSS in the advanced tutorial.
This interface allows for local and cloud data backup and recovery, local data directory inquiry and cache clearing, export settings, and third-party connections.
Currently, data backup supports three methods: local backup, WebDAV backup, and S3-compatible storage (object storage) backup. For specific introductions and tutorials, please refer to the following documents:
Export settings allow you to configure the export options displayed in the export menu, as well as set the default path for Markdown exports, display styles, and more.
Third-party connections allow you to configure Cherry Studio's connection with third-party applications for quickly exporting conversation content to your familiar knowledge management applications. Currently supported applications include: Notion, Obsidian, SiYuan Note, Yuque, Joplin. For specific configuration tutorials, please refer to the following documents: