# Modal Conversion

We support the following modal conversion:

* [**Text to Audio**](#text-to-audio): convert text input to audio output.
* [**Audio to Text**](#audio-to-text): convert audio input to text output.
* [**Image to text**](#image-to-text): convert any *text in image* to text output.&#x20;

## Text to Audio

This component is used to generated a voice output based on any text. You can choose the type of voice you like.

![](/files/qmhS6uII8qhVylcegqwW)

## Audio to Text

This component is used to generated a text output based on the user input audio.&#x20;

![](/files/HzkdfwXFvWwU7xZG7BX4)

## Image to Text

This module converts the text in an image to text format. The input should be an image, and the output will be a text string.&#x20;

![](/files/vCXW11IhyuIShknPLULW)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://botsquare.gitbook.io/botsquare/developer-guide/components/modal-conversion.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
