OOMOL Fusion API

Operate OOMOL Fusion API through your OOMOL-connected account. This skill calls the fusion-api connector with the oo CLI; OOMOL injects credentials server-side, so you never handle raw tokens.

Running an action

OOMOL Fusion API needs no account connection. Assume the oo CLI is installed and signed in. Do not run oo auth login proactively — just run the action. Fall back to First-time setup only when a command actually fails.

1. Inspect the contract to get the authoritative input/output schema before building a payload:

oo connector schema "fusion-api" --action "<action_name>"

2. Run the action with a JSON payload that matches the input schema:

oo connector run "fusion-api" --action "<action_name>" --data '<json>' --json

--data takes a JSON object string or @path/to/file.json; omit it to send {}.
The response is { "data": ..., "meta": { "executionId": "..." } }; the execution id lives under meta.executionId.

Each action is listed below with a one-line description; actions that change state carry a [write] or [destructive] tag. Before constructing --data, fetch the action’s live schema with oo connector schema to get its authoritative input fields.

Available actions

cosyvoice_tts_generate — Generate speech audio with CosyVoice TTS.
cphone_nano_banana_result — Get cphone-nano-banana task result
cphone_nano_banana_state — Get cphone-nano-banana task state
cphone_nano_banana_submit — Edit images with Nano Banana [write]
deepseek_ocr_recognize — Recognize text from a single image.
doubao_stt_result — Get doubao-stt task result
doubao_stt_state — Get doubao-stt task state
doubao_stt_submit — Convert speech to text with Doubao [write]
doubao_text_to_image_seedream_generate — Generate images from text with Doubao Seedream
doubao_tts_result — Get doubao-tts task result
doubao_tts_state — Get doubao-tts task state
doubao_tts_submit — Convert text to speech with Doubao [write]
fal_aura_sr_result — Get fal-aura-sr task result
fal_aura_sr_state — Get fal-aura-sr task state
fal_aura_sr_submit — Image super-resolution upscaling [write]
fal_nano_banana_2_result — Get fal-nano-banana-2 task result
fal_nano_banana_2_state — Get fal-nano-banana-2 task state
fal_nano_banana_2_submit — Generate or edit images with Nano Banana 2 [write]
fal_nano_banana_pro_result — Get fal-nano-banana-pro task result
fal_nano_banana_pro_state — Get fal-nano-banana-pro task state
fal_nano_banana_pro_submit — Edit images with Nano Banana Pro [write]
fal_nano_banana_result — Get fal-nano-banana task result
fal_nano_banana_state — Get fal-nano-banana task state
fal_nano_banana_submit — Edit images with Nano Banana [write]
fal_remove_background_result — Image with background removed [destructive]
fal_remove_background_state — Get fal-remove-background task state [destructive]
fal_remove_background_submit — Remove image backgrounds with the Fal.ai BRIA model [destructive]
file_upload_abort_multipart_upload — Abort multipart upload [write]
file_upload_complete_multipart_upload — Complete multipart upload [write]
file_upload_create_multipart_upload — Initialize multipart upload [write]
file_upload_generate_presigned_url — Generate a presigned URL for file upload [write]
file_upload_generate_presigned_urls — Generate a presigned URL for multipart upload parts [write]
jina_reader_read — Read and extract URL content
jina_reader_search — Search web content
oomol_tts_ng_generate — Generate speech audio
oomol_tts_result — Get oomol-tts task result
oomol_tts_state — Get oomol-tts task state
oomol_tts_submit — Convert text to audio [write]
openai_image_async_result — Get asynchronous OpenAI image generation result
openai_image_async_submit — Submit asynchronous OpenAI image generation requests, including gpt-image-2 image creation. [write]
openai_image_edit_async_result — Get asynchronous OpenAI image edit result [write]
openai_image_edit_async_submit — Submit asynchronous OpenAI image edit requests, including gpt-image-2 image editing. [write]
pdf_transform_epub_result — Get pdf-transform-epub task result
pdf_transform_epub_state — Get pdf-transform-epub task state
pdf_transform_epub_submit — Convert a PDF to EPUB format [write]
pdf_transform_markdown_result — Get pdf-transform-markdown task result
pdf_transform_markdown_state — Get pdf-transform-markdown task state
pdf_transform_markdown_submit — Convert a PDF to Markdown format [write]
qwen_asr_filetrans_result — Get Qwen ASR file transcription task result
qwen_asr_filetrans_state — Get Qwen ASR file transcription task state
qwen_asr_filetrans_submit — Submit a long-audio speech recognition task with Qwen ASR file transcription [write]
qwen_doc_turbo_analyze — Analyze documents with Tongyi Qianwen
qwen_image_edit_plus_edit — Edit images with Tongyi Wanxiang [write]
qwen_image_generate — Generate images with Tongyi Qianwen
qwen_mt_image_result — Get qwen-mt-image task result
qwen_mt_image_state — Get qwen-mt-image task state
qwen_mt_image_submit — Image machine translation [write]
qwen_tts_generate — Generate speech audio
qwen_voice_cloning_create — Create a Qwen voice cloning voice and return a preview audio URL. [write]
qwen_voice_design_create — Create a Qwen voice design voice and return preview audio. [write]
qwen_voice_design_delete — Delete a Qwen voice design voice. [destructive]
qwen_voice_design_generate — Generate speech audio using a Qwen voice design voice.
qwen_voice_design_list — List Qwen voice design voices.
qwen_voice_design_query — Get details for a Qwen voice design voice.
seedance_video_result — Query a Seedance video generation task result
seedance_video_state — Get a Seedance video generation task state
seedance_video_submit — Generate videos with Doubao Seedance [write]
text_to_epub_illustrate_generate — Generate illustrations for EPUB content
tinify_png_shrink_compress — Compress PNG images with the Tinify API
wanx_i2v_video_result — Query first-frame-to-video task result
wanx_i2v_video_state — Get wanx-i2v-video task state
wanx_i2v_video_submit — Generate videos from the first frame with Tongyi Wanxiang [write]
wanx_image_result — Get wanx-image task result
wanx_image_state — Get wanx-image task state
wanx_image_submit — Generate and edit images with Tongyi Wanxiang AI [write]
wanx_kf2v_video_result — Get wanx-kf2v-video task result
wanx_kf2v_video_state — Get wanx-kf2v-video task state
wanx_kf2v_video_submit — Generate videos from keyframes with Tongyi Wanxiang [write]
wanx_r2v_video_result — Query reference-to-video task result
wanx_r2v_video_state — Get wanx-r2v-video task state
wanx_r2v_video_submit — Generate videos from reference assets with Tongyi Wanxiang [write]
wanx_t2v_video_result — Query text-to-video task result
wanx_t2v_video_state — Get wanx-t2v-video task state
wanx_t2v_video_submit — Generate text-to-video videos with Tongyi Wanxiang [write]

Safety

Untagged actions are reads (get / list / search) — safe to run directly.
Actions tagged [write] change OOMOL Fusion API state — confirm the exact payload and effect with the user before running.
Actions tagged [destructive] remove or overwrite data — always confirm the target and get explicit approval first.

First-time setup

These are one-time steps — do not repeat them on every call. Run a step only when a command fails for the matching reason.

oo: command not found — install the oo CLI (other platforms: https://cli.oomol.com/install-guide.md):

curl -fsSL https://cli.oomol.com/install.sh | bash    # macOS / Linux

irm https://cli.oomol.com/install.ps1 | iex           # Windows PowerShell

Not signed in / authentication error — sign in to your OOMOL account once:
```
oo auth login
```
HTTP 402 / OOMOL_INSUFFICIENT_CREDIT — billing stop. Recharge at https://console.oomol.com/billing/token-recharge before retrying.

Resources

OOMOL Fusion API homepage: https://www.oomol.com

OOMOL Fusion API

SKILL.md

OOMOL Fusion API

Running an action

Available actions

Safety

First-time setup

Resources

Choose your cookie preferences