描述
QuikVox AI is a powerful WordPress assistant designed to streamline your podcast production workflow. By leveraging the advanced capabilities of Google Gemini AI, it allows you to generate high-quality scripts from your existing content and convert them into natural-sounding audio—all without leaving your WordPress dashboard.
Whether you are an AI news blogger or a content creator looking to expand into audio, QuikVox AI provides the tools to automate the tedious parts of scriptwriting and voice generation.
Key Features
- AI-Powered Script Generation: Automatically extract content from your posts/pages and generate professional podcast scripts using Google Gemini (Flash/Pro/Flash-Lite models).
- Natural AI Voices: Convert scripts into audio using the latest Gemini 2.5 TTS (Generative Audio) models. Choose from 15+ high-quality voices with distinct characteristics.
- Multilingual Support: Generate content in 5 major languages: Japanese, English, Chinese (Simplified/Traditional), and Korean.
- Prompt Management: Save and manage custom prompt sets for different podcast styles. Includes an AI translation tool to help you expand your prompts globally.
- Seamless Media Integration: Generated audio files are automatically saved to your WordPress Media Library and can be embedded directly into your articles via a simple audio player.
- Post & Page Support: Works with both standard Posts and Pages, allowing you to turn any content into audio.
Advanced Voice Selection
Unlike basic TTS plugins, QuikVox AI provides detailed metadata for each voice:
* Gender Identification: Clear male/female labels.
* Tone Characteristics: Voices are tagged with their unique style (e.g., “Warm, deep, informative” or “Soft, narrating”).
* Dynamic UI: The editor dropdown shows icons and descriptions so you can find the perfect voice for your persona.
External Services
This plugin connects to an external service:
- Google Gemini API
– Purpose: Generate podcast scripts from post content and convert text to speech
– Data sent: Post content and optional user prompts
– Service provider: Google
– Privacy Policy: https://policies.google.com/privacy
– Terms of Service: https://policies.google.com/terms
Users must provide their own API key to use this feature. Script and voice generation data is sent only when the user triggers generation actions.
- QuikVox AI License Verification Service
– Endpoint: https://quikvox-ai.com/license/verify
– Purpose: Verify license keys and refresh plan/status information
– Data sent: License key, site URL, home URL, and plugin version
– Service provider: QuikVox AI
– Triggered only when:
– an administrator saves or activates a license key
– an administrator clicks the “License Recheck” button
– the QuikVox AI Settings page is opened and the scheduled next check time has passed
No license verification request is sent from normal front-end page views, post views, or general admin screens.
安装
- Upload the
quikvox-aifolder to the/wp-content/plugins/directory. - Activate the plugin through the ‘Plugins’ menu in WordPress.
- Navigate to Settings > QuikVox AI to enter your Google Gemini API Key and configure the main plugin settings.
- Use the QuikVox AI top-level menu for Talk Scripts and Audio Analytics.
- Optionally, configure Your Voice Generation API Key to enable TTS.
- Start creating by clicking “Create Podcast Script” in the sidebar of any Post or Page editor.
- If you need to issue a test license, run
scripts/issue-license.ps1from PowerShell and provideLICENSE_SIGNING_SECRETat runtime. You can also pass-IssuedDateto set the issue date explicitly. The script generates a Business or Pro license key without storing the secret in the repository.
常见问题
-
Where do I get a Gemini API Key?
-
You can obtain an API key from the Google AI Studio.
-
Are the audio files hosted locally?
-
Yes, generated audio files (WAV format) are saved directly into your
wp-content/uploadsdirectory and registered in your Media Library for full ownership. -
Does it support multi-speaker podcasts?
-
The current version supports single-speaker script generation. Multi-speaker support (dialogue) is available as a Pro feature.
评价
此插件暂无评价。
贡献者及开发者
更新日志
1.0.8
- Reworked the admin menu structure so QuikVox AI appears as a top-level menu with Talk Scripts and Audio Analytics beneath it.
- Kept the main settings page under the WordPress Settings menu and aligned the admin UI with WordPress conventions.
- Improved the admin settings screen by removing inline JavaScript from core controls and tightening settings sanitization.
1.0.7
- Revised Smart Tone admin UI to match Prompt Sets behavior more closely.
- Restored default-star indicators in the Smart Tone list and removed the separate default summary cards.
- Simplified Smart Tone row actions so built-in styles use View/Copy and custom styles use Edit/Delete/Copy as appropriate.
- Added read-only Smart Tone view mode and blocked direct edit/delete operations for built-in styles.
- Documented the current built-in single-speaker Prompt Sets in docs/prompt-sets-single-ja.md.
1.0.6
- Reworked the workflow sidebar and modal layout to clarify selection, generation, and embedding steps.
- Added and reorganized planning docs for roadmap, task tracking, and release context.
- Removed generated pycache artifacts from the repository and ignored future Python cache files.
1.0.5
- Refined the script generator sidebar UI for single and multi-speaker workflows.
- Simplified generation progress popups for script and voice creation.
- Added docs for mockup organization and audio chunk loudness tracking.
1.0.4
- Security: Masked License Key input field with eye icon toggle to prevent credential exposure.
- Security: Masked Service Account JSON (Vertex AI) with blur filter and eye icon toggle.
- Security: Masked Google AI Studio Script Generation API Key with eye icon toggle.
- Security: Masked Google AI Studio Voice Generation API Key with eye icon toggle.
- Fix: Default Gemini model fallback updated from deprecated
gemini-protogemini-2.5-proto resolve 404 errors on script generation. - Fix: Sidebar model fallback list updated to current Gemini 2.5 series.
- Fix: Plugin Check — NonceVerification warnings resolved for redirect notification flags.
- Fix: Plugin Check — Added
wp_unslash()and sanitization to Smart Tone config and auth JSON inputs. - Fix: Plugin Check — Wrapped
error_log()inWP_DEBUGguard (Vertex AI error handler). - Fix: Plugin Check — Replaced
esc_url()withesc_url_raw()for input sanitization in audio URL handler. - Fix: Plugin Check —
stable_tag_mismatchresolved. - Fix: Plugin Check —
plugin_header_nonexistent_domain_pathresolved by creatinglanguages/directory. - Chore: Added
.distignoreto exclude development files from distribution packages. - UI: Updated official website link to https://quikvox-ai.com/.
- UI: Removed redundant “Uses Global Endpoint” label from Vertex AI model selector (behavior is automatic).
1.0.2
- Security: Removed internal API response body from client-facing error messages (Vertex AI TTS and Gemini API).
- Security: Added model ID format validation (regex) for TTS model parameter in voice generation handler.
- Security: Applied input sanitization to Smart Tone text handler for consistency.
- Security: Fixed IDOR vulnerability by adding post read permission check before script generation.
- Security: Replaced unsafe HTML rendering pattern in React component with regex-based tag stripping.
- Security: Corrected URL escaping function to use HTML-context-appropriate method in audio insert handler.
- Security: Suppressed internal URL and model details from client-facing Vertex AI error messages.
- Security: Added Service Account JSON format validation on settings save.
- Security: Removed project_id disclosure from Vertex AI connection test AJAX response.
- UI: Switched connection test status display to textContent to prevent potential HTML injection.
- Removed debug loading log from production script bundle.
1.0.1
- Fixed AI Translate issue in Prompt Sets by improving Gemini 2.5/Thinking model support.
- Implemented comprehensive API response parsing to handle thinking blocks and Markdown code fences.
- Extended API timeout to 60 seconds.
- Removed
response_mime_type: 'application/json'to avoid conflicts with newer Gemini models.
1.0.0
- Official stable release.
- Updated plugin versioning to 1.0.0.
- Verified plan-based features and UI consistency.
0.8.1
- (Previous entries)
- Implemented Vertex AI integration for both script and voice generation.
- Added support for Cloud Text-to-Speech via Vertex AI (MP3 output).
- Implemented text chunking for TTS to handle Gemini TTS byte limits (512 bytes) and timeout issues.
- Improved error handling with a selectable/copyable error modal in the editor.
- Added dynamic download labels (MP3 vs WAV) in the sidebar.
- Fixed endpoint routing for Vertex AI preview models (locations/global).
0.7.8
- Renamed plugin to QuikVox AI (slug: quikvox-ai).
- Migrated all inline scripts/styles to wp_enqueue (admin.js / admin.css).
- Fixed i18n: text domain unified to quikvox-ai and missing 2nd args added.
- Added ABSPATH guards to all PHP files.
0.7.7
- Code consistency improvements for WordPress.org standards.
0.7.6
- WordPress.org submission preparation.
- Removed ElevenLabs API integration (Gemini TTS only).
- Added External Services disclosure section.
- Security improvements: Enhanced nonce verification and data sanitization.
- Code cleanup for WordPress.org compliance.
0.7.5
- Official preparation for WordPress.org directory submission.
- Updated Gemini 2.5 TTS (GA) model support.
- Added support for Gemini 1.5 Flash-Lite.
- Enhanced Voice metadata (Name, Gender, Characteristics) in UI.
- Improved buttons: Scripts can now be embedded directly under the player.
- General UI/UX polishing for the admin settings page.
0.5.0
- Initial beta release with basic script generation and TTS support.