What is the structure of the AI Voice ecosystem and how do different players contribute to deliver AI Voice and Intelligent Engagement? This extract from CPaaSAA’s report AI Voice: Who Will Run The Conversation? is the first in a series of four related articles covering ecosystem roles, telco-native AI Voice, customer buying behaviour, and competitive scenarios. You can download the full report here.
A multi-layered ecosystem
AI Voice does not sit within a single product category. It is delivered through an ecosystem of providers spanning communications platforms, cloud infrastructure, enterprise software, and telecom networks.
The report identifies a broad set of roles:
- Communications platforms (CPaaS, CCaaS, UCaaS)
- Hyperscalers and AI providers
- Telecom operators
- CRM and enterprise workflow platforms
- Specialist AI and voice technology providers
Each plays a different role in enabling conversations, processing data, and delivering outcomes.

The ecosystem diagram shows how these players interact across delivery, assembly and component layers. It illustrates:
- Components: AI models, speech technologies
- Assembly: orchestration and conversational platforms
- Delivery: enterprise integration and operational deployment
This matters because value is not evenly distributed. Control depends on which layer a player occupies.
Headline insight: AI Voice is not a single market; it is a stack where control of orchestration defines value.
How the ecosystem fits together
At a functional level:
- AI components (ASR, TTS, LLMs) provide capability
- Platforms assemble these into usable services
- Enterprise systems connect them to workflows and data
- Networks deliver and increasingly embed these capabilities
This creates both interdependence and competition. For example:
- Hyperscalers supply models but also build platforms
- CPaaS players assemble services but depend on components
- CRM platforms integrate outcomes but may move into orchestration
Stakeholder implications
For CPaaS and engagement platforms:
- Differentiation depends on orchestration and integration
For hyperscalers:
- Moving from components to platforms increases control
For enterprises:
- Supplier choice determines where control of execution sits
What’s next?
This article has outlined the ecosystem structure.
The next article will examine Telco Network AI Voice, including the emerging concept of “AI in every call”.
You can download the full report here.

Andrew Collinson
Andrew Collinson is a telecoms and connected technologies expert, specializing in growth strategy, research, and thought leadership. As founder of Connective Insights, he helps clients translate new technologies into viable business models, with a focus on CPaaS, APIs, platform strategies, AI, and network automation.
Before joining CPaaSAA as Associate Research Partner, Andrew was Research and Commercial Director at STL Partners for 15 years, leading a successful research business. He also moderates events, conducts bespoke research, and advises on telecom innovation, stakeholder dynamics, and digital transformation.

Comments are closed