There are a lot of reasons for horrible complexity.... 1. There are many ways to roll out a video conferencing system because the hardware is all commodity. Many companies will have their IT department buy all the hardware and try to deploy it using the built in methodologies for connecting conference rooms. This is usually when everyone is given a 16 digit pin number to dial into a room, this causes the dial in nightmares. There are also 3rd parties that will basically sell the same solution as the "custom IT solution" as a service. 2. If you want a decent UI experience you move from snapping commodity hardware together to building a software engineering team, software release pipeline, software updates, and all the stuff that comes along with it. This is usually done by a third party but it still costs more than the commodity solution which it is priced against 3. Once you get a software engineering team involved they have to support the different types of hardware out in production. Most companies have generations of conferencing hardware installed with different configurations throughout the company and do not want to pay for all new hardware. Now your software has to be able to implement screen sharing with 5 different codecs and all those codecs have different bugs and different communication protocols. 4. HP tried to solve this by building their own codecs and forcing all rooms to deploy with the same configurations, but this ended up being the most expensive video conferencing system in the world. It also had the best quality and best reliability.
So yes the cost to building a good ui and less complex system can be way more expensive.
So yes the cost to building a good ui and less complex system can be way more expensive.