The domestic epidemic has come to an end, and the number of audio and video interactive apps incubated due to the epidemic has seen a blowout growth. The communication scene has attracted the attention of capital, and the market continues to rise. The demand for IM and RTC capabilities is also growing in areas such as online education, entertainment and social networking, and live streaming. However, for developers, the first task is to choose audio and video communication vendors. Good communication vendors not only increase the development efficiency, but also greatly reduce the operation and service costs after the launch of the product.

In order to solve the actual needs of developers, anyRTC constantly improves its real-time audio and video products, providing developers with the “multi-channel audio and video interaction” technology, which can support up to 49 channels of real-time audio and video interaction, with a maximum audience of 100W people, to meet the needs of developers to quickly build multi-person audio and video interaction scenes. In order to adapt to the diversity of scenarios, we launched an integrated solution of “IM instant messaging + real-time audio and video + cloud recording”, “to solve all communication scenarios with a set of SDK”, to meet the needs of high-quality communication in various scenarios, and become the leader of the next generation OF RTC.

Next generation real-time audio and video

AnyRTC around “low delay, high performance, high concurrency” to achieve “full scene, one network, low delay, full fusion. Set up an AI audio and video lab to continuously optimize perceptual codecs, intelligent transmission networks and voice and video enhancement technologies. The AI technology is perfectly combined with real-time audio and video, constantly challenging the historical problems in the audio and video field, turning the impossible into the possible, and making audio and video communication like air.

Current real-time audio and video formats

IOT field: The rise of smart home, smart wear, smart park concepts, people are more and more dependent on smart devices, at the same time accelerate the development of this field, but also actively adapt to this scene, in addition to embedded Android, Linux, but also support real-time operating system RTOS.

Interactive live broadcasting: show live broadcasting, PK live broadcasting, chat room, online KTV, interactive classes, and other scenes of penetration, interactive live broadcasting has become a social field, the field of education technology vane, gameplay is around interactive live broadcasting, The live broadcasting mode in the anyRTC SDK should be the scene.

Real-time communication: in the large-scale application of strangers making friends, small classes, video conferences and other scenarios, real-time communication has become the standard of various applications, which can effectively solve the problem of low communication efficiency, but also increase the application characteristics.

What are the application scenarios of real-time audio and video

online education

Online education is currently in a relatively mature stage. Due to the different needs of different student groups, abundant online teaching modes are spawned. Our real-time audio and video can support interactive small class, one-to-one tutoring, large interactive live class, two-teacher class, music teaching and other full-scene online teaching modes. Online piano practice partner

Online piano practice partner. At the same time, the teacher can also play the piano to demonstrate and practice for the students. The teacher and the students push their audio and video streams to the real-time transmission network respectively, and then pull each other’s audio and video streams to the terminal to watch, forming a video conversation scene. Other students can watch the teachers and students in class from the CDN side pull stream, which is the scene of one-to-one online piano training partner.

Small classes and one-on-one tutoring scenarios

Small-class classes and one-to-one tutoring scenarios focus on ensuring the quality experience of teachers and students in class. The class is stable and smooth without delay. At present, our real-time audio and video can achieve global end-to-end delay of less than 300ms and minimum delay of 62ms, meeting the needs of ultra-low delay. At the same time, the real-time audio and video service can flexibly control the subscription and publication of audio and video stream, so that the teacher can arbitrarily choose a single or multiple students for classroom q&A interaction; In addition, as an important tool for teachers to teach and explain key points, we can provide interactive whiteboards and high-definition screen sharing to meet the needs of teacher-student interaction.

Large live interactive courses and dual-teacher classes

In large live interactive courses and dual-teacher classes, stable carrying capacity and low-delay interactive experience are mainly guaranteed under QPS. First of all, we adopt a decentralized multi-point distributed global architecture, which supports unlimited students to be online at the same time and supports hundred-million-level concurrency. Secondly, the global acceleration network is utilized to allow multiple users to access nearby nodes, and the delay caused by network transmission is reduced directly through the way of special line cascade, so as to realize the low-delay experience of teachers and students’ remote audio and video interaction with the mic.

Pan-entertainment social

Pan-entertainment social network provides a complete set of solutions including audio and video mic, audience live, mic location management, interactive chat room, suitable for pan-entertainment live, voice chat room, dating, audio and video calls and other scenarios.

In the pan-entertainment live broadcasting scene, the interactive live broadcasting with low delay is the live streaming based on RTC technology, which does not rely on CDN. The delay between the host and audience is about 300ms, and it mainly serves the scenes of multi-person interaction, such as live broadcasting with goods, live broadcasting with microphone, chat and social gaming, etc. Provide high quality guarantee for the real-time interaction between host end and audience end with no perceptual delay. Different from RTC, CDN, another technology that implements audio and video capabilities, tends to delay live broadcasts in 3-5 seconds.

In pan-entertainment live broadcast, there are often multi-mic live broadcast scenes, that is, multi-person live broadcast interactive scenes. We provide two-way audio and video call capabilities for the microphone, so that the audience can clearly watch the confluent live broadcast. At the same time, through THE IM channel to realize the wheat, wheat, grab wheat, round wheat, hold wheat, ban wheat and other wheat management, so that the owner of the broadcast room better management. In addition, through the customization of chat room attributes, provide a variety of storage modes of Key and Value, real-time record client and server user status, real-time update, to ensure the smooth and no lag of multi-mic live.

In more fields of RTC application real estate services, it can help customers to realize VR live video viewing, and use IM signaling channel to realize multi-terminal VR synchronous house viewing; By using real-time audio and video technology, the user can connect with the customer manager in real time, so that the customer manager can watch the video remotely and explain it synchronously.

Online medical remote consultation, help developers to achieve multi-party consultation, upload medical records, medical discussion, background recording and other functions; Online consultation provides interactive whiteboard, time charging, IM communication and high-definition multi-party audio and video, etc.

Collaborative office and video conference of enterprise communication; Financial services, such as remote face-to-face signing, video customer service, involving Internet communication, are our coverage.

What are the technical requirements?

Low latency (within 300ms), hd low latency, high availability of the product, can not affect the entire experience. Such as piano practice, especially the need to hd music mode, video resolution to achieve 720 p, aimed at the high frequency sound of instrumental music and weak tone scale optimized, and the sound of music USES the whole channel sampling, stereo, the audio sampling rate 48 KHZ, audio bit rate to more than 100 KBPS, noise reduction and echo cancellation height reduction with AI music details.

conclusion

In the actual scene, anyRTC real-time audio and video services include video live broadcasting, online education, video social networking, game voice, Internet of Things, family care and other industries. On the stability, reliability, and communication interface coordination, engine through the underlying the research of audio and video technology, the global network coverage of more than 200 + BGP nodes, and accumulated many years of audio and video technology, can ensure the quality of the performance is better, to RTC + IM dual ability for developers to provide better service, more quickly to build a low latency, high quality, communication ability, Realize the whole industry and scene coverage of audio and video communication.