Ping An Insurance (Group) Company of China has filed a patent for a system and method for multimodal video segmentation in a multi-speaker scenario. The technology segments video transcripts into sentences, detects speaker changes based on audio or visual content, and segments the video into clips accordingly. GlobalData’s report on Ping An Insurance (Group) Company of China gives a 360-degree view of the company including its patenting strategy.
According to GlobalData’s company profile on Ping An Insurance (Group) Company of China, digital lending was a key innovation area identified from patents. Ping An Insurance (Group) Company of China's grant share as of January 2024 was 19%. Grant share is the ratio of the number of grants to the total number of patents.
Multimodal video segmentation system for multi-speaker scenario
The patent application (Publication Number: US20240020977A1) describes a system for multimodal video segmentation in a multi-speaker scenario. The system includes a memory to store instructions and a processor to execute these instructions. The process involves segmenting a video transcript with multiple speakers into sentences, detecting speaker changes based on audio or visual content, and segmenting the video into clips accordingly. The processor predicts punctuation, timestamps, and speaker change probabilities using acoustic and visual features, neural networks, and face identification techniques.
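As a rough illustration of the first step, the sketch below splits a timestamped transcript into sentences wherever sentence-ending punctuation has been predicted. It is not taken from the patent application: the `Token` and `Sentence` types, the `punct` field, and the `split_sentences` function are hypothetical names, and the punctuation predictions are assumed to come from an upstream model that is not shown.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Token:
    """One transcript word with its time span (hypothetical structure)."""
    text: str
    start: float  # seconds
    end: float    # seconds
    punct: Optional[str] = None  # punctuation predicted after this word, if any


@dataclass
class Sentence:
    text: str
    start: float
    end: float


# Assumption: these marks are treated as sentence boundaries.
SENTENCE_END = {".", "?", "!"}


def _make_sentence(tokens: List[Token]) -> Sentence:
    # Re-attach predicted punctuation and take the span of first/last word.
    words = [t.text + (t.punct or "") for t in tokens]
    return Sentence(" ".join(words), tokens[0].start, tokens[-1].end)


def split_sentences(tokens: List[Token]) -> List[Sentence]:
    """Group tokens into sentences at predicted sentence-ending punctuation."""
    sentences: List[Sentence] = []
    current: List[Token] = []
    for tok in tokens:
        current.append(tok)
        if tok.punct in SENTENCE_END:
            sentences.append(_make_sentence(current))
            current = []
    if current:  # trailing words with no closing punctuation
        sentences.append(_make_sentence(current))
    return sentences
```

Each resulting sentence carries a start and end time, so later steps can test for a speaker change at every sentence boundary.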
Furthermore, the system uses a convolutional neural network for binary classification, cross-scene face re-identification, and speech probability calculations to determine speaker change probabilities. The processor tokenizes the transcript, combines speaker change probabilities, and determines clip boundaries based on segmentation probabilities. The application also covers a corresponding method and computer-readable storage medium for segmenting the video based on speaker change information. Overall, the system segments videos with multiple speakers by drawing on both audio and visual cues, with the aim of improving the accuracy and efficiency of segmentation.
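The combination and boundary steps might look something like the sketch below. The article does not say how the application combines the audio and visual probabilities or how it picks boundaries, so the weighted average, the fixed threshold, and the function names `combine_probabilities` and `clip_boundaries` are all assumptions made for illustration.

```python
from typing import List, Tuple


def combine_probabilities(
    audio: List[float], visual: List[float], audio_weight: float = 0.5
) -> List[float]:
    """Blend per-boundary speaker change probabilities.

    Assumption: a simple weighted average; the actual combination rule
    is not described in the article.
    """
    if len(audio) != len(visual):
        raise ValueError("audio and visual lists must have the same length")
    return [audio_weight * a + (1.0 - audio_weight) * v
            for a, v in zip(audio, visual)]


def clip_boundaries(
    sentence_spans: List[Tuple[float, float]],
    change_probs: List[float],
    threshold: float = 0.5,
) -> List[Tuple[float, float]]:
    """Cut the video into clips where a speaker change is likely.

    sentence_spans: (start, end) in seconds for each sentence, in order.
    change_probs[i]: probability of a speaker change between sentence i
    and sentence i + 1.
    """
    if not sentence_spans:
        return []
    if len(change_probs) != len(sentence_spans) - 1:
        raise ValueError("need one probability per gap between sentences")

    clips: List[Tuple[float, float]] = []
    clip_start = sentence_spans[0][0]
    for i, prob in enumerate(change_probs):
        if prob >= threshold:
            # Close the current clip at the end of sentence i and
            # open the next one at the start of sentence i + 1.
            clips.append((clip_start, sentence_spans[i][1]))
            clip_start = sentence_spans[i + 1][0]
    clips.append((clip_start, sentence_spans[-1][1]))
    return clips
```

For example, with four sentences and a high combined probability only at the second boundary, the function returns two clips, each spanning two sentences.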