SDI over IP and VSF & Sumavision

Date: 2021-10-11 Source: Suma Tech

1. Background of TV Station/Studio FULL IP Infrastructure

With the development of business, the continuous expansion of new media, and the development of high-definition network production and broadcasting, the technical architecture based on SDI baseband video interface and dedicated SDI digital video router in the broadcasting has been difficult to meet the needs of expansion, and it is imperative to realize the integration of IP-based infrastructure and SDI technical architecture.

3G/6G/12G means 4-8 times more coaxial cable and some fiber optic proposals in SMPTE 208x
Advantage – leverages existing SDI technologies
Disadvantage – requires more coaxial cable (or fiber hybrid)

10 Gbe to 100 Gbe Ethernet topologies
Advantage – Uses COTS (common-off-the-shelf) Enterprise IT equipment
Disadvantage - More expensive, proprietary switching/packetizing. Generally, upgrade to Full IP at the TV station is mainly divided into three stages:

The first stage: IP-based transmission, which means IP-based compressed services bitrate control and audio/video encoding and transmission via IP protocol;

The second stage: IP transmission of uncompressed audio and video, centralization switching upgrade to Full IP, and the SDI router would be replaced;

The third stage: IP-based uncompressed audio and video throughout the entire process, that is, IP-based uncompressed audio and video in the entire process of acquisition, production, broadcast, storage, transmission, and distribution. Major changes will occur in program quality and production and broadcast processes.

At present, the first stage has been widely used in existing systems, and the standardization in the second stage has been made great progress. However, it still faces problems such as insufficient link bandwidth, system switching capacity, and few available equipment. The stage three requires more support of a stronger IT infrastructure.

IP divided into 2 camps

ASPEN (Adaptive Sample Picture Encapsulation)
AIMS (Alliance for IP Media Solutions)  
Video Services Forum’s Technical Recommendation TR-03/04
Imagine, Grass Valley, SAM, Lawo, Nevion etc.
Open Standards

2. Implementation of SDI over IP technology

In the past, the VIDEO over IP transmission that broadcasters broadcasted and produced in the TV center only used the IP network as a transmission channel, and did not carry out IP-based encapsulation of the video, nor involve the IP-based transmission of SDI signals. SMPTE ST2022 defines the IP packaging format and transmission standards for professional baseband video signals. To realize SDI over IP technology, SDI signals need to be digitized and networked, and also a complete network transmission and data error correction mechanism must be established.

Firstly, the SDI signal is digitized to form a media data segment, and then a data block file and an FEC error correction coded data block based on the SDI signal coded data block are formed according to SMPTE ST2022-6 and SMPTE ST2022-5, and then the two types of data blocks Encapsulated into IP data packets, transmitted via Ethernet. At the receiving end, perform IP unpacking, FEC decoding, data verification and error correction, data file serialization, etc., and finally complete the SDI signal recovery output.

In the past solutions at the IP layer, SDI signal encapsulation was not processed, and only the Ethernet network was used as a data transmission channel.

In the SMPTE ST2022 standard series, the IP-based transmission standard was defined, including:

SMPTE ST 2022-1:2007: Forward Error Correction for Real-Time Video/Audio Transport Over IP Networks
SMPTE ST 2022-2:2007: Unidirectional Transport of Constant Bit Rate MPEG-2 Transport Streams on IP Networks
SMPTE ST 2022-3:2010: Unidirectional Transport of Variable Bit Rate MPEG-2 Transport Streams on IP Networks
SMPTE ST 2022-4:2011: Unidirectional Transport of Non-Piecewise Constant Variable Bit Rate MPEG-2 Streams on IP Networks
SMPTE ST 2022-5:2012: Forward Error Correction for High Bit Rate Media Transport Over IP Networks
SMPTE ST 2022-6:2012: Transport of High Bit Rate Media Signals over IP Networks (HBRMT)
SMPTE ST 2022-7:2013: Seamless Protection Switching of SMPTE ST 2022 IP Datagrams (approval pending)
SMPTE 2059-1:2015: Generation and Alignment of Interface Signals to the SMPTE Epoch
SMPTE 2059-2:2015: SMPTE Profile for Use of IEEE-1588 Precision Time Protocol in Professional Broadcast Applications

SMPTE 2022/SMPTE 2059 Architecture
This architecture example shows an uncompressed SMPTE 2022-5/6 transmitter and receiver integrated with SMPTE 2059.

In the transmit path, it is assumed that the SDI input is synchronized to the SMPTE 2059clock.The SMPTE 20592 module generates timestamps. These timestamps are inserted into the media payload packets by the packetized.

In the receive path the SMPTE 2059-2 module is used for r timestamp and Pulse Per Second (PPS)generation.

In the SMPTE ST2110 standard series: ST2110 puts each part of the signal into a different system, video, audios and ANC all separately routable.

ST 2110-10: System Timing
ST 2110-20: Uncompressed Video Stream
ST 2110-21: Traffic Shaping
ST 2110-22: Compressed video Stream
ST 2110-30: Uncompressed Audio Stream
ST 2110-40: Ancillary Data (Metadata)
NMOS IS-04: Discovery and Registration
NMOS IS-05: Connection Management
NMOS IS-07: Event & Tally
NMOS IS-08: Audio Mapping
NMOS IS-09: System Parameters

Comparison between SMPETST 2022 and SMPTE ST2110 standard series:

ST 2110 and ST 2022-6 are both SDI over IP technologies.

ST 2022-6 is equivalent to the previous generation technology of ST 2110, and 2110 makes up for some of the shortcomings of 2022-6.

The main difference between the two is that ST 2022-6 simply encapsulates the SDI signal into an IP packet, and the video part, audio part, and auxiliary data part are all in the same packet; while ST 2110 puts the video in the SDI signal. The three major blocks of, audio, and auxiliary data are separated and encapsulated in different IP packets for transmission.

And also for bandwidth saved under ST2110 standard:

In a 2110 system, there are three streams running. The video, audio, and auxiliary data streams are transmitted separately, which saves the trouble of embedding/de-embedding just mentioned. Moreover, such transmission not only improves the scalability of the system, but the bandwidth limit of the system is also more flexible. But what follows is a higher requirement for synchronization: it is much more difficult to synchronize three different streams than a single stream in 2022-6, so 2110 uses PTP protocol and RTP timestamp and other methods to synchronize, which is also 2110 -10 content (SMPTE 2059).


Video Services Forum, Inc. (VSF) was founded on October 7, 1998 to support uniform delivery of video services across the separate RBOC’s after the Bellcore sponsored organization, the Video Services Industry Forum (VSIF) was disbanded. Today the VSF is an international association comprised of service providers, users and manufacturers dedicated to interoperability, quality metrics and education for video networking technologies. The organization's activities include:

Providing forums to identify issues involving the development, engineering, installation, testing and maintenance of video networking technologies. Exchanging non-proprietary information to promote the development of video networking technology and foster resolution of issues common to the video services industry. Promoting interoperability by contributing to and supporting development of standards by national and international standards bodies

SDI over IP and VSF & Sumavision

75+ Members from industry chain

Sumavision Technologies is the member of VSF and in participating in working groups:
Ground-Cloud-Cloud-Ground (GCCG)
J2K Activity Group
Interop Workshop Activity Group
Reliable Internet Stream Transport "RIST" Activity Group

Sumavision attended JOINT TASKFORCE ON NETWORK MEDIA- a self-coordinating group of industry bodies working together on the development of IP technologies for professional media systems. Admin Group Members include the Advanced Media Workflow Association (AMWA), the European Broadcasting Union (EBU), the Society of Motion Picture and Television Engineers (SMPTE©), and the Video Services Forum (VSF). This document was developed by an ad-hoc group of industry experts including both manufacturers and broadcasters to provide guidance and drive towards common solutions for the deployment of SMPTE ST 2110-based media networks.

Sumavision passed JT-NM inter-op testing and attended IBC NAB IP-Showcase, the test results are shown as below:





4. Sumavision Solution and Products

Sumavision fully IP solution combines baseband video and audio processing, mezzanine compression (JPEG-XS) video and audio, IP networking features into a single space-saving solution. With this solution and at ST2110/ST2059 standards, customers could choose functions based on specific application needs, spanning playout, production, video encoding, decoding, gateway, switching, and monitoring.
Sumavision-S010 SDI-IP Gateway

Standards based IP in Action (SMPTE, VSF, EBU, etc.)
Bidirectional transport of SD/HD/3G/12G-SDI video signal over 10/25/100 GigE links
Flexible 8K/4K/HD contents selection Interoperability (From legacy SDI systems to a virtualized, IP-based environment)
SMPTE ST2022-6, SMPTE ST2022-7, SMPTE ST2110/SMPTE ST2059, etc.
With JPEG-XS, you can begin producing over IP with high bandwidth today.
NMOS IS-04, IS-05 management*
Remote site connection

Sumavision-Media IP Gateway

1RU rack mount, dual hot-swappable power supplies
2 x 10-GbE and 4x USB ports, 5G network is supported
Multi-format MPEG-2/H.264/HEVC
Support channel bounding output (5G, WLAN, WIFI), maximum 4 channels output, 30Mbps per channel.
SMPTE 2110 Video over IP outputs
Support SRT /UDP/RIST, and transferring between TS over UDP and SRT/RIST
Cloudy Ready
Support IPv4

[2]: What is SMPTE ST2110 and Why Does It Matter? - John Mailhot
[3]: SMPTE ST 2110 in 60 Minutes - Wes Simpson