US9699522B2 - Method for transceiving media files and device for transmitting/receiving using same - Google Patents
Method for transceiving media files and device for transmitting/receiving using same Download PDFInfo
- Publication number
- US9699522B2 US9699522B2 US14/943,346 US201514943346A US9699522B2 US 9699522 B2 US9699522 B2 US 9699522B2 US 201514943346 A US201514943346 A US 201514943346A US 9699522 B2 US9699522 B2 US 9699522B2
- Authority
- US
- United States
- Prior art keywords
- pts
- media
- presentation time
- media presentation
- denotes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 47
- GUGNSJAORJLKGP-UHFFFAOYSA-K sodium 8-methoxypyrene-1,3,6-trisulfonate Chemical compound [Na+].[Na+].[Na+].C1=C2C(OC)=CC(S([O-])(=O)=O)=C(C=C3)C2=C2C3=C(S([O-])(=O)=O)C=C(S([O-])(=O)=O)C2=C1 GUGNSJAORJLKGP-UHFFFAOYSA-K 0.000 claims description 25
- 239000012092 media component Substances 0.000 claims description 17
- 238000001094 photothermal spectroscopy Methods 0.000 description 20
- 239000012634 fragment Substances 0.000 description 17
- 230000032258 transport Effects 0.000 description 16
- 230000001360 synchronised effect Effects 0.000 description 15
- 238000010586 diagram Methods 0.000 description 8
- 238000013507 mapping Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 6
- 230000011664 signaling Effects 0.000 description 6
- 230000003044 adaptive effect Effects 0.000 description 4
- AWSBQWZZLBPUQH-UHFFFAOYSA-N mdat Chemical compound C1=C2CC(N)CCC2=CC2=C1OCO2 AWSBQWZZLBPUQH-UHFFFAOYSA-N 0.000 description 4
- 239000000284 extract Substances 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 241000985610 Forpus Species 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000013500 data storage Methods 0.000 description 2
- 238000009877 rendering Methods 0.000 description 2
- 101000617289 Homo sapiens Tyrosine-protein phosphatase non-receptor type 9 Proteins 0.000 description 1
- 101100465559 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PRE7 gene Proteins 0.000 description 1
- 102100021722 Tyrosine-protein phosphatase non-receptor type 9 Human genes 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 101150076896 pts1 gene Proteins 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000007723 transport mechanism Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8456—Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/236—Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
- H04N21/23608—Remultiplexing multiplex streams, e.g. involving modifying time stamps or remapping the packet identifiers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/238—Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
- H04N21/2381—Adapting the multiplex stream to a specific network, e.g. an Internet Protocol [IP] network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/4302—Content synchronisation processes, e.g. decoder synchronisation
- H04N21/4307—Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/4302—Content synchronisation processes, e.g. decoder synchronisation
- H04N21/4307—Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
- H04N21/43072—Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/4302—Content synchronisation processes, e.g. decoder synchronisation
- H04N21/4307—Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
- H04N21/43076—Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of the same content streams on multiple devices, e.g. when family members are watching the same movie on different devices
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/438—Interfacing the downstream path of the transmission network originating from a server, e.g. retrieving encoded video stream packets from an IP network
- H04N21/4385—Multiplex stream processing, e.g. multiplex stream decrypting
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/45—Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
- H04N21/462—Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
- H04N21/4622—Retrieving content or additional data from different sources, e.g. from a broadcast channel and the Internet
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/85406—Content authoring involving a specific file format, e.g. MP4 format
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/8547—Content authoring involving timestamps for synchronizing content
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
- H04N21/23439—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements for generating different versions
Definitions
- the present disclosure relates to a method and apparatus for transmitting and receiving an MPEG-2 Transport Stream (TS)-based media file.
- TS MPEG-2 Transport Stream
- Digital Multimedia Broadcasting nowadays, digital broadcasting such as terrestrial, cable or satellite broadcasting, or Digital Multimedia Broadcasting (DMB) mostly streams AV (Audio/Video) contents using MPEG-2 TS.
- DMB Digital Multimedia Broadcasting
- a multimedia service which provides contents over an Internet Protocol (IP) network as a main transport network, is activated according to a rapid growth of the internet.
- IP Internet Protocol
- the digital broadcasting is developing towards the direction of requiring more traffic like stereoscopic 3-dimensional (3D) video broadcasting, Ultra High Definition (UHD) broadcasting, 3D multi-view video broadcasting or hologram broadcasting.
- Embodiments provide a method and apparatus for transmitting and receiving an MPEG-2 TS-based media file for playing the same back in synchronization with another media.
- a method of transmitting an MPEG2 TS-based media file includes: mapping a Presentation Time Stamp (PTS) of a Presentation Unit (PU) to a Media Presentation Time (MPT) for synchronization with other media with respect to at least one program element forming a program included in the media file; and transmitting synchronization information including the mapped media presentation time together with the media file, wherein the media presentation time is provided for a first PU in a segment of the transmitted media file.
- PTS Presentation Time Stamp
- MPT Media Presentation Time
- a method of receiving an MPEG2 TS-based media file includes: receiving the media file with synchronization information; extracting an MPT of a PU for at least one program element forming a program included in the media file; and playing back the media file using the extracted media presentation time, wherein the MPT is mapped from a PTS of a first PU in a segment of the received media file.
- an apparatus for transmitting an MPEG2 TS-based media file includes: a controller mapping a PTS of a first PU among PUs included in each segment or a PTS of a first PU after a PTS discontinuity occurs to an MPT for synchronization with other media for a program element forming a program included in the media file; and a network interface unit transmitting synchronization information including the mapped MPT together with the media file.
- an apparatus for receiving an MPEG2 TS-based media file includes: a network interface unit receiving the media file with synchronization information; a decoding unit decoding the received media file; and a synchronization unit synchronizing the decoded media file with other media using an MPT included in the synchronization information, wherein the MPT is mapped from a PTS of a first PU in a segment of the received media file or a PTS of a first PU after a PTS discontinuity occurs.
- MPEG-2 TS-based media file is transported with synchronization information including media presentation time, it is possible to randomly access to the media file and to easily synchronize the media file to other media installed inside a receiving apparatus or attached externally and play the media file.
- the media file can be played back in synchronization with the media presentation time even in a case where the PTS discontinuity occurs.
- PTS Presentation Time Stamp
- FIG. 1 is a view illustrating a configuration of an ISO Base Media file format.
- FIG. 2 is a view illustrating a structure of a MPEG2 Packetized Elementary Stream (PES) packet.
- PES Packetized Elementary Stream
- FIG. 3 is a view illustrating a structure of MPEG TS.
- FIG. 4 is a graph illustrating a Presentation Time Stamp (PTS) of MPEG2 Transport Stream (TS) vs. a Media Presentation Time (MPT).
- PTS Presentation Time Stamp
- MPT Media Presentation Time
- FIG. 5 is a view illustrating an embodiment of a method of presenting any Presentation Unit (PU) included in MPEG2 TS.
- PU Presentation Unit
- FIG. 6 is a block diagram illustrating a schematic configuration of an apparatus for transmitting a media file according to an embodiment.
- FIG. 7 is a flowchart illustrating a method of transmitting a media file according to an embodiment.
- FIG. 8 is a view illustrating an embodiment of a method of presenting any PU included in MPEG2 TS in synchronization with a media play timeline.
- FIGS. 9 and 10 are views illustrating configurations of a Segment Index Box (SIB) including synchronization information.
- SIB Segment Index Box
- FIGS. 11 and 12 are views illustrating configurations of an eXtensible Markup Language (XML) file including synchronization information.
- XML eXtensible Markup Language
- FIG. 13 is view of illustrating an embodiment of a media file structure transmitted from a transmitting apparatus according to an embodiment.
- FIG. 14 is a block diagram illustrating a configuration of an apparatus for receiving a media file according to an embodiment.
- FIG. 15 is a flow chart illustrating a method of receiving a media file according to an embodiment.
- FIG. 16 is a view illustrating an embodiment of a segment list created by using a Media Presentation Description (MPD).
- MPD Media Presentation Description
- FIG. 17 is a view illustrating an embodiment of a method of playing an MPEG2 TS-based media file in synchronization with a 3GPP media file.
- FIGS. 18 and 19 are views illustrating embodiments of a method of implementing a media file using an MPD of MPEG Dynamic Adaptive Streaming over HTTP (DASH).
- DASH Dynamic Adaptive Streaming over HTTP
- FIG. 20 is a block diagram illustrating a configuration of an IPTV receiver according to a first embodiment of the present invention.
- FIG. 21 is a block diagram illustrating a configuration of an IPTV receiver according to a second embodiment of the present invention.
- FIG. 1 illustrates a configuration of an ISO Base Media file format.
- a “fragmented file” format may be adopted.
- the fragmented file may be formed by dividing media tracks into plural fragments, each of which includes a movie fragment (moof) box and a media data (mdat) box.
- the mdat box after the moof box includes fragment media data and this combined moof-mdat pair may form a single fragment.
- Timing fields may be provided a Sample Table (ST) box such as a decoding time-to-sample (stts) box or a composition time-to-sample (ctts) box.
- ST Sample Table
- stts decoding time-to-sample
- ctts composition time-to-sample
- a Movie Fragment Random Access (mfra) file positioned at the end of a media file may be used to provide a reference to fragments including random access points.
- the mfra box uses an absolute byte offset from the beginning of a file referring to the fragments and is not fragmented. Therefore it can provide timing information of a fragmented media file in a stored state like a recording process but is not possibly used in streaming application.
- the ISO Base Media file format as shown in FIG. 1 does not provide timing information for streaming the fragmented tracks.
- a segment is a unit that a media file is transmitted and received and a configuration thereof may be variously defined.
- the segment includes one or more movie fragments and each segment has approximately identical duration in one period.
- AHS 3GPP Adaptive HTTP Streaming
- HAS HTTP Adaptive Streaming
- HTTP Streaming of MPEG media the segment includes one or more movie fragments and each segment has approximately identical duration in one period.
- 3GPP AHS the entire structure of the segment is defined and it is allowed to use the segment as a fine download unit or to download a portion of the segment using a byte range.
- a 3GPP file format may include a Segment Index (sidx) box which provides a track play start time for the tracks in the fragment in order to provide timing information on lost tracks and the play times may be recovered from decoding times.
- sidx Segment Index
- a “3GPP AHS” client may fetch a segment using an approximately created timeline and reconstruct a timing of each track using the sidx box. Once the entire segment or a portion of the segment is downloaded, the timing information may be included in the file in order to be used for playing afterward.
- the segment may be defined as one fragment, and the one fragment may include one track.
- the fragment is a basic unit for downloading and the term ‘segment’ is not used.
- the “MS SS” may provide a FragmentTime attribute for each fragment within a ManifestResponse file and the information may be provided in a StreamFragmentElement field.
- a timing of the track fragment can be sufficiently represented by a single FragmentTime.
- the timing information may be requested for each track in order to enable a start of a synchronized presentation after a random access and a switch between replaceable presentations.
- Storing of the “MS SS” media file is performed in a file in which the timing information is stored, which is performed by storing the media file in the MS PIFF or including the timing information in a mfra box.
- FIG. 2 illustrates an MPEG2 PES packet structure
- FIG. 3 illustrates a MPEG2 TS structure.
- an MPEG2 video stream compressed by an MPEG encoder may be packetized into a PES packet having a predetermined size.
- the PES packet includes a PES header and a PES packet data field and the PES header may include an optional PES HEADER.
- the optional PES HEADER includes an optional field 510 and the optional field 510 includes a PTS field and a DTS field.
- the PTS field may include Presentation Time Stamp (PTS) information and the DTS field may have Decoding Time Stamp (DTS) information.
- PTS Presentation Time Stamp
- DTS Decoding Time Stamp
- Synchronization between a video stream and an audio stream may be achieved with Time Stamp, which is time information representing when to present each decoding unit called an access unit of the video and audio streams.
- a PTS in the audio stream and a PTS and a DTS in video stream may be provided as the time information.
- the video and audio streams may be synchronized with each other by comparing the foregoing time stamp and a System Time Clock (STC) which is a time reference of a decoder and determining whether to present them.
- STC System Time Clock
- a transport packet of MPEG2 TS includes 4 byte header and a following 184 byte payload and has total 188 byte size regardless of an intended transport mechanism.
- the payload may include a PES packet, a Program Association Table (PAT), a Program Map Table (PMT) or the like, and the header may include a plurality of fields such as a sync byte and an adaptation field.
- PES Program Association Table
- PMT Program Map Table
- the adaptation field may include an optional field and the optional field may include a Program Clock Reference (PCR) field to which PCR information of time reference information is written.
- PCR Program Clock Reference
- the PCR is inserted to a header of MPEG2 TS for synchronization between an encoder and a decoder as a reference clock for PTS and STS.
- the PTS is used for synchronized-presentation of element streams included in one program, but does not have any relationship of synchronization with an external clock such as the Universal Time Coordinated (UTC).
- UTC Universal Time Coordinated
- the foregoing discontinuity of the PTS may be delivered by the PCR and a discontinuity indicator as shown in FIG. 3 immediately before the corresponding event occurs.
- the PTS may be used for synchronized presentation of one program, but is not sufficient to be used in presentation in synchronization with a general presentation timeline, for example, a media presentation timeline used in a HTTP streaming service.
- initial synchronization, synchronization after a random access and synchronization after the PTS discontinuity are needed to be considered.
- FIG. 4 illustrates a graph of relationship between PTS of MPEG2 TS and Media Presentation Time (MPT).
- a solid line indicates a media presentation timeline with which the presentations in all the media are synchronized
- a dashed line indicates a PTS timeline representing a local presentation time within a program.
- the MPT indicates times within the corresponding file as defined in the ISO Base Media file format and starts from ‘0’.
- the PTS may be represented as a time-scaled PTS timeline that a value measured in 90 KHz clock is represented as a 33 bit size.
- the media presentation timeline may start from presentation start offset provided in a manifest file such as Media Presentation Description (MPD).
- MPD Media Presentation Description
- the PTS timeline may start from any point and have discontinuity d in some cases.
- MPEG TS allows direct fragmentation within a range where the fragment has a 188 byte size.
- MPEG2 TS-based “Apple HTTP Streaming” or “OIPF HAS” uses the same scheme as the one that the MPEG2 TS is segmented into segments in a predetermined period.
- the segment period is not generally precise for each media component.
- the media component of a program included in the MPEG2 TS divides the MPEG2 TS into segments in a rough period using a PCR or a PTS as a reference, so the segment periods may vary among tracks.
- “Apple HTTP Streaming” and “OIPF HAS” do not designate or refer to additional ways representing timing information for MPEG2 TS, downloads MPEG TS using a rough timeline and sets a start time of the segment to the rough timeline.
- MS SS is not related to delivery of MEG2 TS media files, it may be difficult to directly apply “MS SS” to the transporting of the MPEG2 TS.
- MS SS allows only one track for each fragment, so “MS SS” is not suitable to a transport stream having a plurality of multiplexed programs.
- the corresponding file After downloading a portion of or the entire file, the corresponding file is required to be stored as an MPEG TS file and the timing information is require to be valid for future playback. Accordingly perfect synchronization with the media presentation timeline in the MPEG2 TS file transport using “MS SS” may be hard to achieve.
- FIG. 5 illustrates an embodiment of a method of presenting any presentation unit (PU) included in MPEG2 TS.
- MPEG2 TS packets respectively have a fixed 188 byte size and may be transported in a segment unit including one or more packets, for example, 3 MPEG2 TS packets.
- a client side for example, a receiving apparatus may try to receive any segment, synchronize any presentation unit PU y with the media presentation line and present PU y .
- a PTS discontinuity may occur in a previous segment of a segment including the presentation unit PU y to be presented and thus the corresponding segment may not be received to be valid due to the PTS discontinuity in a random access case.
- MPEG2 TS When the MPEG2 TS is not synchronized with the media presentation time, internally stored or streamed MPECG2 TS-based media file and other media, for example, ISO Bas Media file format media or another MPEG TS-based media are not synchronized with each other and the media file may not be played back.
- MPECG2 TS-based media file and other media for example, ISO Bas Media file format media or another MPEG TS-based media are not synchronized with each other and the media file may not be played back.
- MPEG2 TS media file may be synchronized with other media by signaling to map a timeline of MPEG2 TS-based media file to the media presentation time.
- a transmitting apparatus may map a PTS of a PU for at least one program element forming a program included in the media file to the media presentation time and synchronization information (abbreviated as sync information) including the mapped media presentation time may be transported with the mapped media presentation time.
- sync information synchronization information
- FIG. 6 illustrates a block diagram of a schematic configuration of a media file transmitting apparatus according to an embodiment.
- the illustrated transmitting apparatus may include a controller 110 and a network interface unit 120 .
- the controller 110 may generate sync information to be provided to a client side, for example, a receiving apparatus, by mapping a PTS of a PU for a program element forming a program included in an MPEG2 TS-based media file to a media presentation time.
- the network interface unit 120 may transmits sync information including the mapped media presentation time to the receiving apparatus with the MPEG2 TS-based media file.
- the sync information transmitted to the client side may be provided to a first PU of each segment included in any one program element.
- the sync information may be used to recover a presentation time for all the PUs from the first PU of a specific segment to a next PTS discontinuity occurring point included in MPEG2 TS.
- the sync information may be provided for a first PU after the PTS discontinuity occur in the segment, and the sync information may be used to recover the presentation time for all the PUs from the PTS discontinuity occurring point to a next PTS discontinuity occurring point of a next PTS discontinuity included in an MPEG2 TS.
- the sync information occurring point may be used to recover a presentation time for all the PUs from the PTS discontinuity occurring point to an end of the corresponding segment included in the MPEG2 TS.
- the MPEG2 TS-based playback may include a plurality of program elements belonging to one program and the sync information for any one of the program elements may be transported to the client side.
- the rest of the program elements may be synchronized with one program using a synchronization mechanism provided by the MPEG2 TS.
- FIG. 7 is a flowchart illustrating a method of transmitting a media file according to an embodiment. The illustrated method of transmitting a media file will be described in connection with the apparatus for transmitting a media file according to the embodiment illustrated in FIG. 6 .
- the controller 100 of the transmitting apparatus 100 selects one of program elements included in a media file to be transmitted (operation S 200 ).
- the controller 100 may select video tracks among video tracks and audio tracks forming the program as a reference program element.
- controller 110 maps a PTS of a PU for the selected program element to a media presentation time (operation S 210 ).
- media presentation times for example, MPT 1 to MPT n for a first PU of a segment among PUs belonging to a specific program element may be transported to the client side as sync information.
- PTS 1 denotes a PTS of PU y
- PTS k denotes a PTS of a presentation unit PU N positioned before the PU y
- MPT n denotes a media presentation time for the PU x .
- timescale PTS denotes a clock of the PTS, for example 90 kHz
- timescale denotes a clock of the MPT.
- the controller 110 of the transmitting apparatus 100 recovers PTS s , namely, PTS 1 and PTS k for PU y and PU x respectively, and calculates a next media presentation time MPT 0 by linear interpolation like equation 1 using the recovered PTS 1 and PTS k .
- a media presentation time for example, MPT m for the first PU may transported to the client side as sync information.
- the media presentation time for the specific presentation unit may be calculated by the foregoing linear interpolation with reference to the equation 1.
- a method of mapping the PTS of the presentation unit to a media presentation time is described for an exemplary case where PU x and PU y are included in one program element with reference to FIG. 8 .
- a media presentation time for a specific presentation unit may be calculated using the foregoing method.
- the network interface unit 120 transports sync information including the calculated media presentation times to the client side with the media file (operation S 220 ).
- a specific presentation unit may be searched for on the basis of the PTS for a given media presentation time, which may be used in random access or stream switching.
- PTS 1 which is a PTS value for any media presentation time MPT 0 may be calculated as the following equation 2.
- PTS 1 ((MPT 0 ⁇ MPT n )/timescale)*timescale PTS +PTS k (2)
- the transmitting apparatus 100 transmits media presentation times (MP 1 , MP m and MPT n ) for the first PU or a first PU after the occurrence of the PTS discontinuity among the PUs included in the segment as shown in FIG. 8 , or obtains a media presentation time MPT 0 for another PU according to the foregoing mapping method and transports the obtained MPT to the client side as the sync information.
- media presentation times MP 1 , MP m and MPT n
- the sync information may be structuralized as the following way.
- any one of a plurality of the program elements included in a program of an MPEG2 TS media file is selected as a reference program element and a media presentation time for a first PU of the selected reference program is provided to the client side.
- a duration from the first PU of the selected reference program to a time point of a next PTS discontinuity occurrence (or to an end of the corresponding segment when the PTS discontinuity does not occur) is provided to the client.
- the duration from any one PTS discontinuity to a next PTS discontinuity (or to an end of the corresponding segment when the PTS discontinuity does not occur) is provided to the client side.
- a PTS of a PU of the reference program element immediately after the PTS discontinuity occurrence may be also provided to the client side.
- a media presentation time for all the PUs included in the segment may be recovered by using the sync information created as the forgoing.
- a duration before and after the PTS discontinuity may be more divided into smaller durations and this additional signaling may be used by the client side for a random access or the like.
- syntax of the sync information may enable signaling of presentation times for a plurality of program elements.
- an MPEG2 TS program when an MPEG2 TS program includes two audio tracks L 1 and L 2 of different languages, the client may want to playback only L 1 track between the audio tracks.
- signaling for the media presentation time is provided only to L 2 , the client needs to extract PTS information from L 1 to recover presentation times.
- signaling for media presentation time is provided to both L 1 and L 2 , the client does not need to extract separate PTS information to recover the presentation times.
- the media file is roughly segmented according to a reference track period as required from a transport protocol and boundaries of the segments may be adjusted to MPEG2 TS packets.
- the foregoing sync information may be represented as a binary coded format file or an XML file.
- FIGS. 9 and 10 illustrate embodiments of a configuration of a segment index box including sync information, which is an example representing the sync information in a binary format.
- the sync information as described in relation to FIGS. 6 to 8 may be represented in a binary format on the basis of general ISO Base Media file format data types and has a similar structure to a sidx box used in 3GPP AHS.
- the sync information may be formed of an MPEG2 segment index box m 2 si , and the m 2 si box may be included one or more in a specific segment.
- the m 2 si box may provide media presentation time for PUs included in MPEG2 TS with respect to one or more program elements of MPEG2 TS.
- the segment may be subdivided into a plurality of MPEG2 TS packet groups (MTPGs) and the MTPGs may be formed of a set of adjacent MPEG2 TS packets.
- MTPGs MPEG2 TS packet groups
- At least a first sample namely, a presentation time for a PU of the reference program element may be provided in a first loop.
- a second loop may be performed on each MTPG included in a segment starting from the first MTPG.
- a presentation duration of a reference program element of each MTPG may be provided with byte offsets from a first byte of an MPEG2 TS to a corresponding MTPG in the segment.
- the presentation duration of the MTPG may be a sum of presentation durations of all the PUs of the reference program elements belong to the MTPG.
- a random access point RAP
- a presentation time offset from a presentation time for a first sample in the corresponding segment may be more provided.
- the m 2 si box may be positioned at a start portion of the segment and does not allow a hierarchical structure, so that the m 2 si box may be easily removed.
- flags may form a 24-bit integer flag and defined as follows.
- timescale-present represents whether time scale of a media presentation time exists in the corresponding box and a flag value thereof may be ‘0x000001’.
- pts-present represents whether a PTS of a first PU of a reference track is included for all the written MTPGs and a flag value thereof may be ‘0x000002’.
- timescale is a time scale of media presentation timeline.
- a value thereof may be identical for all the presentations.
- reference-PID provides packet identifiers (PIDs) of MPEG2 TS packets transporting a reference program element.
- program_element_count represents the number of program elements indexed in a next loop and may be designated 1 or a greater value.
- referenc_count represents the number of program elements indexed in a second loop and may have 1 or a greater value.
- PID represents PIDs of MPEG2 TS packets transporting program elements therein to which representation_time is provided and a PID in the corresponding loop is identical to the reference_PID.
- presentation_time may represent a presentation time for a first sample in a program element identified by the PID expressed in the timescale.
- Reference_offset may represent a byte distance from a first byte of an MPEG2 TS segment to a first byte of an MPTG referred by repeating of the corresponding loop.
- MTPG_duration provides a presentation period of a reference program element for the referred MPTG and may be represented in a time scale of a corresponding track.
- PTS may represent an MPEG2 TS PTS of a first PU of a reference program element of the corresponding MPTG.
- contains_RAP may be set to ‘1’ in order to represent that the reference program element in the referred MPTG includes a RAP.
- discontinuity may be set to ‘1’ in order to represent that the corresponding MPTG starts from discontinuity.
- RAP_delta_time may provide a presentation time offset of a RAP in order to correspond to a presentation time for a first sample of the reference program element in the segment, when contains_RAP is set to ‘1’.
- the RAP_delta_time may be maintained as ‘0’.
- FIGS. 11 and 12 illustrate embodiments of a configuration of an XML file including sync information.
- an XML file may include a media presentation time for PUs of one or more program elements of an MPEG2 TS and the segment may be subdivided into adjacent MTPGs.
- a presentation time for at least a first sample, namely, a presentation unit of the reference program element is provided in a first element.
- a second element is for each MTPG included in a segment starting from a first MTPG.
- the second element may provide presentation periods of a reference program element for each MTPG of the segment.
- the presentation period of the MTPG may be a sum of presentation periods of all the PUs of a reference program element belonging to the corresponding group.
- the second element may be provided information on whether the reference program element of the MTPG includes a RAP.
- a presentation time offset may be further provided to a presentation time for a first sample in the corresponding segment.
- FIG. 11 illustrates a schema of sync information having a configuration as shown in FIG. 10 and the detailed description for the illustrated schema is the same as described above in relation to FIGS. 9 and 10 . Therefore description thereof will be omitted.
- FIG. 13 illustrates an embodiment of a file structure of a media file transmitted from a transmitting apparatus according to an embodiment.
- an m 2 si box described in relation to FIGS. 9 and 10 may be attached before an MPEG2 TS divided into a segment unit.
- the MPEG2 TS data may be selectively encapsulated in a mdat box to make an access and demarcation easy.
- the m 2 si box may be attached before the mdat box.
- the media file format may start from a segment type (styp) box, the styp box may have a boxtype set as ‘styp’, a brand of ‘etsf’ and a minor_version set to ‘1’. Except this, the styp box may have the same syntax structure as a file type box (ftyp) defined in the ISO/IEC 14496-12 ISO Base Media file format.
- a receiving apparatus 300 may receive a media file with sync information, extract a media presentation time of a PU from the received sync information and then playback the media file using the extracted media presentation time.
- a configuration of the sync information received by the receiving apparatus 300 may be the same as described above in relation to FIGS. 6 to 13 . Therefore detailed description thereof will be omitted.
- FIG. 14 illustrates a block diagram of a configuration of a media file receiving apparatus according to an embodiment.
- the illustrated receiving apparatus 300 may include a network interface unit 310 , a decoding unit 320 and a synchronization unit 330 .
- the network interface unit 310 of the receiving apparatus 300 receives a media file with the sync information and the decoding unit 320 may decode the received media file and output the decoded media file.
- the synchronization unit 330 may synchronize the decoded media file with a media presentation timeline using a media presentation time included in the sync information.
- the network interface unit 310 may receive sync information having the configuration as described above in relation to FIGS. 6 to 13 with an MPEG2 TS from the transmitting unit 100 .
- the decoding unit 320 may include a second decoder 322 for decoding the MPEG2 TS and the second decoder 322 may be a general MPEG2 TS decoder.
- the second decoder 322 may receive to decode the MPEG2 TS from the network interface unit 310 and output the decoded PUs and PTSs thereof.
- the synchronization unit 330 may identify a presentation unit to which a media presentation time is provided through the received sync information. For this, the network interface unit 310 recovers the PTSs of the PUs of which presentation times are provided and transfers the recovered PTSs to the synchronization unit 330 or receives the presentation time and PTSs of the corresponding unit from the transmitting apparatus 100 .
- elementary stream time recovery may be used.
- the decoding unit 320 may further include a first decoder 321 to decode another media file received through the network interface unit 310 , the first decoder 321 may be a decoder for recovering another format media file such as in a 3GPP file format or an MPEG2 TS decoder which is the same as the second decoder 322 .
- the synchronization unit 330 may synchronize an MPEG2 TS-based media file output from the second decoder 322 with another media file output from the first decoder 321 using decoding information transmitted from the transmitting apparatus 100 , for example, media presentation times for the PUs.
- the receiving apparatus 300 may playback the MPEG2 TS-based media file synchronized by the synchronization unit 330 and another media file together.
- the decoded MPEG2 TS may be stored in a memory (not shown) prepared in the receiving apparatus 300 .
- the sync information included in the received m 2 si box or xml file may be also stored in the receiving apparatus 300 .
- the stored sync information may be used later for synchronized presentation with other media components.
- the byte offsets for the PTS discontinuities may be stored as attributes in the m 2 si box or the xml file including the sync information.
- the byte offsets provided in the synchronization box or file should correspond to a start of the MPEG2 TS file.
- FIG. 16 illustrates an embodiment of a segment list created by using an MPD.
- the client performing HTTP streaming may access to the MPD and create the segment list as shown in FIG. 16 from the MPD at a specific client-local time “NOW”.
- NOW may represent a current value of a clock in the reference client when an MPD instance is formed from the MPD.
- HTTP streaming clients should synchronize their own clocks with a precise universal time.
- FIG. 17 illustrates an embodiment of a method of synchronizing an MPEG2 TS-based media file with a 3GPP media file and playing back.
- a specific PU PU y included in an MPEG2 TS media file may be synchronized with a sample corresponding to the 3GPP media file and played back using the sync information as described above in relation to FIGS. 6 to 12 .
- the m 2 si box including MPD information, 3GPP file format information for segment and sync information may be given as follows.
- SegmentInfo duration PT2S (Segment boundary shown is for segment # 32)
- the UTC used for rendering PU y which is a PU played back according to the mapping method as described above in relation to FIGS. 5 to 8 may be calculated as the following equation 4.
- PU y (((PTS 1 ⁇ PTS k )/time_scale_ext)+presentation_time/time_scale)+availabilityStartTime (4)
- the UTC “2010-05-30T 09:31:02.07Z” used for rendering PU y may be obtained by substituting PTS 1 with PTS “43249500” of PU y , PTS k with PTS “43245300” of a PU to which media presentation time “29977120” is provided, time_scale_ext with clock “90000” of PTS, time_scale with “48000” given in 3GPP file format information, and availabilityStartTime with “2010-05-30T 09:30:10Z” given in the MDP information.
- FIGS. 18 and 19 illustrate embodiments of a method of implementing a media file using a MPD of MPEG Dynamic Adaptive Streaming over HTTP (DASH).
- DASH Dynamic Adaptive Streaming over HTTP
- an index segment which is sync information as described above in relation to FIGS. 6 to 17 may exist separately in MPEG-DASH.
- the media file may be implemented with syntax having the same media file structure as the described above in relation to FIG. 13 and the index data may be stored after the MPEG2 TS media file. Therefore the number of stored files may be greatly reduced.
- the client may confirm how many bytes correspond to the index data and how many tailings correspond to the media data in the media file.
- the client may recognize that a Universal Resource Locator (URL) for a media segment coincides with a URL of the index segment corresponding to the media segment. Accordingly the client may divide the segments in combination with a request.
- URL Universal Resource Locator
- a general client may not use this optimization and may obtain the same effect through two ttp requests.
- Another client which is not able to use the index data may neglect the same and just fetch the media data.
- a method of receiving and processing the MPEG2 TS-based media file as described in relation to FIGS. 1 to 19 may be performed by an IPTV receiver according to an embodiment.
- the IPTV receiver may receive a media file transmitted from a transmitting side, for example, a media server, and a format of the media file may have a configuration as described in relation to FIGS. 6 to 19 .
- the IPTV receiver may synchronize an MPEG2 TS with other media and play back using sync information received from the transmitting side.
- the IPTV receiver may perform a method of receiving a media file as described in relation to FIGS. 14 and 15 .
- FIG. 20 illustrates a block diagram of an IPTV receiver configuration according to an embodiment.
- An IPTV receiver may include a separate tuner for receiving terrestrial broadcasting, cable broadcasting or satellite broadcasting.
- the IPTV receiver having a configuration of receiving an IPTV service provided over an IP network will be mainly described for convenience.
- An ITF indicates Open IPTV Terminal Function and may suggest a receiver formed of function modules necessary for supporting the IPTV service.
- the IPTV receiver may include a network interface unit 502 , a TCP/IP manager 502 , a service delivery manager 504 , a PVR manager 505 , a demultiplexer (Demux) 508 , a data decoder 507 , an audio/video decoder 512 , an display and OSD unit 515 , an application manager 513 and 514 , a service information (SI) and Metadata database (DB) 511 , a service discovery manager 509 , a service control manager 503 , a metadata manager 510 and a content DB 506 .
- SI service information
- DB Metadata database
- the network interface unit 501 receives and transmits packets from and to a network. Namely, the network interface unit 501 receives a service or media contents from a service provider over the network.
- the TCP/IP manager 502 manages packet delivery from a source to a destination with respect to packets received by the IPTV receiver and packets transmitted from the IPTV receiver. In addition, the TCP/IP manager 502 classifies the received packets so as to correspond to a proper protocol, and outputs the classified packets to the service discovery manager 509 , the service control manager 503 and the metadata manager 510 .
- the service delivery manager 504 controls the received service data.
- the service delivery manager 504 may use a RTP/RTCP.
- the service delivery manager 504 parses the received data packet according to the RTP and transports the parsed data packet to the Demux 508 or stores the same to the content DB 506 according to a control of the service manager 514 .
- the service delivery manager 504 feeds the network reception information back to a server side providing the service using the RTCP.
- the Demux 508 demultiplexes the received packets into audio, video and Program Specific Information data and transports the demultiplexed data respectively to the audio/video decoder 512 and the data decoder 507 .
- the data decoder 507 decodes, for example, service information such as PSI. Namely, the data decoder 507 receives a PSI section, a program and Service Information Protocol (PSIP) section or a DVB-SI section demultiplexed by the Demux 508 and decodes the received sections.
- PSIP program and Service Information Protocol
- DVB-SI DVB-SI section demultiplexed by the Demux 508 and decodes the received sections.
- the data decoder 507 decodes the received sections to make a database for service information and the database for service information is stored in the SI & Metadata DB 511 .
- the audio/video decoder 512 decodes video data and audio data received from the Demux 508 .
- the video data and audio data decoded in the audio/video decoder 512 are provided to a user through the display and OSD unit 515 .
- the application manager 513 and 514 manages the entire state of the IPTV receiver, provides a user interface and manages other managers.
- the application manager includes a user interface manager 513 and a service manager 514 .
- the user interface manager 513 provides a Graphic User Interface (GUI) for a user using an On Screen Display (OSD), and performs receiver operations according to a key input from the user. For example, when receiving a key input for a channel selection from the user, the user interface manager 513 transports the key input signal to the service manager 514 .
- GUI Graphic User Interface
- OSD On Screen Display
- the service manager 514 controls managers related to a service such as the service delivery manager 504 , the service discovery manager 509 , the service control manager 503 and the metadata manager 510 .
- the service manager 514 makes a channel map and selects a channel using the channel map according to the key input received from the user interface manager 513 .
- the service manager 514 receives channel service information for the data decoder 507 and sets audio and video PIDs of the selected channel to the Demux 508 .
- the service discovery manager 509 provides information necessary to select a service provider which provides services. When receiving a signal for the channel selection from the service manager 514 , the service discovery manager 509 searches for a service using the information.
- the service control manager 503 selects and controls the service.
- the service control manager 503 uses the Internet Group Management Protocol (IGMP) or RTSP when the user selects a live broadcasting service which has the same scheme as the prior art, or selects and controls the service using the RTSP when the user selects a service of Video on Demand (VOD).
- IGMP Internet Group Management Protocol
- RTSP Video on Demand
- the RTP may provide a trick mode for the real time streaming.
- the service control manager 503 may initialize and manage sessions using an IP Multimedia Subsystem (IMS) through an INC gateway.
- IMS IP Multimedia Subsystem
- the metadata manager 510 manages metadata associated with the services and stores the metadata in the SI and metadata DB 511 .
- the SI and metadata DB 511 and the content DB 506 may be respectively implemented using a nonvolatile RAM (NVRAM) or a flash memory and may also be implemented in logically separated two regions on the same storage area.
- NVRAM nonvolatile RAM
- the PVR manager 505 is a module for recording and presenting live streaming contents, and may collect metadata for the recorded content and create additional information to be provided to the user such as a thumbnail image or an index.
- a function of a controller of the IPTV receiver may be implemented as divided into a plurality of modules like the TCP/IP manager 502 , the service delivery manager 504 , a PVR manager 505 , the application manager 513 and 514 , the service discovery manager 509 , the service control manager 503 and the metadata manager 510 as shown in FIG. 20 .
- the TCP/IP manager 502 may filter SD&S information using the foregoing target package information and control the network interface unit 501 to request only a payload or a segment corresponding to a specific package, for example, a package to which the IPTV receiver is subscribed, to a server and receive the same.
- the TCP/IP manager 502 may filter SD&S information received in a multicast scheme using the target package information and cause the decoder 507 to parse and process only a payload or a segment corresponding to a specific package.
- FIG. 21 illustrates a functional block diagram of a configuration of the IPTV according to another embodiment.
- a solid line indicates a data path and a dashed line indicates a control signal path.
- a cable modem or a DSL modem, etc 601 which is an interface causing the ITF to connect to an IP network in a physical level, demodulates a signal transmitted through a physical medium and recovers a digital signal.
- An Ethernet NIC 602 recovers IP data from a signal received through the physical interface.
- An IP network stack 607 processes each layer according to the IP protocol stack.
- An XML parser 609 parses an XML document from among received IP data.
- a file handler 608 processes data transported in a file type through FLUTE or the like from among the received IP data.
- An SI handler 611 processes a portion corresponding to IPTV service information from among the receive file type data and stores the processed data in a storage 612 .
- An EPG handler 610 processes a portion corresponding to IPTV EPG information from among the receive file type data and stores the processed data in the storage 612 .
- the storage 612 stores various data such as the SI data and the EPG information data and the like.
- An SI decoder 613 reads the SI data from the storage 612 , and then, analyzes and obtains channel map information.
- An EPG decoder 614 analyzes EPG data stored in the storage 612 and recovers information necessary to form an EPG.
- An ITF operation controller 615 is a main controller controlling an ITF operation of a channel change or an EPG display.
- a channel service manager 616 may perform an operation such as a channel change according to a user input.
- An application manager 617 may perform an application service such as an EPG display according to a user input.
- An MPEG-2 demultiplexer 603 extracts MPEG2 TS data from a received IP datagram and transports the extracted data to a corresponding module according to the PID.
- An MPEG2 PSI/PSIP parser 604 extracts the PID of A/V data or PSI/PSIP data including connection information on program elements, and parses the extracted data.
- An A/V decoder 605 decodes input audio and video data and delivers the decoded data to a display module 606 .
- the display module 606 displays the decoded A/V data or application.
- the invention can also be embodied as computer readable codes on a computer readable recording medium.
- the computer readable recording medium is any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage.
- the computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion. Also, functional programs, codes, and code segments for accomplishing the present invention can be easily construed by programmers skilled in the art to which the present invention pertains.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Databases & Information Systems (AREA)
- Computer Security & Cryptography (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
A method and apparatus for transmitting and receiving an MPEG-2 TS-based media file are provided. In the method of transmitting an MPEG2 Transport Stream (TS)-based media file, a Presentation Time Stamp (PTS) of a Presentation Unit (PU) is mapped to a Media Presentation Time (MPT) for synchronization with other media with respect to at least one program element forming a program included in the media file; and synchronization information including the mapped media presentation time is transmitted together with the media file, wherein the media presentation time is provided for a first PU in a segment of the transmitted media file.
Description
This application is a continuation of U.S. patent application Ser. No. 14/811,321 filed Jul. 28, 2015, which is a continuation of U.S. patent application Ser. No. 13/811,198 filed Jan. 24, 2013, now U.S. Pat. No. 9,131,274, which is a National Stage Entry of International Application No. PCT/KR2011/005307 filed on Jul. 19, 2011, and claims priority to U.S. Provisional Application Nos. 61/365,746 filed Jul. 19, 2010 and 61/453,948 filed Mar. 17, 2011, all of which are hereby incorporated by reference for all purposes as if fully set forth herein in their entireties.
The present disclosure relates to a method and apparatus for transmitting and receiving an MPEG-2 Transport Stream (TS)-based media file.
Nowadays, digital broadcasting such as terrestrial, cable or satellite broadcasting, or Digital Multimedia Broadcasting (DMB) mostly streams AV (Audio/Video) contents using MPEG-2 TS.
In addition, a multimedia service, which provides contents over an Internet Protocol (IP) network as a main transport network, is activated according to a rapid growth of the internet. The digital broadcasting is developing towards the direction of requiring more traffic like stereoscopic 3-dimensional (3D) video broadcasting, Ultra High Definition (UHD) broadcasting, 3D multi-view video broadcasting or hologram broadcasting.
However it may be less efficient to transport content of higher resolution than the prior art HDTV using MPEG-2 TS having a sequence of fixed length packets of 188 bytes over the IP network.
Embodiments provide a method and apparatus for transmitting and receiving an MPEG-2 TS-based media file for playing the same back in synchronization with another media.
In one embodiment, a method of transmitting an MPEG2 TS-based media file, the method includes: mapping a Presentation Time Stamp (PTS) of a Presentation Unit (PU) to a Media Presentation Time (MPT) for synchronization with other media with respect to at least one program element forming a program included in the media file; and transmitting synchronization information including the mapped media presentation time together with the media file, wherein the media presentation time is provided for a first PU in a segment of the transmitted media file.
In another embodiment, a method of receiving an MPEG2 TS-based media file, the method includes: receiving the media file with synchronization information; extracting an MPT of a PU for at least one program element forming a program included in the media file; and playing back the media file using the extracted media presentation time, wherein the MPT is mapped from a PTS of a first PU in a segment of the received media file.
In further another embodiment, an apparatus for transmitting an MPEG2 TS-based media file, the apparatus includes: a controller mapping a PTS of a first PU among PUs included in each segment or a PTS of a first PU after a PTS discontinuity occurs to an MPT for synchronization with other media for a program element forming a program included in the media file; and a network interface unit transmitting synchronization information including the mapped MPT together with the media file.
In still further another embodiment, an apparatus for receiving an MPEG2 TS-based media file, the apparatus includes: a network interface unit receiving the media file with synchronization information; a decoding unit decoding the received media file; and a synchronization unit synchronizing the decoded media file with other media using an MPT included in the synchronization information, wherein the MPT is mapped from a PTS of a first PU in a segment of the received media file or a PTS of a first PU after a PTS discontinuity occurs.
According to embodiments, since MPEG-2 TS-based media file is transported with synchronization information including media presentation time, it is possible to randomly access to the media file and to easily synchronize the media file to other media installed inside a receiving apparatus or attached externally and play the media file.
In addition, since media presentation time for a time point when discontinuity occurs at a Presentation Time Stamp (PTS) is provided to a receiving apparatus side, the media file can be played back in synchronization with the media presentation time even in a case where the PTS discontinuity occurs.
The details of one or more embodiments are set forth in the accompanying drawings and the description below. Other features will be apparent from the description and drawings, and from the claims.
Reference will now be made in detail to the embodiments of the present disclosure, examples of which are illustrated in the accompanying drawings.
A method of and apparatus for transmitting and receiving MPEG-2 Transport Stream (TS)-based media file according to an embodiment will be described in detail with reference to the accompanying drawings. The invention may, however, be embodied in many different forms and should not be construed as being limited to the embodiments set forth herein; rather, that alternate embodiments included in other retrogressive inventions or falling within the spirit and scope of the present disclosure can easily be derived through adding, altering, and changing, and will fully convey the concept of the invention to those skilled in the art.
Referring to FIG. 1 , the fragmented file may be formed by dividing media tracks into plural fragments, each of which includes a movie fragment (moof) box and a media data (mdat) box.
The mdat box after the moof box includes fragment media data and this combined moof-mdat pair may form a single fragment.
In the fragmented ISO Base Media file format, general timing fields may be provided a Sample Table (ST) box such as a decoding time-to-sample (stts) box or a composition time-to-sample (ctts) box. However since the above described timing fields exist in moov (Movie) box, they are rarely used in transporting timing information.
Meanwhile since the moov box is finely encoded, it is assumed all the previous samples are valid to a client side, for example a receiving apparatus, in order to access to a desired sample. The above described assumption is not proper to a streaming application.
A Movie Fragment Random Access (mfra) file positioned at the end of a media file may be used to provide a reference to fragments including random access points.
However the mfra box uses an absolute byte offset from the beginning of a file referring to the fragments and is not fragmented. Therefore it can provide timing information of a fragmented media file in a stored state like a recording process but is not possibly used in streaming application.
Accordingly, the ISO Base Media file format as shown in FIG. 1 does not provide timing information for streaming the fragmented tracks.
On the other hand, a segment is a unit that a media file is transmitted and received and a configuration thereof may be variously defined.
For example, in “3GPP Adaptive HTTP Streaming (AHS)”, “OIPF HTTP Adaptive Streaming (HAS)”, and “HTTP Streaming of MPEG media”, the segment includes one or more movie fragments and each segment has approximately identical duration in one period. In “3GPP AHS”, the entire structure of the segment is defined and it is allowed to use the segment as a fine download unit or to download a portion of the segment using a byte range.
A 3GPP file format may include a Segment Index (sidx) box which provides a track play start time for the tracks in the fragment in order to provide timing information on lost tracks and the play times may be recovered from decoding times.
A “3GPP AHS” client may fetch a segment using an approximately created timeline and reconstruct a timing of each track using the sidx box. Once the entire segment or a portion of the segment is downloaded, the timing information may be included in the file in order to be used for playing afterward.
In “Microsoft Smooth Streaming (MS SS)”, the segment may be defined as one fragment, and the one fragment may include one track. The fragment is a basic unit for downloading and the term ‘segment’ is not used.
The “MS SS” may provide a FragmentTime attribute for each fragment within a ManifestResponse file and the information may be provided in a StreamFragmentElement field.
In the foregoing “MS SS”, since only one track exists in the track fragment, a timing of the track fragment can be sufficiently represented by a single FragmentTime. The timing information may be requested for each track in order to enable a start of a synchronized presentation after a random access and a switch between replaceable presentations.
Storing of the “MS SS” media file is performed in a file in which the timing information is stored, which is performed by storing the media file in the MS PIFF or including the timing information in a mfra box.
Referring to FIG. 2 , an MPEG2 video stream compressed by an MPEG encoder may be packetized into a PES packet having a predetermined size.
The PES packet includes a PES header and a PES packet data field and the PES header may include an optional PES HEADER. In addition, the optional PES HEADER includes an optional field 510 and the optional field 510 includes a PTS field and a DTS field.
The PTS field may include Presentation Time Stamp (PTS) information and the DTS field may have Decoding Time Stamp (DTS) information.
Synchronization between a video stream and an audio stream may be achieved with Time Stamp, which is time information representing when to present each decoding unit called an access unit of the video and audio streams.
For example, a PTS in the audio stream and a PTS and a DTS in video stream may be provided as the time information. The video and audio streams may be synchronized with each other by comparing the foregoing time stamp and a System Time Clock (STC) which is a time reference of a decoder and determining whether to present them.
Referring to FIG. 3 , a transport packet of MPEG2 TS includes 4 byte header and a following 184 byte payload and has total 188 byte size regardless of an intended transport mechanism.
The payload may include a PES packet, a Program Association Table (PAT), a Program Map Table (PMT) or the like, and the header may include a plurality of fields such as a sync byte and an adaptation field.
The adaptation field may include an optional field and the optional field may include a Program Clock Reference (PCR) field to which PCR information of time reference information is written.
The PCR is inserted to a header of MPEG2 TS for synchronization between an encoder and a decoder as a reference clock for PTS and STS.
The PTS is used for synchronized-presentation of element streams included in one program, but does not have any relationship of synchronization with an external clock such as the Universal Time Coordinated (UTC).
In addition, continuity of the PTS is not secured, so discontinuity or reset may occur in the PTS.
For example, in an application field of MPEG2 TS such as digital video broadcasting, the foregoing discontinuity of the PTS may be delivered by the PCR and a discontinuity indicator as shown in FIG. 3 immediately before the corresponding event occurs.
The PTS may be used for synchronized presentation of one program, but is not sufficient to be used in presentation in synchronization with a general presentation timeline, for example, a media presentation timeline used in a HTTP streaming service.
In order to use the PTS in synchronization with the general presentation timeline, initial synchronization, synchronization after a random access and synchronization after the PTS discontinuity are needed to be considered.
In FIG. 4 , a solid line indicates a media presentation timeline with which the presentations in all the media are synchronized, and a dashed line indicates a PTS timeline representing a local presentation time within a program.
The MPT indicates times within the corresponding file as defined in the ISO Base Media file format and starts from ‘0’.
The PTS may be represented as a time-scaled PTS timeline that a value measured in 90 KHz clock is represented as a 33 bit size.
Referring to FIG. 4 , the media presentation timeline may start from presentation start offset provided in a manifest file such as Media Presentation Description (MPD).
The PTS timeline may start from any point and have discontinuity d in some cases.
Other than ISO Base Media, MPEG TS allows direct fragmentation within a range where the fragment has a 188 byte size. MPEG2 TS-based “Apple HTTP Streaming” or “OIPF HAS” uses the same scheme as the one that the MPEG2 TS is segmented into segments in a predetermined period.
In this case, the segment period is not generally precise for each media component. The media component of a program included in the MPEG2 TS divides the MPEG2 TS into segments in a rough period using a PCR or a PTS as a reference, so the segment periods may vary among tracks.
“Apple HTTP Streaming” and “OIPF HAS” do not designate or refer to additional ways representing timing information for MPEG2 TS, downloads MPEG TS using a rough timeline and sets a start time of the segment to the rough timeline.
Since “MS SS” is not related to delivery of MEG2 TS media files, it may be difficult to directly apply “MS SS” to the transporting of the MPEG2 TS. In addition, “MS SS” allows only one track for each fragment, so “MS SS” is not suitable to a transport stream having a plurality of multiplexed programs.
After downloading a portion of or the entire file, the corresponding file is required to be stored as an MPEG TS file and the timing information is require to be valid for future playback. Accordingly perfect synchronization with the media presentation timeline in the MPEG2 TS file transport using “MS SS” may be hard to achieve.
Referring to FIG. 5 , MPEG2 TS packets respectively have a fixed 188 byte size and may be transported in a segment unit including one or more packets, for example, 3 MPEG2 TS packets.
Meanwhile, a client side, for example, a receiving apparatus may try to receive any segment, synchronize any presentation unit PUy with the media presentation line and present PUy.
At this time, a PTS discontinuity may occur in a previous segment of a segment including the presentation unit PUy to be presented and thus the corresponding segment may not be received to be valid due to the PTS discontinuity in a random access case.
That is, in a MPEG2 TS-based media file, precise media presentation time information for a PU is not provided for a random access or a continuous access in a segment. This may cause more limitation in the PTS discontinuity case.
When the MPEG2 TS is not synchronized with the media presentation time, internally stored or streamed MPECG2 TS-based media file and other media, for example, ISO Bas Media file format media or another MPEG TS-based media are not synchronized with each other and the media file may not be played back.
According to an embodiment, MPEG2 TS media file may be synchronized with other media by signaling to map a timeline of MPEG2 TS-based media file to the media presentation time.
For example, a transmitting apparatus according to an embodiment may map a PTS of a PU for at least one program element forming a program included in the media file to the media presentation time and synchronization information (abbreviated as sync information) including the mapped media presentation time may be transported with the mapped media presentation time.
Referring to FIG. 6 , the controller 110 may generate sync information to be provided to a client side, for example, a receiving apparatus, by mapping a PTS of a PU for a program element forming a program included in an MPEG2 TS-based media file to a media presentation time.
The network interface unit 120 may transmits sync information including the mapped media presentation time to the receiving apparatus with the MPEG2 TS-based media file.
According to an embodiment, the sync information transmitted to the client side may be provided to a first PU of each segment included in any one program element.
In this case, the sync information may be used to recover a presentation time for all the PUs from the first PU of a specific segment to a next PTS discontinuity occurring point included in MPEG2 TS.
In addition, the sync information may be provided for a first PU after the PTS discontinuity occur in the segment, and the sync information may be used to recover the presentation time for all the PUs from the PTS discontinuity occurring point to a next PTS discontinuity occurring point of a next PTS discontinuity included in an MPEG2 TS.
Meanwhile, the PTS discontinuity does not occur any more, the sync information occurring point may be used to recover a presentation time for all the PUs from the PTS discontinuity occurring point to an end of the corresponding segment included in the MPEG2 TS.
The MPEG2 TS-based playback may include a plurality of program elements belonging to one program and the sync information for any one of the program elements may be transported to the client side.
That is, when any one program element is synchronized with the media presentation time, the rest of the program elements may be synchronized with one program using a synchronization mechanism provided by the MPEG2 TS.
Hereinafter a method of transmitting a media file according to embodiments will be described with reference to FIGS. 7 to 13 .
Referring to FIG. 7 , the controller 100 of the transmitting apparatus 100 selects one of program elements included in a media file to be transmitted (operation S200).
For example, the controller 100 may select video tracks among video tracks and audio tracks forming the program as a reference program element.
Then the controller 110 maps a PTS of a PU for the selected program element to a media presentation time (operation S210).
Referring to FIG. 8 , media presentation times, for example, MPT1 to MPTn for a first PU of a segment among PUs belonging to a specific program element may be transported to the client side as sync information.
Meanwhile, when a specific presentation unit PUy is to be presented, a media presentation time MPT0 for PUy may be calculated by mapping a PTS of the PU as the following equation 1
MPTc=((PTS1−PTSk)/timescalePTS)*timescale+MPTn (1)
MPTc=((PTS1−PTSk)/timescalePTS)*timescale+MPTn (1)
In equation 1, PTS1 denotes a PTS of PUy, PTSk denotes a PTS of a presentation unit PUN positioned before the PUy, and MPTn denotes a media presentation time for the PUx.
timescalePTS denotes a clock of the PTS, for example 90 kHz, and timescale denotes a clock of the MPT.
For example, the controller 110 of the transmitting apparatus 100 recovers PTSs, namely, PTS1 and PTSk for PUy and PUx respectively, and calculates a next media presentation time MPT0 by linear interpolation like equation 1 using the recovered PTS1 and PTSk.
As shown in FIG. 8 , after the occurrence of the PTS discontinuity in the segment, a media presentation time, for example, MPTm for the first PU may transported to the client side as sync information.
Accordingly, even in a case where the discontinuity occurs in a specific segment, the media presentation time for the specific presentation unit may be calculated by the foregoing linear interpolation with reference to the equation 1.
As the above, a method of mapping the PTS of the presentation unit to a media presentation time is described for an exemplary case where PUx and PUy are included in one program element with reference to FIG. 8 . However in a case where the PUx and PUy may be respectively included to different program elements, a media presentation time for a specific presentation unit may be calculated using the foregoing method.
The network interface unit 120 transports sync information including the calculated media presentation times to the client side with the media file (operation S220).
According to the foregoing mapping scheme, a specific presentation unit may be searched for on the basis of the PTS for a given media presentation time, which may be used in random access or stream switching.
For example, PTS1 which is a PTS value for any media presentation time MPT0 may be calculated as the following equation 2.
PTS1=((MPT0−MPTn)/timescale)*timescalePTS+PTSk (2)
PTS1=((MPT0−MPTn)/timescale)*timescalePTS+PTSk (2)
When the media presentation time is represented by a second unit, not by a clock, the equation 2 may be simplified as the following equation 3
PTS1=(MPTo−MPTn)*timescalePTS+PTSk (3)
PTS1=(MPTo−MPTn)*timescalePTS+PTSk (3)
According to an embodiment of the present invention, the transmitting apparatus 100 transmits media presentation times (MP1, MPm and MPTn) for the first PU or a first PU after the occurrence of the PTS discontinuity among the PUs included in the segment as shown in FIG. 8 , or obtains a media presentation time MPT0 for another PU according to the foregoing mapping method and transports the obtained MPT to the client side as the sync information.
Hereinafter a configuration of sync information transported to the client side will be described in detail with reference to FIGS. 9 to 12 .
The sync information may be structuralized as the following way.
At first, any one of a plurality of the program elements included in a program of an MPEG2 TS media file is selected as a reference program element and a media presentation time for a first PU of the selected reference program is provided to the client side.
In order to handle the PTS discontinuity included in a segment, a duration from the first PU of the selected reference program to a time point of a next PTS discontinuity occurrence (or to an end of the corresponding segment when the PTS discontinuity does not occur) is provided to the client.
When the PTS discontinuity occurs twice or more, the duration from any one PTS discontinuity to a next PTS discontinuity (or to an end of the corresponding segment when the PTS discontinuity does not occur) is provided to the client side.
In addition, a PTS of a PU of the reference program element immediately after the PTS discontinuity occurrence may be also provided to the client side.
A media presentation time for all the PUs included in the segment may be recovered by using the sync information created as the forgoing.
However, in addition to signaled information for synchronized presentation as the foregoing, a duration before and after the PTS discontinuity may be more divided into smaller durations and this additional signaling may be used by the client side for a random access or the like.
As described above, although a signaling of a presentation time for any one program element in a program may be necessary and sufficient to synchronize any program element with a media presentation timeline, syntax of the sync information may enable signaling of presentation times for a plurality of program elements.
This may be advantageous for the client to try to selectively present program elements.
For example, when an MPEG2 TS program includes two audio tracks L1 and L2 of different languages, the client may want to playback only L1 track between the audio tracks.
At this time, if signaling for the media presentation time is provided only to L2, the client needs to extract PTS information from L1 to recover presentation times. However signaling for media presentation time is provided to both L1 and L2, the client does not need to extract separate PTS information to recover the presentation times.
In order to segment a transmitted MPEG2 TS media file, the media file is roughly segmented according to a reference track period as required from a transport protocol and boundaries of the segments may be adjusted to MPEG2 TS packets.
According to an embodiment, the foregoing sync information may be represented as a binary coded format file or an XML file.
According to an embodiment, the sync information as described in relation to FIGS. 6 to 8 may be represented in a binary format on the basis of general ISO Base Media file format data types and has a similar structure to a sidx box used in 3GPP AHS.
Referring to FIG. 9 , the sync information may be formed of an MPEG2 segment index box m2 si, and the m2 si box may be included one or more in a specific segment.
The m2 si box may provide media presentation time for PUs included in MPEG2 TS with respect to one or more program elements of MPEG2 TS.
The segment may be subdivided into a plurality of MPEG2 TS packet groups (MTPGs) and the MTPGs may be formed of a set of adjacent MPEG2 TS packets.
When one program element, for example, a video program element is selected as a reference program element, at least a first sample, namely, a presentation time for a PU of the reference program element may be provided in a first loop.
A second loop may be performed on each MTPG included in a segment starting from the first MTPG. In the second loop, a presentation duration of a reference program element of each MTPG may be provided with byte offsets from a first byte of an MPEG2 TS to a corresponding MTPG in the segment.
The presentation duration of the MTPG may be a sum of presentation durations of all the PUs of the reference program elements belong to the MTPG.
In the second loop, information on whether a random access point (RAP) is included in a reference program element of an MTPG may be provided. When the RAP is included, a presentation time offset from a presentation time for a first sample in the corresponding segment may be more provided.
Other than sidx box of 3GPP AHS, the m2 si box may be positioned at a start portion of the segment and does not allow a hierarchical structure, so that the m2 si box may be easily removed.
Referring to a syntax structure as shown in FIG. 10 , flags may form a 24-bit integer flag and defined as follows.
timescale-present represents whether time scale of a media presentation time exists in the corresponding box and a flag value thereof may be ‘0x000001’.
pts-present represents whether a PTS of a first PU of a reference track is included for all the written MTPGs and a flag value thereof may be ‘0x000002’.
timescale is a time scale of media presentation timeline. When the timescale is provided one or more in one media presentation, a value thereof may be identical for all the presentations.
reference-PID provides packet identifiers (PIDs) of MPEG2 TS packets transporting a reference program element.
program_element_count represents the number of program elements indexed in a next loop and may be designated 1 or a greater value.
referenc_count represents the number of program elements indexed in a second loop and may have 1 or a greater value.
PID represents PIDs of MPEG2 TS packets transporting program elements therein to which representation_time is provided and a PID in the corresponding loop is identical to the reference_PID.
presentation_time may represent a presentation time for a first sample in a program element identified by the PID expressed in the timescale.
Reference_offset may represent a byte distance from a first byte of an MPEG2 TS segment to a first byte of an MPTG referred by repeating of the corresponding loop.
MTPG_duration provides a presentation period of a reference program element for the referred MPTG and may be represented in a time scale of a corresponding track.
PTS may represent an MPEG2 TS PTS of a first PU of a reference program element of the corresponding MPTG.
contains_RAP may be set to ‘1’ in order to represent that the reference program element in the referred MPTG includes a RAP.
discontinuity may be set to ‘1’ in order to represent that the corresponding MPTG starts from discontinuity.
RAP_delta_time may provide a presentation time offset of a RAP in order to correspond to a presentation time for a first sample of the reference program element in the segment, when contains_RAP is set to ‘1’. When contains_RAP is set to ‘0’, the RAP_delta_time may be maintained as ‘0’.
Referring to FIG. 11 , an XML file may include a media presentation time for PUs of one or more program elements of an MPEG2 TS and the segment may be subdivided into adjacent MTPGs.
When one program element, for example, a video program element is selected as a reference program element, a presentation time for at least a first sample, namely, a presentation unit of the reference program element is provided in a first element.
In addition, a second element is for each MTPG included in a segment starting from a first MTPG. The second element may provide presentation periods of a reference program element for each MTPG of the segment.
The presentation period of the MTPG may be a sum of presentation periods of all the PUs of a reference program element belonging to the corresponding group.
The second element may be provided information on whether the reference program element of the MTPG includes a RAP. When the RAP is included, a presentation time offset may be further provided to a presentation time for a first sample in the corresponding segment.
Referring to FIG. 13 , an m2 si box described in relation to FIGS. 9 and 10 may be attached before an MPEG2 TS divided into a segment unit. The MPEG2 TS data may be selectively encapsulated in a mdat box to make an access and demarcation easy.
As illustrated in FIG. 13 , when the sync information is appended in a binary format, the m2 si box may be attached before the mdat box.
The media file format may start from a segment type (styp) box, the styp box may have a boxtype set as ‘styp’, a brand of ‘etsf’ and a minor_version set to ‘1’. Except this, the styp box may have the same syntax structure as a file type box (ftyp) defined in the ISO/IEC 14496-12 ISO Base Media file format.
Hereinafter embodiments of a media file receiving method and apparatus will be described with reference to FIGS. 14 and 15 .
A receiving apparatus 300 according to an embodiment may receive a media file with sync information, extract a media presentation time of a PU from the received sync information and then playback the media file using the extracted media presentation time.
A configuration of the sync information received by the receiving apparatus 300 may be the same as described above in relation to FIGS. 6 to 13 . Therefore detailed description thereof will be omitted.
Referring to FIG. 14 , the network interface unit 310 of the receiving apparatus 300, for example, a HTTP interface, receives a media file with the sync information and the decoding unit 320 may decode the received media file and output the decoded media file.
The synchronization unit 330 may synchronize the decoded media file with a media presentation timeline using a media presentation time included in the sync information.
For example, the network interface unit 310 may receive sync information having the configuration as described above in relation to FIGS. 6 to 13 with an MPEG2 TS from the transmitting unit 100.
The decoding unit 320 may include a second decoder 322 for decoding the MPEG2 TS and the second decoder 322 may be a general MPEG2 TS decoder.
The second decoder 322 may receive to decode the MPEG2 TS from the network interface unit 310 and output the decoded PUs and PTSs thereof.
The synchronization unit 330 may identify a presentation unit to which a media presentation time is provided through the received sync information. For this, the network interface unit 310 recovers the PTSs of the PUs of which presentation times are provided and transfers the recovered PTSs to the synchronization unit 330 or receives the presentation time and PTSs of the corresponding unit from the transmitting apparatus 100.
For recovering the PTSs, elementary stream time recovery may be used.
The decoding unit 320 may further include a first decoder 321 to decode another media file received through the network interface unit 310, the first decoder 321 may be a decoder for recovering another format media file such as in a 3GPP file format or an MPEG2 TS decoder which is the same as the second decoder 322.
The synchronization unit 330 may synchronize an MPEG2 TS-based media file output from the second decoder 322 with another media file output from the first decoder 321 using decoding information transmitted from the transmitting apparatus 100, for example, media presentation times for the PUs.
The receiving apparatus 300 may playback the MPEG2 TS-based media file synchronized by the synchronization unit 330 and another media file together.
According to another embodiment, the decoded MPEG2 TS may be stored in a memory (not shown) prepared in the receiving apparatus 300. The sync information included in the received m2 si box or xml file may be also stored in the receiving apparatus 300.
The stored sync information may be used later for synchronized presentation with other media components.
In addition, the byte offsets for the PTS discontinuities may be stored as attributes in the m2 si box or the xml file including the sync information.
In this case, since all the segments of MPEG2 TS are combined, the byte offsets provided in the synchronization box or file should correspond to a start of the MPEG2 TS file.
The client performing HTTP streaming may access to the MPD and create the segment list as shown in FIG. 16 from the MPD at a specific client-local time “NOW”.
“NOW” may represent a current value of a clock in the reference client when an MPD instance is formed from the MPD.
The client that is not synchronized with a HTTP streaming server but synchronized with the UTC may experience problems of accessing to segments due to validity. HTTP streaming clients should synchronize their own clocks with a precise universal time.
Referring to FIG. 17 , a specific PU PUy included in an MPEG2 TS media file may be synchronized with a sample corresponding to the 3GPP media file and played back using the sync information as described above in relation to FIGS. 6 to 12 .
In the embodiment as shown in FIG. 17 , the m2 si box including MPD information, 3GPP file format information for segment and sync information may be given as follows.
-
- MPD Information:
availabilityStartTime = 2010-05-30T 09:30:10Z | ||
SegmentInfo duration = PT2S (Segment boundary shown is | ||
for segment # 32) | ||
-
- 3GPP FF Information for segment:
time_scale = 48000 | ||
track_ID 1 = (Video) | ||
decode_time = 2974186 | ||
composition_offset = 1602 | ||
|
||
decode_time = 2976000 | ||
-
- m2 si (corresponding)
PID = 101 | ||
presentation_time = 2977120 | ||
In this case, the UTC used for rendering PUy which is a PU played back according to the mapping method as described above in relation to FIGS. 5 to 8 may be calculated as the following equation 4.
PUy=(((PTS1−PTSk)/time_scale_ext)+presentation_time/time_scale)+availabilityStartTime (4)
PUy=(((PTS1−PTSk)/time_scale_ext)+presentation_time/time_scale)+availabilityStartTime (4)
In equation 4, the UTC “2010-05-30T 09:31:02.07Z” used for rendering PUy may be obtained by substituting PTS1 with PTS “43249500” of PUy, PTSk with PTS “43245300” of a PU to which media presentation time “29977120” is provided, time_scale_ext with clock “90000” of PTS, time_scale with “48000” given in 3GPP file format information, and availabilityStartTime with “2010-05-30T 09:30:10Z” given in the MDP information.
As shown in FIG. 18 , an index segment which is sync information as described above in relation to FIGS. 6 to 17 may exist separately in MPEG-DASH.
The media file may be implemented with syntax having the same media file structure as the described above in relation to FIG. 13 and the index data may be stored after the MPEG2 TS media file. Therefore the number of stored files may be greatly reduced.
In addition, the client may confirm how many bytes correspond to the index data and how many tailings correspond to the media data in the media file.
In this case, the client may recognize that a Universal Resource Locator (URL) for a media segment coincides with a URL of the index segment corresponding to the media segment. Accordingly the client may divide the segments in combination with a request.
A general client may not use this optimization and may obtain the same effect through two ttp requests.
Another client which is not able to use the index data may neglect the same and just fetch the media data.
A method of receiving and processing the MPEG2 TS-based media file as described in relation to FIGS. 1 to 19 may be performed by an IPTV receiver according to an embodiment.
For this, the IPTV receiver may receive a media file transmitted from a transmitting side, for example, a media server, and a format of the media file may have a configuration as described in relation to FIGS. 6 to 19 .
The IPTV receiver may synchronize an MPEG2 TS with other media and play back using sync information received from the transmitting side.
The IPTV receiver may perform a method of receiving a media file as described in relation to FIGS. 14 and 15 .
Hereinafter, a configuration of the IPTV receiver according to an embodiment will be described in detail with reference to FIGS. 20 and 21 .
An IPTV receiver according to an embodiment may include a separate tuner for receiving terrestrial broadcasting, cable broadcasting or satellite broadcasting. However, in this disclosure, the IPTV receiver having a configuration of receiving an IPTV service provided over an IP network will be mainly described for convenience.
An ITF indicates Open IPTV Terminal Function and may suggest a receiver formed of function modules necessary for supporting the IPTV service.
The IPTV receiver may include a network interface unit 502, a TCP/IP manager 502, a service delivery manager 504, a PVR manager 505, a demultiplexer (Demux) 508, a data decoder 507, an audio/video decoder 512, an display and OSD unit 515, an application manager 513 and 514, a service information (SI) and Metadata database (DB) 511, a service discovery manager 509, a service control manager 503, a metadata manager 510 and a content DB 506.
Referring to FIG. 20 , the network interface unit 501 receives and transmits packets from and to a network. Namely, the network interface unit 501 receives a service or media contents from a service provider over the network.
The TCP/IP manager 502 manages packet delivery from a source to a destination with respect to packets received by the IPTV receiver and packets transmitted from the IPTV receiver. In addition, the TCP/IP manager 502 classifies the received packets so as to correspond to a proper protocol, and outputs the classified packets to the service discovery manager 509, the service control manager 503 and the metadata manager 510.
The service delivery manager 504 controls the received service data. For example, in a case of controlling real-time streaming, the service delivery manager 504 may use a RTP/RTCP.
When the real-time streaming data is transported using the RTP, the service delivery manager 504 parses the received data packet according to the RTP and transports the parsed data packet to the Demux 508 or stores the same to the content DB 506 according to a control of the service manager 514. In addition, the service delivery manager 504 feeds the network reception information back to a server side providing the service using the RTCP.
The Demux 508 demultiplexes the received packets into audio, video and Program Specific Information data and transports the demultiplexed data respectively to the audio/video decoder 512 and the data decoder 507.
The data decoder 507 decodes, for example, service information such as PSI. Namely, the data decoder 507 receives a PSI section, a program and Service Information Protocol (PSIP) section or a DVB-SI section demultiplexed by the Demux 508 and decodes the received sections.
The data decoder 507 decodes the received sections to make a database for service information and the database for service information is stored in the SI & Metadata DB 511.
The audio/video decoder 512 decodes video data and audio data received from the Demux 508. The video data and audio data decoded in the audio/video decoder 512 are provided to a user through the display and OSD unit 515.
The application manager 513 and 514 manages the entire state of the IPTV receiver, provides a user interface and manages other managers. For this, the application manager includes a user interface manager 513 and a service manager 514.
The user interface manager 513 provides a Graphic User Interface (GUI) for a user using an On Screen Display (OSD), and performs receiver operations according to a key input from the user. For example, when receiving a key input for a channel selection from the user, the user interface manager 513 transports the key input signal to the service manager 514.
The service manager 514 controls managers related to a service such as the service delivery manager 504, the service discovery manager 509, the service control manager 503 and the metadata manager 510.
The service manager 514 makes a channel map and selects a channel using the channel map according to the key input received from the user interface manager 513. The service manager 514 receives channel service information for the data decoder 507 and sets audio and video PIDs of the selected channel to the Demux 508.
The service discovery manager 509 provides information necessary to select a service provider which provides services. When receiving a signal for the channel selection from the service manager 514, the service discovery manager 509 searches for a service using the information.
The service control manager 503 selects and controls the service. For example, the service control manager 503 uses the Internet Group Management Protocol (IGMP) or RTSP when the user selects a live broadcasting service which has the same scheme as the prior art, or selects and controls the service using the RTSP when the user selects a service of Video on Demand (VOD).
The RTP may provide a trick mode for the real time streaming. In addition, the service control manager 503 may initialize and manage sessions using an IP Multimedia Subsystem (IMS) through an INC gateway. The above-described protocols are examples and may be replaced with other protocols.
The metadata manager 510 manages metadata associated with the services and stores the metadata in the SI and metadata DB 511.
The SI and metadata DB 511 and the content DB 506 may be respectively implemented using a nonvolatile RAM (NVRAM) or a flash memory and may also be implemented in logically separated two regions on the same storage area.
The PVR manager 505 is a module for recording and presenting live streaming contents, and may collect metadata for the recorded content and create additional information to be provided to the user such as a thumbnail image or an index.
A function of a controller of the IPTV receiver according to an embodiment may be implemented as divided into a plurality of modules like the TCP/IP manager 502, the service delivery manager 504, a PVR manager 505, the application manager 513 and 514, the service discovery manager 509, the service control manager 503 and the metadata manager 510 as shown in FIG. 20 .
For example, the TCP/IP manager 502 may filter SD&S information using the foregoing target package information and control the network interface unit 501 to request only a payload or a segment corresponding to a specific package, for example, a package to which the IPTV receiver is subscribed, to a server and receive the same.
Alternatively, the TCP/IP manager 502 may filter SD&S information received in a multicast scheme using the target package information and cause the decoder 507 to parse and process only a payload or a segment corresponding to a specific package.
A cable modem or a DSL modem, etc 601, which is an interface causing the ITF to connect to an IP network in a physical level, demodulates a signal transmitted through a physical medium and recovers a digital signal.
An Ethernet NIC 602 recovers IP data from a signal received through the physical interface. An IP network stack 607 processes each layer according to the IP protocol stack.
An XML parser 609 parses an XML document from among received IP data. A file handler 608 processes data transported in a file type through FLUTE or the like from among the received IP data.
An SI handler 611 processes a portion corresponding to IPTV service information from among the receive file type data and stores the processed data in a storage 612. An EPG handler 610 processes a portion corresponding to IPTV EPG information from among the receive file type data and stores the processed data in the storage 612.
The storage 612 stores various data such as the SI data and the EPG information data and the like.
An SI decoder 613 reads the SI data from the storage 612, and then, analyzes and obtains channel map information. An EPG decoder 614 analyzes EPG data stored in the storage 612 and recovers information necessary to form an EPG.
An ITF operation controller 615 is a main controller controlling an ITF operation of a channel change or an EPG display.
A channel service manager 616 may perform an operation such as a channel change according to a user input. An application manager 617 may perform an application service such as an EPG display according to a user input.
An MPEG-2 demultiplexer 603 extracts MPEG2 TS data from a received IP datagram and transports the extracted data to a corresponding module according to the PID.
An MPEG2 PSI/PSIP parser 604 extracts the PID of A/V data or PSI/PSIP data including connection information on program elements, and parses the extracted data.
An A/V decoder 605 decodes input audio and video data and delivers the decoded data to a display module 606. The display module 606 displays the decoded A/V data or application.
Hitherto an apparatus and method for transmitting and receiving media data according to embodiments has been described by exemplifying a case where a media file transmitted and received between the transmitting apparatus 100 and the receiving apparatus 300 includes MPEG2 TS data. However, the present disclosure is not limited thereto and may be applied to various transport stream formats besides the MPEG2 TS.
The invention can also be embodied as computer readable codes on a computer readable recording medium. The computer readable recording medium is any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage.
The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion. Also, functional programs, codes, and code segments for accomplishing the present invention can be easily construed by programmers skilled in the art to which the present invention pertains.
Although embodiments have been described with reference to a number of illustrative embodiments thereof, it should be understood that numerous other modifications and embodiments can be devised by those skilled in the art that will fall within the spirit and scope of the principles of this disclosure. More particularly, various variations and modifications are possible in the component parts and/or arrangements of the subject combination arrangement within the scope of the disclosure, the drawings and the appended claims. In addition to variations and modifications in the component parts and/or arrangements, alternative uses will also be apparent to those skilled in the art.
Claims (16)
1. A method of transmitting media segments including a media component in an MPEG-2 transport stream, the method comprising:
segmenting the media component into the media segments,
wherein each of the media segments includes a first presentation time stamp (PTS) of a first presentation unit (PU), and
wherein the first PTS of the first PU is mapped to a media presentation timeline; and
transmitting the media segments,
wherein a first media presentation time of the first PU is calculated based on differences between the first PTS of the first PU and a second PTS of a previous PU, and
wherein the first media presentation time is calculated by:
MPT1=((PTS1−PTS0)/timescalePTS)*timescale+MPT0,
MPT1=((PTS1−PTS0)/timescalePTS)*timescale+MPT0,
where MPT1 denotes the first media presentation time of the first PU, PTS1 denotes the first PTS, PTS0 denotes the second PTS, timescalePTS denotes a clock frequency of the PTS, timescale denotes a clock frequency of the media presentation timeline, and MPT0 denotes a media presentation time of the previous PU.
2. The method of claim 1 , wherein the first media presentation time is mapped to the media presentation timeline for synchronization with other media components included in a program.
3. The method of claim 1 , wherein the first media presentation time is acquired by using interpolation after a discontinuity occurred.
4. The method of claim 1 , wherein the first PTS of the first PU is adjusted for 33 bit.
5. An apparatus for transmitting media segments including a media component in an MPEG-2 transport stream, the method comprising:
a processor to segmenting the media component into the media segments,
wherein each of the media segments includes a first presentation time stamp (PTS) of a first presentation unit (PU), and
wherein the first PTS of the first PU is mapped to a media presentation timeline; and
a transmitter to transmit the media segments,
wherein a first media presentation time of the first PU is calculated based on differences between the first PTS of the first PU and a second PTS of a previous PU, and
wherein the first media presentation time is calculated by:
MPT1=((PTS1−PTS0)/timescalePTS)*timescale+MPT0,
MPT1=((PTS1−PTS0)/timescalePTS)*timescale+MPT0,
where MPT1 denotes the first media presentation time of the first PU, PTS1 denotes the first PTS, PTS0 denotes the second PTS, timescalePTS denotes a clock frequency of the PTS, timescale denotes a clock frequency of the media presentation timeline, and MPT0 denotes a media presentation time of the previous PU.
6. The apparatus of claim 5 , wherein the first media presentation time is mapped to the media presentation timeline for synchronization with other media components included in a program.
7. The apparatus of claim 5 , wherein the first media presentation time is acquired by using interpolation after a discontinuity occurred.
8. The apparatus of claim 5 , wherein the first PTS of the first PU is adjusted for 33 bit.
9. A method of receiving media segments including a media component in an MPEG-2 transport stream, the method comprising:
receiving the media segments including the segmented media component,
wherein each of the media segments includes a first presentation time stamp (PTS) of a first presentation unit (PU), and
wherein the first PTS of the first PU is mapped to a media presentation timeline; and
decoding and outputting the media component using a first media presentation time,
wherein the first media presentation time of the first PU is calculated based on differences between the first PTS of the first PU and a second PTS of a previous PU, and
wherein the first media presentation time is calculated by:
MPT1=((PTS1−PTS0)/timescalePTS)*timescale+MPT0,
MPT1=((PTS1−PTS0)/timescalePTS)*timescale+MPT0,
where MPT1 denotes the first media presentation time of the first PU, PTS1 denotes the first PTS, PTS0 denotes the second PTS, timescalePTS denotes a clock frequency of the PTS, timescale denotes a clock frequency of the media presentation timeline, and MPT0 denotes a media presentation time of the previous PU.
10. The method of claim 9 , wherein the first media presentation time is mapped to the media presentation timeline for synchronization with other media components included in a program.
11. The method of claim 9 , wherein the first media presentation time is acquired by using interpolation after a discontinuity occurred.
12. The method of claim 9 , wherein the first PTS of the first PU is adjusted for 33 bit.
13. An apparatus for receiving media segments including a media component in an MPEG-2 transport stream, the apparatus comprising:
a network interface to receive the media segments including the segmented media component,
wherein each of the media segments includes a first presentation time stamp (PTS) of a first presentation unit (PU), and
wherein the first PTS of the first PU is mapped to a media presentation timeline; and
a decoder to decode and output the media component using a first media presentation time,
wherein the first media presentation time of the first PU is calculated based on differences between the first PTS of the first PU and a second PTS of a previous PU, and
wherein the first media presentation time is calculated by:
MPT1=((PTS1−PTS0)/timescalePTS)*timescale+MPT0,
MPT1=((PTS1−PTS0)/timescalePTS)*timescale+MPT0,
where MPT1 denotes the first media presentation time of the first PU, PTS1 denotes the first PTS, PTS0 denotes the second PTS, timescalePTS denotes a clock frequency of the PTS, timescale denotes a clock frequency of the media presentation timeline, and MPT0 denotes a media presentation time of the previous PU.
14. The apparatus of claim 13 , wherein the first media presentation time is mapped to the media presentation timeline for synchronization with other media components included in a program.
15. The apparatus of claim 13 , wherein the first media presentation time is acquired by using interpolation after a discontinuity occurred.
16. The apparatus of claim 13 , wherein the first PTS of the first PU is adjusted for 33 bit.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/943,346 US9699522B2 (en) | 2010-07-19 | 2015-11-17 | Method for transceiving media files and device for transmitting/receiving using same |
US15/613,723 US10129609B2 (en) | 2010-07-19 | 2017-06-05 | Method for transceiving media files and device for transmitting/receiving using same |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US36574610P | 2010-07-19 | 2010-07-19 | |
US201161453948P | 2011-03-17 | 2011-03-17 | |
PCT/KR2011/005307 WO2012011724A2 (en) | 2010-07-19 | 2011-07-19 | Method for transceiving media files and device for transmitting/receiving using same |
US201313811198A | 2013-01-24 | 2013-01-24 | |
US14/811,321 US9699521B2 (en) | 2010-07-19 | 2015-07-28 | Method for transceiving media files and device for transmitting/receiving using same |
US14/943,346 US9699522B2 (en) | 2010-07-19 | 2015-11-17 | Method for transceiving media files and device for transmitting/receiving using same |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/811,321 Continuation US9699521B2 (en) | 2010-07-19 | 2015-07-28 | Method for transceiving media files and device for transmitting/receiving using same |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/613,723 Continuation US10129609B2 (en) | 2010-07-19 | 2017-06-05 | Method for transceiving media files and device for transmitting/receiving using same |
Publications (2)
Publication Number | Publication Date |
---|---|
US20160073178A1 US20160073178A1 (en) | 2016-03-10 |
US9699522B2 true US9699522B2 (en) | 2017-07-04 |
Family
ID=45497289
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/811,198 Active 2032-06-01 US9131274B2 (en) | 2010-07-19 | 2011-07-19 | Method for transceiving media files and device for transmitting/receiving using same |
US14/811,321 Active US9699521B2 (en) | 2010-07-19 | 2015-07-28 | Method for transceiving media files and device for transmitting/receiving using same |
US14/943,346 Active US9699522B2 (en) | 2010-07-19 | 2015-11-17 | Method for transceiving media files and device for transmitting/receiving using same |
US15/613,723 Active US10129609B2 (en) | 2010-07-19 | 2017-06-05 | Method for transceiving media files and device for transmitting/receiving using same |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/811,198 Active 2032-06-01 US9131274B2 (en) | 2010-07-19 | 2011-07-19 | Method for transceiving media files and device for transmitting/receiving using same |
US14/811,321 Active US9699521B2 (en) | 2010-07-19 | 2015-07-28 | Method for transceiving media files and device for transmitting/receiving using same |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/613,723 Active US10129609B2 (en) | 2010-07-19 | 2017-06-05 | Method for transceiving media files and device for transmitting/receiving using same |
Country Status (5)
Country | Link |
---|---|
US (4) | US9131274B2 (en) |
EP (1) | EP2597870B1 (en) |
KR (6) | KR101727050B1 (en) |
CN (2) | CN103109540B (en) |
WO (1) | WO2012011724A2 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10575270B2 (en) * | 2015-12-16 | 2020-02-25 | Sonos, Inc. | Synchronization of content between networked devices |
US10873820B2 (en) | 2016-09-29 | 2020-12-22 | Sonos, Inc. | Conditional content enhancement |
US11514099B2 (en) | 2011-09-21 | 2022-11-29 | Sonos, Inc. | Media sharing across service providers |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2597870B1 (en) | 2010-07-19 | 2019-05-22 | LG Electronics Inc. | Method for transmitting media files |
US9137555B2 (en) * | 2010-10-05 | 2015-09-15 | Telefonaktiebolaget L M Ericsson (Publ) | Client, a content creator entity and methods thereof for media streaming |
TW201246873A (en) * | 2011-02-11 | 2012-11-16 | Interdigital Patent Holdings | Method and apparatus for updating metadata cross reference to related applications |
CN107071513B (en) | 2011-03-16 | 2020-03-10 | 艾迪尔哈布股份有限公司 | Method, client and server for providing media content |
CN103688532B (en) * | 2011-07-29 | 2018-05-04 | 索尼公司 | Transmit diostribution device and method, stream transmission reception device and method, streaming system as a stream |
US9590814B2 (en) * | 2011-08-01 | 2017-03-07 | Qualcomm Incorporated | Method and apparatus for transport of dynamic adaptive streaming over HTTP (DASH) initialization segment description fragments as user service description fragments |
KR101719998B1 (en) * | 2011-12-12 | 2017-03-27 | 엘지전자 주식회사 | Device and method for receiving media content |
US9425948B2 (en) * | 2012-01-26 | 2016-08-23 | Qualcomm Incorporated | Techniques for synchronizing a clock of a wired connection when transmitted over a wireless channel |
WO2014171718A1 (en) * | 2013-04-16 | 2014-10-23 | 엘지전자 주식회사 | Broadcasting transmission device, broadcasting reception device, operating method of broadcasting transmission device and operating method of broadcasting reception device |
WO2014208972A1 (en) * | 2013-06-24 | 2014-12-31 | Samsung Electronics Co., Ltd. | Method and apparatus for converting content in multimedia system |
JP2015015706A (en) * | 2013-07-03 | 2015-01-22 | パナソニック インテレクチュアル プロパティ コーポレーション オブアメリカPanasonic Intellectual Property Corporation of America | Data transmission method, data reproduction method, data transmitter and data reproducer |
JP6498882B2 (en) * | 2013-07-22 | 2019-04-10 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America | Storage method, playback method, storage device, and playback device |
JP6510205B2 (en) * | 2013-10-11 | 2019-05-08 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America | Transmission method, reception method, transmission apparatus and reception apparatus |
US9838729B2 (en) * | 2013-11-06 | 2017-12-05 | Avago Technologies General Ip (Singapore) Pte. Ltd. | Recovering channel bonded program streams |
US20150201253A1 (en) * | 2014-01-10 | 2015-07-16 | Samsung Electronics Co., Ltd. | Methods and apparatus for universal presentation timeline alignment |
CN105917655B (en) * | 2014-01-13 | 2019-07-09 | Lg电子株式会社 | The device and method of broadcasted content are sent or received via one or more network |
KR102171652B1 (en) * | 2014-01-31 | 2020-10-29 | 소니 주식회사 | Transmission device, transmission method, reception device, and reception method |
CN106464677A (en) * | 2014-04-09 | 2017-02-22 | Lg电子株式会社 | Method and apparatus for transmitting/receiving broadcast signal |
CN104079957B (en) * | 2014-06-25 | 2017-09-01 | 广东欧珀移动通信有限公司 | A kind of method and system of multimedia equipment simultaneously operating |
US10645432B2 (en) | 2015-01-07 | 2020-05-05 | Samsung Electronics Co., Ltd. | Method and apparatus for transmitting and receiving media information in communication system |
US10085051B2 (en) * | 2015-02-13 | 2018-09-25 | Samsung Electronics Co., Ltd. | Method and apparatus for converting MMTP stream to MPEG-2TS |
CN106034262B (en) * | 2015-03-13 | 2021-01-22 | 中兴通讯股份有限公司 | Adaptive streaming media processing method and device |
WO2017022281A1 (en) * | 2015-08-06 | 2017-02-09 | 日立マクセル株式会社 | Broadcast receiving device, output video information generating method, broadcast receiving method and video recording method |
CN106470291A (en) * | 2015-08-19 | 2017-03-01 | 飞思卡尔半导体公司 | Recover in the interruption in time synchronized from audio/video decoder |
EP3968645A1 (en) * | 2015-12-11 | 2022-03-16 | InterDigital Madison Patent Holdings, SAS | Scheduling multiple-layer video segments |
TWI559753B (en) * | 2016-03-16 | 2016-11-21 | 晶睿通訊股份有限公司 | Method for transmitting a video on demand |
CN105872484B (en) * | 2016-06-04 | 2017-07-21 | 武汉诚迈科技有限公司 | A kind of abnormal guard method of monitoring video |
KR102090070B1 (en) * | 2018-10-31 | 2020-03-17 | 카테노이드 주식회사 | Streaming server, client terminal and audio/video live streaming system using the same |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20040036447A (en) | 2002-10-26 | 2004-04-30 | 한국전자통신연구원 | Apparatus and method for injection of synchronized stream data in digital broadcasting environment |
US20050122974A1 (en) * | 2003-11-25 | 2005-06-09 | Friedemann Hubler | Method and apparatus for storage or relocation of defined positions in a data stream |
KR20060039728A (en) | 2004-11-03 | 2006-05-09 | 한국전자통신연구원 | Terrestrial dmb broadcasting tranceiving system for synchronizing data service with audio/video service |
WO2006126852A1 (en) | 2005-05-26 | 2006-11-30 | Electronics And Telecommunications Research Institute | Method and apparatus for synchronizing data service with video service in digital multimedia broadcasting |
KR20060131502A (en) | 2005-06-16 | 2006-12-20 | 삼성전자주식회사 | Method for multimedia data service and apparatus for the same |
KR20070014376A (en) | 2005-07-28 | 2007-02-01 | 삼성전자주식회사 | Method for generating and reproducing multi channel for transmitting for the multi channel in digital multimedia broadcasting system |
CN101188589A (en) | 2006-11-15 | 2008-05-28 | 上海瑞高信息技术有限公司 | Synchronization adaptation method and device for broadcast data in satellite synchronization distribution network |
CN101282230A (en) | 2007-04-05 | 2008-10-08 | 中兴通讯股份有限公司 | Method for implementing broadcast data whole-network synchronization |
KR20080090356A (en) | 2007-04-04 | 2008-10-08 | 한국전자통신연구원 | Storage/playback method and apparatus for mpeg-2 transport stream based on iso base media file format |
KR20090107000A (en) | 2008-04-07 | 2009-10-12 | 엘지전자 주식회사 | Method of receiving a broadcasting signal and apparatus for receiving a broadcasting signal |
US20110239078A1 (en) * | 2006-06-09 | 2011-09-29 | Qualcomm Incorporated | Enhanced block-request streaming using cooperative parallel http and forward error correction |
US9131274B2 (en) * | 2010-07-19 | 2015-09-08 | Lg Electronics Inc. | Method for transceiving media files and device for transmitting/receiving using same |
-
2011
- 2011-07-19 EP EP11809851.6A patent/EP2597870B1/en active Active
- 2011-07-19 KR KR1020167035531A patent/KR101727050B1/en active IP Right Grant
- 2011-07-19 KR KR1020137001532A patent/KR101476934B1/en active IP Right Grant
- 2011-07-19 KR KR1020147020005A patent/KR101651137B1/en active IP Right Grant
- 2011-07-19 CN CN201180044822.4A patent/CN103109540B/en active Active
- 2011-07-19 KR KR1020167022031A patent/KR101689616B1/en active IP Right Grant
- 2011-07-19 CN CN201610186917.4A patent/CN105611320B/en active Active
- 2011-07-19 US US13/811,198 patent/US9131274B2/en active Active
- 2011-07-19 WO PCT/KR2011/005307 patent/WO2012011724A2/en active Application Filing
- 2011-07-19 KR KR1020177009516A patent/KR101759959B1/en active IP Right Grant
- 2011-07-19 KR KR1020147000387A patent/KR20140008478A/en not_active Application Discontinuation
-
2015
- 2015-07-28 US US14/811,321 patent/US9699521B2/en active Active
- 2015-11-17 US US14/943,346 patent/US9699522B2/en active Active
-
2017
- 2017-06-05 US US15/613,723 patent/US10129609B2/en active Active
Patent Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20040036447A (en) | 2002-10-26 | 2004-04-30 | 한국전자통신연구원 | Apparatus and method for injection of synchronized stream data in digital broadcasting environment |
US20040103430A1 (en) | 2002-10-26 | 2004-05-27 | Lee Yong Ju | Apparatus and method for injecting synchronized stream data in digital broadcasting environment |
US20050122974A1 (en) * | 2003-11-25 | 2005-06-09 | Friedemann Hubler | Method and apparatus for storage or relocation of defined positions in a data stream |
KR20060039728A (en) | 2004-11-03 | 2006-05-09 | 한국전자통신연구원 | Terrestrial dmb broadcasting tranceiving system for synchronizing data service with audio/video service |
WO2006126852A1 (en) | 2005-05-26 | 2006-11-30 | Electronics And Telecommunications Research Institute | Method and apparatus for synchronizing data service with video service in digital multimedia broadcasting |
KR20060131502A (en) | 2005-06-16 | 2006-12-20 | 삼성전자주식회사 | Method for multimedia data service and apparatus for the same |
KR20070014376A (en) | 2005-07-28 | 2007-02-01 | 삼성전자주식회사 | Method for generating and reproducing multi channel for transmitting for the multi channel in digital multimedia broadcasting system |
KR100724867B1 (en) | 2005-07-28 | 2007-06-04 | 삼성전자주식회사 | Method for generating and reproducing multi channel for transmitting for the multi channel in digital multimedia broadcasting system |
US20110239078A1 (en) * | 2006-06-09 | 2011-09-29 | Qualcomm Incorporated | Enhanced block-request streaming using cooperative parallel http and forward error correction |
CN101188589A (en) | 2006-11-15 | 2008-05-28 | 上海瑞高信息技术有限公司 | Synchronization adaptation method and device for broadcast data in satellite synchronization distribution network |
KR20080090356A (en) | 2007-04-04 | 2008-10-08 | 한국전자통신연구원 | Storage/playback method and apparatus for mpeg-2 transport stream based on iso base media file format |
US20100135646A1 (en) * | 2007-04-04 | 2010-06-03 | Gun Bang | Storage/playback method and apparatus for mpeg-2 transport stream based on iso base media file format |
CN101282230A (en) | 2007-04-05 | 2008-10-08 | 中兴通讯股份有限公司 | Method for implementing broadcast data whole-network synchronization |
KR20090107000A (en) | 2008-04-07 | 2009-10-12 | 엘지전자 주식회사 | Method of receiving a broadcasting signal and apparatus for receiving a broadcasting signal |
US9131274B2 (en) * | 2010-07-19 | 2015-09-08 | Lg Electronics Inc. | Method for transceiving media files and device for transmitting/receiving using same |
Non-Patent Citations (6)
Title |
---|
"Open IPTV Forum – Release 2 Specification HTTP Adaptive Streaming DRAFT V0.04.01", 3GPP DRAFT; OIPF-DR-HAS-V0_04_01-2010-04-23, 3RD GENERATION PARTNERSHIP PROJECT (3GPP), MOBILE COMPETENCE CENTRE ; 650, ROUTE DES LUCIOLES ; F-06921 SOPHIA-ANTIPOLIS CEDEX ; FRANCE, vol. SA WG4, no. Vancouver, Canada; 20100426, OIPF-DR-, 29 April 2010 (2010-04-29), Mobile Competence Centre ; 650, route des Lucioles ; F-06921 Sophia-Antipolis Cedex ; France, XP050437971 |
Guide to MPEG Fundamentals and Protocol Analysis, Mar. 1, 2007 , Mar. 1, 2007 (Mar. 1, 2007) , pp. 1-101 , XP002505551. |
Haakon Riiser et al: "Low overhead container format for adaptive streaming", Proceedings of the First Annual ACM SIGMM Conference on Multimedia Systems, MMSYSIO, Jan. 1, 2010 (Jan. 1, 2010), p. 193, XP055028059, New York, New York, USA DOI: 101145/17308361730859 ISBN: 978-1-60-558914-5. |
HJ LEE, WAQAR ZIA: "Response to CfP on HTTP Streaming of MPEG Media", 93. MPEG MEETING; 26-7-2010 - 30-7-2010; GENEVA; (MOTION PICTURE EXPERT GROUP OR ISO/IEC JTC1/SC29/WG11), no. M17898, m17898, 21 July 2010 (2010-07-21), XP030046488 |
LGE: "Response to CfP on HTTP Streaming of MPEG Media", XP-030046488, ISO/IEC JTC1/SC29/WG11, MPEG Meeting 93, No. M 17898, Geneva, Switzerland, Jul. 26-30, 2010. |
Open IPTV Forum a Release 2 Specification HTTP Adaptive Streaming Draft V0.04.01, 3GPP Draft; OIPF-DR-HAS-VO 04 01-2010-04-23, 3rd Generation Partnership Project (3GPP), Mobile Competence Centre ; 650, Route Des Lucioles ; F-06921 Sophia-Antipolis Cedex; France, vol. SA WG4, No. Vancouver, Canada; Apr. 26, 2010, Apr. 29, 2010 (Apr. 29, 2010), XP050437971. |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11514099B2 (en) | 2011-09-21 | 2022-11-29 | Sonos, Inc. | Media sharing across service providers |
US10575270B2 (en) * | 2015-12-16 | 2020-02-25 | Sonos, Inc. | Synchronization of content between networked devices |
US10880848B2 (en) * | 2015-12-16 | 2020-12-29 | Sonos, Inc. | Synchronization of content between networked devices |
US11323974B2 (en) * | 2015-12-16 | 2022-05-03 | Sonos, Inc. | Synchronization of content between networked devices |
US10873820B2 (en) | 2016-09-29 | 2020-12-22 | Sonos, Inc. | Conditional content enhancement |
US11337018B2 (en) | 2016-09-29 | 2022-05-17 | Sonos, Inc. | Conditional content enhancement |
US11546710B2 (en) | 2016-09-29 | 2023-01-03 | Sonos, Inc. | Conditional content enhancement |
US11902752B2 (en) | 2016-09-29 | 2024-02-13 | Sonos, Inc. | Conditional content enhancement |
Also Published As
Publication number | Publication date |
---|---|
KR101689616B1 (en) | 2016-12-26 |
KR20170042811A (en) | 2017-04-19 |
CN105611320B (en) | 2018-10-19 |
KR101727050B1 (en) | 2017-04-14 |
CN105611320A (en) | 2016-05-25 |
WO2012011724A2 (en) | 2012-01-26 |
CN103109540A (en) | 2013-05-15 |
KR20140008478A (en) | 2014-01-21 |
US9699521B2 (en) | 2017-07-04 |
EP2597870B1 (en) | 2019-05-22 |
KR101651137B1 (en) | 2016-08-25 |
US10129609B2 (en) | 2018-11-13 |
KR101759959B1 (en) | 2017-07-20 |
US20130125187A1 (en) | 2013-05-16 |
KR20160099736A (en) | 2016-08-22 |
US20170272840A1 (en) | 2017-09-21 |
US20160029068A1 (en) | 2016-01-28 |
KR20140093763A (en) | 2014-07-28 |
KR101476934B1 (en) | 2014-12-30 |
EP2597870A2 (en) | 2013-05-29 |
KR20130050953A (en) | 2013-05-16 |
CN103109540B (en) | 2016-04-27 |
KR20160150111A (en) | 2016-12-28 |
WO2012011724A3 (en) | 2012-05-03 |
US9131274B2 (en) | 2015-09-08 |
EP2597870A4 (en) | 2014-01-22 |
US20160073178A1 (en) | 2016-03-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10129609B2 (en) | Method for transceiving media files and device for transmitting/receiving using same | |
US10009660B2 (en) | Media content transceiving method and transceiving apparatus using same | |
US9781188B2 (en) | Method for transreceiving media content and device for transreceiving using same | |
US9426506B2 (en) | Apparatuses for providing and receiving augmented broadcasting service in hybrid broadcasting environment | |
GB2499539A (en) | Method for transreceiving media content and device for transreceiving using same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: LG ELECTRONICS INC., KOREA, REPUBLIC OF Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KIM, KYUNGHO;LEE, HYEONJAE;ZIA, WAQAR;REEL/FRAME:037059/0226 Effective date: 20121228 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |