Transcoding is the direct analog-to-analog or digital-to-digital conversion of one encoding to another,1 such as for movie data files or audio files. This is usually done in cases where a target device (or workflow) does not support the format or has limited storage capacity that mandates a reduced file size,1 or to convert incompatible or obsolete data to a better-supported or modern format. In the analog video world, transcoding is the process of converting PAL to SECAM or to NTSC or vice-versa. Transcoding can be performed just while files are being searched, as well as for presentation. For example, Cineon and DPX files have been widely used as a common format for digital cinema, but the data size of a two-hour movie is about 8 terabytes (TB).1 That large size can increase the cost and difficulty of handling movie files. However, transcoding into a JPEG2000 lossless format has better compression performance than other lossless coding technologies, and in many cases, JPEG2000 can compress images to half-size.1
Transcoding is commonly a lossy process, introducing generation loss; however, transcoding can be lossless if the input is losslessly compressed and the output is either losslessly compressed or uncompressed.1 The process of lossy-to-lossy transcoding introduces varying degrees of generation loss. In other cases, the transcoding of lossy to lossless or uncompressed is technically a lossless conversion because no information is lost, however the process is irreversible and is more suitably known as destructive.
In contrast to a conversion, the prefix "trans" emphasizes a conversion from a source to a different destination.
The most popular definition of transcoding refers to a two-step process in which the original data/file is decoded to an intermediate uncompressed format (e.g., PCM for audio or YUV for video), which is then encoded into the target format.
One may also re-encode data in the same format, for a number of reasons:
- If one wishes to edit data in a compressed format (for instance, perform image editing on a JPEG image), one will generally decode it, edit it, then re-encode it. This re-encoding causes digital generation loss; thus if one wishes to edit a file repeatedly, one should only decode it once, and make all edits on that copy, rather than repeatedly re-encoding it. Similarly, if encoding to a lossy format is required, it should be deferred until the data is finalised, e.g. after mastering.
- Lower bitrate
- Transrating is a process similar to transcoding in which files are coded to a lower bitrate without changing video formats; this can include sample rate conversion, but may use an identical sampling rate with higher compression. This allows one to fit given media into smaller storage space (for instance, fitting a DVD onto a Video CD), or over a lower bandwidth channel.
- Image scaling
- Changing the picture size of video is known as transsizing, and is used if the output resolution differs from the resolution of the media. On a powerful enough device, image scaling can be done on playback, but it can also be done by re-encoding, particularly as part of transrating (such as a downsampled image requiring a lower bitrate).
One can also use formats with bitrate peeling, that allow one to easily lower the bitrate without re-encoding, but quality is often lower than a re-encode. For example, in Vorbis bitrate peeling as of 2008, the quality is inferior to re-encoding.
The key drawback of transcoding in lossy formats is decreased quality. Compression artifacts are cumulative, so transcoding causes a progressive loss of quality with each successive generation, known as digital generation loss. For this reason, transcoding is generally discouraged unless unavoidable.
For users wanting to be able to re-encode audio into any format, and for digital audio editing, it is best to retain a master copy in a lossless format (such as FLAC, ALAC, TTA, WavPack, and others) that take around half the storage space needed when compared to original uncompressed PCM formats (such as WAV, and AIFF), as lossless formats usually have the added benefit of having meta data options, which are either completely missing or very limited in PCM formats. These lossless formats can be transcoded to PCM formats or transcoded directly from one lossless format to another lossless format, without any loss in quality. They can be transcoded into a lossy format, but these copies will then not be able to be transcoded into another format of any kind (PCM, lossless, or lossy) without a subsequent loss of quality.
For image editing users are advised to capture or save images in a raw or uncompressed format, and then edit a copy of that master version, only converting to lossy formats if smaller file sized images are needed for final distribution. As with audio, transcoding from lossy format to another format of any type will result in a loss of quality.
For video editing, (for video converting), images are normally compressed directly during the recording process due to the huge file sizes that would be created if they were not, and because the huge storage demands being too cumbersome for the user otherwise. However, the amount of compression used at the recording stage can be highly variable, and is dependent on a number of factors, including the quality of images being recorded (e.g. analog or digital, standard def. or high def., etc.), and type of equipment available to the user, which is often related to budget constraints – as highest quality digital video equipment, and storage space, may be expensive. Effectively this means that any transcoding will involve some cumulative image loss, hence the best practicable solution is that the original format of recording is deemed the master copy, and subsequent finished versions (that are often in another format and/or at a much smaller file size) are edited from copies of that master.
Although transcoding can be found in many areas of content adaptation, it is commonly used in the area of mobile phone content adaptation. In this case, transcoding is a must, due to the diversity of mobile devices and their capabilities. This diversity requires an intermediate state of content adaptation in order to make sure that the source content will adequately function on the target device to which it is sent.
One of the most popular technologies in which transcoding is used is the Multimedia Messaging Service (MMS), which is the technology used to send or receive messages with media (image, sound, text and video) between mobile phones. For example, when a camera phone is used to take a digital picture, a high-quality image of usually at least 640x480 pixels is created. When sending the image to another phone, this high resolution image might be transcoded to a lower resolution image with fewer colors in order to better fit the target device's screen size and color limitations. This size and color reduction improves the user experience on the target device, and is sometimes the only way for content to be sent between different mobile devices.
Transcoding is extensively used by home theatre PC software to reduce the usage of disk space by video files. The most common operation in this application is the transcoding of MPEG-2 files to the MPEG-4 or H.264 format.
Real-time transcoding in a many-to-many way (any input format to any output format) is becoming a necessity to provide true search capability for any multimedia content on any mobile device, with over 500 million videos on the web and a plethora of mobile devices.
- "Advancements in Compression and Transcoding: 2008 and Beyond", Society of Motion Picture and Television Engineers (SMPTE), 2008, webpage: SMPTE-spm.
- Federal Standard 1037C
- List of Portable Multimedia Software
- Huifang Sun, Xuemin Chen, and Tihao Chiang, Digital Video Transcoding for Transmission and Storage, New York, CRC Press, 2005.
- Comparison of video converters
- Comparison of programming languages (basic instructions) § Type conversions