ce5d03ad4dc311abad52226d8041d603c9f36af5
[fanfix.git] / docs / images.md

Terminal Emulator Multimedia Standard - Proposed Design

Version: 1

Purpose

Multiple standards exist to incorporate image data in text-based terminals and terminal emulators. Few standards have wide adoption despite frequent user requests for these features and hardware support for several of the standards.

A group including developers of several widely-used terminal emulators has been working on defining the needs and limitations for a standard that can be implemented in current-gen terminal emulators. The discussion has been primarily captured here: https://gitlab.freedesktop.org/terminal-wg/specifications/issues/12

This document collects many of the reported desires and practical constraints of that discussion into a proposed standard that encompasses three independent new features:

  1. A method to transfer multimedia data for immediate display within the screen cell grid (“Direct Multimedia”).

  2. A method to transfer multimedia data to a terminal-managed cache, and later display that data within the screen cell grid (“Cached Multimedia”).

  3. A method to assign cell data to different layers with options for both layer and cell transparency (“Layers”).

A terminal may implement any combination of these features independently of each other. If all features are supported, then all of the design goals outlined in this document can be met.

The same mechanisms that can put raster-based images on the screen are also readily generalizable to other media types such as vector-based images and animations. This document is thus a “multimedia” proposal rather than a “simple images” proposal.

Acknowledgements

This proposal has been informed from the following prior work:

Design Goals - Core

The core (“must-have”) design goals are:

Design Goals - Secondary

The secondary (“nice-to-have”) design goals are listed below. These might not all be possible, but will kept in mind:

Out Of Scope

The following items are out of scope:

Definitions

Terminal - The hardware, or a program that simulates hardware, comprising a keyboard, screen, and mouse.

Application - A program that utilizes the terminal for its input/output with the user.

Multiplexer - A special case of an application that simulates one or more “inner” terminals for other applications to use, and composes these inner terminals into a combined screen to emit to one or more “outer” terminals that obtain input/output from the user. Multiplexers are thus both applications and terminals.

X - The column coordinate of a cell. This standard is 1-based (like ECMA-48): the left-most column of the screen is numbered 1.

Y - The row coordinate of a cell. This standard is 1-based (like ECMA-48): the top-most row of the screen is numbered 1.

Z - The layer that text or multimedia is placed on. This proposal uses a right-hand coordinate system with (X, Y, Z) = (1, 1, 1) defined as the top-left corner on the default layer; positive Z projects “away” from the user and “into” or “behind” the screen. Rendering the Cells on the screen must produce the same result as painter’s algorithm (see “Layers - Rendering” section below).

Cell - A fixed-width-and-height rectangle on the screen. The cells of the screen are arranged in a grid of X columns and Y rows. A Cell has dimensions of cellWidth and cellHeight pixels. Every Cell has a coordinate of (X, Y) (or (X, Y, Z) when the terminal supports the layers feature).

Tile - One or more contiguous Cells with data to be displayed. The data can be text or image data, but not both. A Tile has width of 1, 2, or more, and a coordinate of (X, Y, Z) that is the same as its left-most (first) Cell’s (X, Y, Z). In practice, Tiles are typically one Cell wide for ASCII and Latin language glyphs, and two Cells wide for “fullwidth” glyphs as used in Asian langauges, emojis, and symbols. This standard does not preclude Tiles from encompassing entire grapheme clusters. Note that ECMA-48 / ANSI X3.64 operations are performed against Tiles, not Cells: if a 2-Cell-wide Tile is deleted via backspace, then the cursor will decrement on screen by two columns.

Layer - A screen-sized grid of Cells that have the same Z coordinate. Layers are drawn to the screen in descending Z order. Layers may have optional additional attributes such as transparency. Layer support is an orthogonal (independent) option to multimedia support. It is acceptable for terminals to support multimedia without layers and vice versa.

All Features - Detection

Applications can detect support for these features using Primary Device Attributes (DA) and DECID (ESC Z, or 0x9A).

Terminals that support this standard will repond with additional parameter(s): “224” for direct multimedia, “225” for cached multimedia, and “226” for layers. A recap of the parameters xterm supports is listed below, with these new feature responses included:

VT220 (and higher) Response Description
1 132-columns
2 Printer
3 ReGIS graphics
4 Sixel graphics
6 Selective erase
8 User-defined keys
9 National Replacement Character sets
1 5 Technical characters
1 6 Locator port
1 7 Terminal state interrogation
1 8 User windows
2 1 Horizontal scrolling
2 2 ANSI color, e.g., VT525
2 8 Rectangular editing
2 9 ANSI text locator (i.e., DEC Locator mode)
2 2 4 Direct Multimedia Version 1
2 2 5 Cached Multimedia Version 1
2 2 6 Layers

Direct Multimedia - Summary

Direct Multimedia - Required Support For Existing Sequences

A terminal with direct multimedia feature must support the following defined xterm sequences:

Sequence Description
CSI 16 t Responds with CSI 6 ; cellHeight ; cellWidth t
CSI 18 t Responds with CSI 8 ; rows ; columns t

Direct Multimedia - New Sequences

Direct Multimedia - Error Handling

Direct Multimedia - Cursor Position

Direct Multimedia - Wire Format

Direct Multimedia - Examples

Cached Multimedia - Summary

Pixel data that has scrolled off the displayed screen and into the scrollback buffer is required to be persistent even if the cache entry containing that image data has been evicted by the terminal or removed by the application.

Cached Multimedia - Required Support For Existing Sequences

A terminal with cached multimedia feature must support the following defined xterm sequences:

Sequence Description
CSI 16 t Responds with CSI 6 ; cellHeight ; cellWidth t
CSI 18 t Responds with CSI 8 ; rows ; columns t

Cached Multimedia - New Sequences

Cached Multimedia - Error Handling

Cached Multimedia - Cursor Position

Cached Multimedia - Scrollback

Cached Multimedia - Wire Format

Cached Multimedia - Examples

Layers - Summary

Layers introduce the concept of a layer “Z” coordinate to the existing rows (“Y”) by columns (“X”) grid. Put another way, the two-dimensional grid of columns-by-rows becomes a three-dimensional cube of columns-by-rows-by-layers. For this document, the column, row, and layer coordinates are referred to as X, Y, and Z. This cartesian coordinate system is right-handed, with the Z axis pointing “away” from the user “into” the screen.

An application treats the Z coordinate exactly as it does X and Y (rows and columns) coordinates:

New sequences are provided to set and query Z, Y, X, to set and query the screen cube size, and control visibility of Cells in-front-of other Cells.

Operations that act on more than one Cell are defined such to act on all layers simultaneously by default.

Layers - Number of Layers

A terminal is required to provide between 1 and a finite number of layers.

The number of layers may be different between the primary and alternate screens.

An application may request that the terminal allocate additional layers. The terminal is free to honor or ignore such requests as it sees fit.

The scrollback buffer is permitted, and recommended, to contain only a “flattened” single layer.

Layers - Terminal State

The terminal maintains a complex state at all times. This state includes variables such as cursor position, foreground/background color, attributes to apply to the next displayed character, and so on. The layers feature adds more variables to the state, and these variables are required to be stored with DECSC (ESC 7) and restored with DECRC (ESC 8). The new variables are listed below:

Mnemonic Description Default value
Z Cursor position Z 1
MSL Manipulate single layer off / disabled
TFT Text foreground transparent false
TBT Text background transparent false

Layers - Required Support For Existing Sequences

A terminal with layers feature must support the standard VT100/VT102 sequences defined in their respective manuals.

Layers - New Sequences

A terminal with layer feature must support the following new sequences:

Sequence Command Description
CSI ? z ; y ; x H CUPZ Move cursor to (x, y, z)
CSI ? z ; y ; x H SLA Set layer alpha
CSI ? 3 0 0 1 h DECSET 3001 Enable Manupulate Single Layer (MSL)
CSI ? 3 0 0 1 l DECRST 3001 Disable Manupulate Single Layer (MSL)
CSI ? l ; h ; w t RSZCUBE Resize cube to (layers, height, width)

Default parameters and ranges are listed below:

Command Position / Variable Default Value Minumum Maximum
CUPZ 1 / z 1 1 # layers
CUPZ 2 / y 1 1 # rows
CUPZ 3 / x 1 1 # columns
SLA 1 / alpha 255 0 255
RSZCUBE 1 / l 1 1 varies
RSZCUBE 2 / h 80 1 varies
RSZCUBE 3 / w 24 1 varies

The terminal must also support the following new queries:

Query Response Description
CSI ? 1 0 0 n CSI ? z ; y ; x n Report cursor Z, Y, X position
CSI ? 1 8 t CSI ? 8 ; l ; h ; w t Report the text area cube layers, height, width

The terminal must support the following new Set Graphics Rendition (SGR) character attributes commands:

SGR Parameter Description
230 Set text foreground color to transparent
239 Set text foreground color to solid (opaque)
240 Set text background color to transparent
249 Set text background color to solid (opaque)

Layers - Error Handling

No additional error reporting is provided for layer feature.

Layers - Rendering

A terminal with layer feature will display its Cells such that the screen will appear as if it was rendered in the manner of the pseudo-code below:

``` for each layer Z, in descending order from maxZ to minZ:

for each row Y, in ascending order from minY to maxY:

for each column X, in ascending order from minX to maxX:

  if tile at (X, Y, Z) background color is solid:
    draw rectangle of background color with layer alpha

  if tile at (X, Y, Z) foreground color is solid:
    if tile at (X, Y, Z) is glyph:
      draw glyph with foreground color with layer alpha
    else
      draw pixel data of tile as red/green/blue/alpha pixels with
         layer alpha

  advance X by tile width
next column

advance Y by 1

next row

decrease Z by 1 next layer ```

A terminal is free to optimize its rendering as it sees fit, so long as the final screen output looks equivalent to the above method.

Layers - Integration With Existing Sequences

Sequences that insert characters/lines, delete characters/lines, or modify larger regions are changed to act upon multiple layers as defined below. By default, MSL (Modify All Layers) is off/unset, and Z is 1, so if the application never changes MSL or Z then these sequences will produce the same visible output as a terminal without layer support.

A terminal is not required to support all of these sequences; however, for those sequences it does support, if it supports the layers feature then the sequences must behave as shown below:

Sequence Command Additional behavior
BS (0x08) Backspace Only current layer affected if MSL=on
DEL (0x7F) Delete Only current layer affected if MSL=on
IND (0x84) Index Only current layer affected if MSL=on
RI (0x8D Reverse Index Only current layer affected if MSL=on
ESC # 3 DECDHL Cells on all layers always affected
ESC # 4 DECDHL Cells on all layers always affected
ESC # 5 DECSWL Cells on all layers always affected
ESC # 6 DECDWL Cells on all layers always affected
ESC # 8 DECALN All layers > 1 cleared; Z, MSL, TFT, TBT reset to default
ESC 7 DECSC Also store Z, MSL, TFT, TBT
ESC 8 DECRC Also restore Z, MSL, TFT, TBT
ESC c RIS All layers > 1 cleared; Z, MSL, TFT, TBT reset to default
CSI @ ICH Only current layer affected if MSL=on
CSI J ED Only current layer affected if MSL=on
CSI K EL Only current layer affected if MSL=on
CSI ? K DECSEL Only current layer affected if MSL=on
CSI L IL Only current layer affected if MSL=on
CSI M DL Only current layer affected if MSL=on
CSI X ECH Only current layer affected if MSL=on
CSI M DL Only current layer affected if MSL=on
CSI P DCH Only current layer affected if MSL=on
CSI R DECSTBM Cells on all layers always affected
CSI $ t DECARA Only current layer affected if MSL=on
CSI $ v DECCRA Only current layer affected if MSL=on
CSI x DECSACE Cells on all layers always affected
CSI $ x DECFRA Only current layer affected if MSL=on
CSI $ z DECERA Only current layer affected if MSL=on

The VT52 sub-mode commands:

Sequence Command Additional behavior
ESC J ED Only current layer affected if MSL=on
ESC K EL Only current layer affected if MSL=on

Layers - Use With Multiplexers

Layers are inteded to provide a means for multiplexers to pass on the job of multimedia support to the “outer” or host terminal. The proposed mechanics of that is outlined in the pseudo-code below:

``` for each inner terminal in descending order from maxZ to minZ:

emit CUPZ(inner terminal Z, inner terminal Y, inner terminal X)

draw inner terminal text with standard VT100/VT102/xterm sequences

for each multimedia sequence emitted by the inner terminal: emit CUP(inner terminal Y, inner terminal X) emit multimedia sequences to outer terminal next multimedia sequence

decrease Z by 1 next inner terminal ```

The method above may not be effective for complex multi-terminal screen layouts, but is hoped to work well for many simple cases.

Layers - Examples

References